PREFACE





This textbook is meant to serve a first course in modern physics, including 
relativity, quantum mechanics, and their applications. Such a course often follows 
the standard introductory course in calculus-based classical physics. The course 
addresses two different audiences: (1) Physics majors, who will later take a 
more rigorous course in quantum mechanics, find an introductory modern course 
helpful in providing background for the rigors of their imminent coursework 
in classical mechanics, thermodynamics, and electromagnetism. (2) Nonmajors, 
who may take no additional physics class, find an increasing need for concepts 
from modern physics in their disciplines—a classical introductory course is not 
sufficient background for chemists, computer scientists, nuclear and electrical 
engineers, or molecular biologists. 

Necessary prerequisites for undertaking the text include any standard calculus- 
based course covering mechanics, electromagnetism, thermal physics, and optics. 
Calculus is used extensively, but no previous knowledge of differential equations, 
complex variables, or partial derivatives is assumed (although some familiarity 
with these topics would be helpful). 

Chapters 1-8 constitute the core of the text. They cover special relativity and
quantum theory through atomic structure. At that point the reader may continue
with Chapters 9-11 (molecules, quantum statistics, and solids) or branch to
Chapters 12-14 (nuclei and particles). The final chapter covers cosmology and
can be considered the capstone of modern physics as it brings together topics from 
relativity (special and general) as well as from nearly all of the previous material 
covered in the text. 

The unifying theme of the text is the empirical basis of modern physics. 
Experimental tests of derived properties are discussed throughout. These include 
the latest tests of special and general relativity as well as studies of wave-particle 
duality for photons and material particles. Applications of basic phenomena are 
extensively presented, and data from the literature are used not only to illustrate 
those phenomena but to offer insight into how “real” physics is done. Students 
using the text have the opportunity to study how laboratory results and the analysis 
based on quantum theory go hand-in-hand to illuminate such diverse topics as 
Bose-Einstein condensation, heat capacities of solids, paramagnetism, the cosmic 
microwave background radiation, X-ray spectra, dilute mixtures of ³He in ⁴He,
and molecular spectroscopy of the interstellar medium. 

This third edition offers many changes from the previous edition. Most of the 
chapters have undergone considerable or complete rewriting. New topics have 
been introduced and others have been rearranged. More experimental results are 
presented and recent discoveries are highlighted, such as the WMAP microwave 
background data and Bose-Einstein condensation. End-of-chapter problem sets 
now include problems organized according to chapter section, which offer the 
student an opportunity to gain familiarity with a particular topic, as well as general 
problems, which often require the student to apply a broader array of concepts or 
techniques. The number of worked examples in the chapters and the number of 
end-of-chapter questions and problems have each increased by about 15% from 
the previous edition. The range of abilities required to solve the problems has been 
broadened, so that this edition includes both more straightforward problems that
build confidence and more difficult problems that will challenge students.
Each chapter now includes a brief summary of the important points. Some of 
the end-of-chapter problems are available for assignment using the WebAssign 
program (www.webassign.net). 

A new development in physics teaching since the appearance of the 2nd edition
of this text has been the availability of a large and robust body of literature from 
physics education research (PER). My own teaching style has been profoundly 
influenced by PER findings, and in preparing this new edition I have tried 
to incorporate PER results wherever possible. One of the major themes that 
has emerged from PER in the past decade or two is that students can often 
learn successful algorithms for solving problems while lacking a fundamental 
understanding of the underlying concepts. Many approaches to addressing this 
problem are based on pre-class conceptual exercises and in-class individual or 
group activities that help students to reason through diverse problems that can’t be 
resolved by plugging numbers into an equation. It is absolutely essential to devote 
class time to these exercises and to follow through with exam questions that 
require similar analysis and articulation of the conceptual reasoning. More details 
regarding the application of PER to the teaching of modern physics, including 
references to articles from the PER literature, are included in the Instructor’s 
Manual for this text, which can be found at www.wiley.com/college/krane. The 
Instructor’s Manual also includes examples of conceptual questions for in-class 
discussion or exams that have been developed and class tested through the support 
of a Course, Curriculum and Laboratory Improvement grant from the National 
Science Foundation. 

Specific changes to the chapters include the following: 


Chapter 1: The sections on Units and Dimensions and on Significant Figures 
have been removed. In their place, a more detailed review of applications 
of classical energy and momentum conservation is offered. The need for 
special relativity is briefly established with a discussion of the failures of 
the classical concepts of space and time, and the need for quantum theory is 
previewed in the failure of Maxwell-Boltzmann particle statistics to account 
for the heat capacities of diatomic gases. 

Chapter 2: Spacetime diagrams have been introduced to help illustrate relation- 
ships in the twin paradox. The application of the relativistic conservation 
laws to decay and collision processes is now given a separate section to
help students learn to apply those laws. The section on tests of special 
relativity has been updated to include recent results. 

Chapter 3: The section on thermal radiation has been rewritten, and more 
detailed derivations of the Rayleigh-Jeans and Planck formulas are now 
given. 

Chapter 4: New experimental results for particle diffraction and interference 
are discussed. The sections on the classical uncertainty relationships and on 
wave packet construction and motion have been rewritten. 

Chapter 5: To help students understand the processes involved in applying
boundary conditions to solutions of the Schrödinger equation, a new section
on wave boundary conditions has been added. A new introductory section
on particle confinement introduces energy quantization and helps to build
the connection between the wave function and the uncertainty relationships.
Time dependence of the wave function is introduced more explicitly at an
earlier stage in the formalism. Graphic illustrations for step and barrier
problems now show the real and imaginary parts of the wave function as
well as its squared magnitude.

Chapter 6: The derivation of the Thomson model scattering angle has been 
modified, and the section on deficiencies of the Bohr model has been 
rewritten. 

Chapter 7: To ease the entry into the 3-dimensional Schrödinger analysis of
the hydrogen atom in spherical coordinates, a new section on the one- 
dimensional hydrogen atom has been added. Angular momentum concepts 
relating to the hydrogen atom are now introduced before the full solutions 
to the wave equation. 

Chapter 8: Much of the material has been reorganized for clarity and ease of 
presentation. The screening discussion has been made more explicit. 

Chapter 9: More emphasis has been given to the use of bonding and antibonding 
orbitals to predict the relative stability of molecules. Sections on molecular 
vibrations and rotations have been rewritten. 

Chapter 10: This chapter has been extensively rewritten. A new section on 
the density of states function allows statistical distributions for photons or 
particles to be discussed more rigorously. New applications of quantum 
statistics include Bose-Einstein condensation, white dwarf stars, and dilute 
mixtures of ³He in ⁴He.

Chapter 11: The chapter has been rewritten to broaden the applications of the 
quantum theory of solids to include not only electrical conductivity but also 
the heat capacity of solids and paramagnetism. 

Chapter 12: To emphasize the unity of various topics within modern physics, 
this chapter now includes proton and neutron separation energies, a new 
section on quantum states in nuclei, and nuclear vibrational and rotational 
states, all of which have analogues in atomic or molecular structure. 

Chapter 13: The discussion of the physics of fission has been expanded while 
that of the properties of nuclear reactors has been reduced somewhat. 
Because much current research in nuclear physics is related to astrophysics, 
this chapter now features a section on nucleosynthesis. 

Chapter 14: New material on quarkonium and neutrino oscillations has been 
added. 

Chapter 15: Chapters 15 and 16 of the 2nd edition have been collapsed into
a single chapter on cosmology. New results from COBE and WMAP are 
included, along with discussions of the horizon and flatness problems (and 
their inflationary solution). 


Many reviewers and class-testers of the manuscript of this edition have offered
suggestions to improve both the physics and its presentation. I am particularly
grateful to:


David Bannon, Oregon State University 

Gerald Crawford, Fort Lewis College 

Luther Frommhold, University of Texas-Austin 
Gary Goldstein, Tufts University 

Leon Gunther, Tufts University 

Gary Ihas, University of Florida

Paul Lee, California State University, Northridge
Jeff Loats, Metropolitan State College of Denver 
Jay Newman, Union College 

Stephen Pate, New Mexico State University 
David Roundy, Oregon State University 

Rich Schelp, Erskine College 

Weidian Shen, Eastern Michigan University 
Hongtao Shi, Sonoma State University 

Janet Tate, Oregon State University 

Jeffrey L. Wragg, College of Charleston 

Weldon Wilson, University of Central Oklahoma 


I am also grateful for the many anonymous comments from students who used 
the manuscript at the test sites. I am indebted to all those reviewers and users for 
their contributions to the project. 

Funding for the development and testing of the supplemental exercises in the 
Instructor’s Manual was provided through a grant from the National Science 
Foundation. I am pleased to acknowledge their support. Two graduate students 
at Oregon State University helped to test and implement the curricular reforms: 
K. C. Walsh and Pornrat Wattasinawich. I appreciate their assistance in this 
project. 

The staff at John Wiley & Sons have been especially helpful throughout the 
project. I am particularly grateful to: Executive Editor Stuart Johnson for his 
patience and support in bringing the new edition into reality; Assistant Production 
Editor Elaine Chew for handling a myriad of complicated composition and 
illustration details with efficiency and good humor; and Photo Editor Sheena 
Goldstein for helping me navigate the treacherous waters of new copyright and 
permission restrictions. 

In my research and other professional activities, I occasionally meet physicists 
who used earlier editions of this text when they were students. Some report that 
their first exposure to modern physics kindled the spark that led them to careers 
in physics. For many students, this course offers their first insights into what 
physicists really do and what is exciting, perplexing, and challenging about our 
profession. I hope students who use this new edition will continue to find those 
inspirations. 


Corvallis, Oregon Kenneth S. Krane 
August 2011 kranek@physics.oregonstate.edu 
Chapter 1 | The Failures of Classical Physics


If you were a physicist living at the end of the 19th century, you probably would 
have been pleased with the progress that physics had made in understanding the 
laws that govern the processes of nature. Newton’s laws of mechanics, including 
gravitation, had been carefully tested, and their success had provided a framework 
for understanding the interactions among objects. Electricity and magnetism 
had been unified by Maxwell’s theoretical work, and the electromagnetic waves 
predicted by Maxwell’s equations had been discovered and investigated in the 
experiments conducted by Hertz. The laws of thermodynamics and kinetic theory 
had been particularly successful in providing a unified explanation of a wide 
variety of phenomena involving heat and temperature. These three successful 
theories—mechanics, electromagnetism, and thermodynamics—form the basis 
for what we call “classical physics.” 

Beyond your 19th-century physics laboratory, the world was undergoing rapid 
changes. The Industrial Revolution demanded laborers for the factories and 
accelerated the transition from a rural and agrarian to an urban society. These 
workers formed the core of an emerging middle class and a new economic order. 
The political world was changing, too—the rising tide of militarism, the forces 
of nationalism and revolution, and the gathering strength of Marxism would 
soon upset established governments. The fine arts were similarly in the middle 
of revolutionary change, as new ideas began to dominate the fields of painting, 
sculpture, and music. The understanding of even the very fundamental aspects of 
human behavior was subject to serious and critical modification by the Freudian 
psychologists. 

In the world of physics, too, there were undercurrents that would soon cause 
revolutionary changes. Even though the overwhelming majority of experimental 
evidence agreed with classical physics, several experiments gave results that were 
not explainable in terms of the otherwise successful classical theories. Classical 
electromagnetic theory suggested that a medium is needed to propagate electro- 
magnetic waves, but precise experiments failed to detect this medium. Experiments 
to study the emission of electromagnetic waves by hot, glowing objects gave 
results that could not be explained by the classical theories of thermodynamics 
and electromagnetism. Experiments on the emission of electrons from surfaces 
illuminated with light also could not be understood using classical theories. 

These few experiments may not seem significant, especially when viewed 
against the background of the many successful and well-understood experiments 
of the 19th century. However, these experiments were to have a profound and 
lasting effect, not only on the world of physics, but on all of science, on the 
political structure of our world, and on the way we view ourselves and our place 
in the universe. Within the short span of two decades between 1905 and 1925, the 
shortcomings of classical physics would lead to the special and general theories 
of relativity and the quantum theory. 

The designation modern physics usually refers to the developments that began 
in about 1900 and led to the relativity and quantum theories, including the 
applications of those theories to understanding the atom, the atomic nucleus and 
the particles of which it is composed, collections of atoms in molecules and solids, 
and, on a cosmic scale, the origin and evolution of the universe. Our discussion 
of modern physics in this text touches on each of these areas. 

We begin our study in this chapter with a brief review of some important 
principles of classical physics, and we discuss some situations in which classical 
physics offers either inadequate or incorrect conclusions. These situations are not
necessarily those that originally gave rise to the relativity and quantum theories, 
but they do help us understand why classical physics fails to give us a complete 
picture of nature. 


1.1 REVIEW OF CLASSICAL PHYSICS


Although there are many areas in which modern physics differs radically from 
classical physics, we frequently find the need to refer to concepts of classical 
physics. Here is a brief review of some of the concepts of classical physics that we 
may need. 


Mechanics 


A particle of mass $m$ moving with velocity $\vec{v}$ has a kinetic energy defined by

$$ K = \tfrac{1}{2} m v^2 \tag{1.1} $$

and a linear momentum $\vec{p}$ defined by

$$ \vec{p} = m\vec{v} \tag{1.2} $$

In terms of the linear momentum, the kinetic energy can be written

$$ K = \frac{p^2}{2m} \tag{1.3} $$


When one particle collides with another, we analyze the collision by applying 
two fundamental conservation laws: 


I. Conservation of Energy. The total energy of an isolated system (on which 
no net external force acts) remains constant. In the case of a collision between 
particles, this means that the total energy of the particles before the collision 
is equal to the total energy of the particles after the collision. 

II. Conservation of Linear Momentum. The total linear momentum of an 
isolated system remains constant. For the collision, the total linear momentum 
of the particles before the collision is equal to the total linear momentum of the 
particles after the collision. Because linear momentum is a vector, application 
of this law usually gives us two equations, one for the x components and 
another for the y components. 


These two conservation laws are of the most basic importance to understanding 
and analyzing a wide variety of problems in classical physics. Problems 1-4 and
11-14 at the end of this chapter review the use of these laws.

The importance of these conservation laws is both so great and so fundamental 
that, even though in Chapter 2 we learn that the special theory of relativity modifies 
Eqs. 1.1, 1.2, and 1.3, the laws of conservation of energy and linear momentum 
remain valid. 
Example 1.1





A helium atom ($m = 6.6465 \times 10^{-27}$ kg) moving at a speed
of $v_{\text{He}} = 1.518 \times 10^{6}$ m/s collides with an atom of nitrogen
($m = 2.3253 \times 10^{-26}$ kg) at rest. After the collision,
the helium atom is found to be moving with a velocity of
$v'_{\text{He}} = 1.199 \times 10^{6}$ m/s at an angle of $\theta_{\text{He}} = 78.75^\circ$
relative to the direction of the original motion of the helium
atom. (a) Find the velocity (magnitude and direction) of the
nitrogen atom after the collision. (b) Compare the kinetic
energy before the collision with the total kinetic energy of
the atoms after the collision.


Solution 

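(a) The law of conservation of momentum for this collision
can be written in vector form as $\vec{p}_{\text{initial}} = \vec{p}_{\text{final}}$, which
is equivalent to

$$ p_{x,\text{initial}} = p_{x,\text{final}} \quad \text{and} \quad p_{y,\text{initial}} = p_{y,\text{final}} $$

The collision is shown in Figure 1.1. The initial values
of the total momentum are, choosing the x axis to be the
direction of the initial motion of the helium atom,

$$ p_{x,\text{initial}} = m_{\text{He}} v_{\text{He}} \quad \text{and} \quad p_{y,\text{initial}} = 0 $$

The final total momentum can be written

$$ p_{x,\text{final}} = m_{\text{He}} v'_{\text{He}} \cos\theta_{\text{He}} + m_{\text{N}} v'_{\text{N}} \cos\theta_{\text{N}} $$

$$ p_{y,\text{final}} = m_{\text{He}} v'_{\text{He}} \sin\theta_{\text{He}} + m_{\text{N}} v'_{\text{N}} \sin\theta_{\text{N}} $$

The expression for $p_{y,\text{final}}$ is written in general form with
a + sign even though we expect that $\theta_{\text{He}}$ and $\theta_{\text{N}}$ are on
opposite sides of the x axis. If the equation is written in
this way, $\theta_{\text{N}}$ will come out to be negative.

FIGURE 1.1 Example 1.1. (a) Before collision;
(b) after collision.

The law of conservation of momentum gives, for the x components,
$m_{\text{He}} v_{\text{He}} = m_{\text{He}} v'_{\text{He}} \cos\theta_{\text{He}} + m_{\text{N}} v'_{\text{N}} \cos\theta_{\text{N}}$, and for the y
components, $0 = m_{\text{He}} v'_{\text{He}} \sin\theta_{\text{He}} + m_{\text{N}} v'_{\text{N}} \sin\theta_{\text{N}}$. Solving
for the unknown terms, we find

$$
\begin{aligned}
v'_{\text{N}} \cos\theta_{\text{N}} &= \frac{m_{\text{He}}\,(v_{\text{He}} - v'_{\text{He}} \cos\theta_{\text{He}})}{m_{\text{N}}} \\
&= \{(6.6465 \times 10^{-27}\ \text{kg})[1.518 \times 10^{6}\ \text{m/s} - (1.199 \times 10^{6}\ \text{m/s})(\cos 78.75^\circ)]\} \\
&\quad \times (2.3253 \times 10^{-26}\ \text{kg})^{-1} \\
&= 3.6704 \times 10^{5}\ \text{m/s}
\end{aligned}
$$

$$
\begin{aligned}
v'_{\text{N}} \sin\theta_{\text{N}} &= -\frac{m_{\text{He}}\, v'_{\text{He}} \sin\theta_{\text{He}}}{m_{\text{N}}} \\
&= -(6.6465 \times 10^{-27}\ \text{kg})(1.199 \times 10^{6}\ \text{m/s})(\sin 78.75^\circ)(2.3253 \times 10^{-26}\ \text{kg})^{-1} \\
&= -3.3613 \times 10^{5}\ \text{m/s}
\end{aligned}
$$

We can now solve for $v'_{\text{N}}$ and $\theta_{\text{N}}$:

$$
\begin{aligned}
v'_{\text{N}} &= \sqrt{(v'_{\text{N}} \sin\theta_{\text{N}})^2 + (v'_{\text{N}} \cos\theta_{\text{N}})^2} \\
&= \sqrt{(-3.3613 \times 10^{5}\ \text{m/s})^2 + (3.6704 \times 10^{5}\ \text{m/s})^2} \\
&= 4.977 \times 10^{5}\ \text{m/s}
\end{aligned}
$$

$$
\theta_{\text{N}} = \tan^{-1}\frac{v'_{\text{N}} \sin\theta_{\text{N}}}{v'_{\text{N}} \cos\theta_{\text{N}}}
= \tan^{-1}\left(\frac{-3.3613 \times 10^{5}\ \text{m/s}}{3.6704 \times 10^{5}\ \text{m/s}}\right) = -42.48^\circ
$$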


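(b) The initial kinetic energy is

$$
\begin{aligned}
K_{\text{initial}} &= \tfrac{1}{2} m_{\text{He}} v_{\text{He}}^2 \\
&= \tfrac{1}{2}(6.6465 \times 10^{-27}\ \text{kg})(1.518 \times 10^{6}\ \text{m/s})^2 \\
&= 7.658 \times 10^{-15}\ \text{J}
\end{aligned}
$$

and the total final kinetic energy is

$$
\begin{aligned}
K_{\text{final}} &= \tfrac{1}{2} m_{\text{He}} v'^{\,2}_{\text{He}} + \tfrac{1}{2} m_{\text{N}} v'^{\,2}_{\text{N}} \\
&= \tfrac{1}{2}(6.6465 \times 10^{-27}\ \text{kg})(1.199 \times 10^{6}\ \text{m/s})^2
 + \tfrac{1}{2}(2.3253 \times 10^{-26}\ \text{kg})(4.977 \times 10^{5}\ \text{m/s})^2 \\
&= 7.658 \times 10^{-15}\ \text{J}
\end{aligned}
$$

Note that the initial and final kinetic energies are equal.
This is the characteristic of an elastic collision, in which
no energy is lost to, for example, internal excitation of the
particles.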
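
The arithmetic of Example 1.1 can be checked numerically. The short Python sketch below is not part of the original example; the function names (`target_velocity`, `kinetic_energy`) are illustrative choices. It applies the same two momentum-component equations to the values quoted in the example and then compares the kinetic energies before and after the collision.

```python
import math

# Values quoted in Example 1.1 (SI units).
M_HE = 6.6465e-27        # helium mass, kg
M_N = 2.3253e-26         # nitrogen mass, kg
V_HE = 1.518e6           # initial helium speed, m/s
V_HE_FINAL = 1.199e6     # final helium speed, m/s
THETA_HE_DEG = 78.75     # final helium direction, degrees from +x


def target_velocity(m1, v1, v1_final, theta1_deg, m2):
    """Speed and direction of an initially stationary target (mass m2)
    after being struck by a projectile (mass m1) moving along +x."""
    theta1 = math.radians(theta1_deg)
    # x components: m1*v1 = m1*v1_final*cos(theta1) + m2*vx
    vx = m1 * (v1 - v1_final * math.cos(theta1)) / m2
    # y components: 0 = m1*v1_final*sin(theta1) + m2*vy
    vy = -m1 * v1_final * math.sin(theta1) / m2
    speed = math.hypot(vx, vy)
    angle_deg = math.degrees(math.atan2(vy, vx))
    return speed, angle_deg


def kinetic_energy(m, v):
    """Classical kinetic energy, Eq. 1.1."""
    return 0.5 * m * v**2


if __name__ == "__main__":
    v_n, theta_n = target_velocity(M_HE, V_HE, V_HE_FINAL, THETA_HE_DEG, M_N)
    k_initial = kinetic_energy(M_HE, V_HE)
    k_final = kinetic_energy(M_HE, V_HE_FINAL) + kinetic_energy(M_N, v_n)
    print(f"v_N = {v_n:.4g} m/s at {theta_n:.2f} degrees")
    print(f"K_initial = {k_initial:.4g} J, K_final = {k_final:.4g} J")
```

Using `atan2` rather than a plain arctangent keeps the sign of the angle tied to the signs of both components, so the negative value of the nitrogen angle comes out automatically.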





Example 1.2

An atom of uranium ($m = 3.9529 \times 10^{-25}$ kg) at rest
decays spontaneously into an atom of helium
($m = 6.6465 \times 10^{-27}$ kg) and an atom of thorium
($m = 3.8864 \times 10^{-25}$ kg). The helium atom is observed to move in the
positive x direction with a velocity of $1.423 \times 10^{7}$ m/s
(Figure 1.2). (a) Find the velocity (magnitude and direction)
of the thorium atom. (b) Find the total kinetic energy
of the two atoms after the decay.








FIGURE 1.2 Example 1.2. (a) Before decay; (b) after decay.


Solution 

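(a) Here we again use the law of conservation of momentum.
The initial momentum before the decay is zero, so the
total momentum of the two atoms after the decay must also
be zero:

$$ p_{x,\text{initial}} = 0 \qquad p_{x,\text{final}} = m_{\text{He}} v'_{\text{He}} + m_{\text{Th}} v'_{\text{Th}} $$

Setting $p_{x,\text{initial}} = p_{x,\text{final}}$ and solving for $v'_{\text{Th}}$, we obtain

$$
\begin{aligned}
v'_{\text{Th}} &= -\frac{m_{\text{He}} v'_{\text{He}}}{m_{\text{Th}}} \\
&= -\frac{(6.6465 \times 10^{-27}\ \text{kg})(1.423 \times 10^{7}\ \text{m/s})}{3.8864 \times 10^{-25}\ \text{kg}} \\
&= -2.432 \times 10^{5}\ \text{m/s}
\end{aligned}
$$

The thorium atom moves in the negative x direction.
(b) The total kinetic energy after the decay is:

$$
\begin{aligned}
K &= \tfrac{1}{2} m_{\text{He}} v'^{\,2}_{\text{He}} + \tfrac{1}{2} m_{\text{Th}} v'^{\,2}_{\text{Th}} \\
&= \tfrac{1}{2}(6.6465 \times 10^{-27}\ \text{kg})(1.423 \times 10^{7}\ \text{m/s})^2
 + \tfrac{1}{2}(3.8864 \times 10^{-25}\ \text{kg})(-2.432 \times 10^{5}\ \text{m/s})^2 \\
&= 6.844 \times 10^{-13}\ \text{J}
\end{aligned}
$$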


Kinetic energy is clearly not conserved in this decay,
because the initial kinetic energy of the uranium atom
was zero. However, total energy is conserved: if we
write the total energy as the sum of kinetic energy
and nuclear energy, then the total initial energy (kinetic
+ nuclear) is equal to the total final energy (kinetic +
nuclear). The gain in kinetic energy occurs as a
result of a loss in nuclear energy. This is an example of
the type of radioactive decay called alpha decay, which we
discuss in more detail in Chapter 12.
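
As a numerical cross-check of Example 1.2, the following Python sketch applies one-dimensional momentum conservation to a two-body decay from rest. The function names are illustrative and not taken from the text. With the masses and helium speed quoted in the example, the results should agree with the values above to within about 0.1%; small differences in the last digit reflect rounding.

```python
# Values quoted in Example 1.2 (SI units).
M_HE = 6.6465e-27   # helium mass, kg
M_TH = 3.8864e-25   # thorium mass, kg
V_HE = 1.423e7      # helium velocity along +x after the decay, m/s


def recoil_velocity(m_emitted, v_emitted, m_recoil):
    """Velocity of the recoiling fragment when the parent decays at rest.

    Total momentum is zero before the decay, so
    m_emitted*v_emitted + m_recoil*v_recoil = 0.
    """
    return -m_emitted * v_emitted / m_recoil


def total_kinetic_energy(fragments):
    """Sum of (1/2) m v^2 over an iterable of (mass, velocity) pairs."""
    return sum(0.5 * m * v**2 for m, v in fragments)


if __name__ == "__main__":
    v_th = recoil_velocity(M_HE, V_HE, M_TH)
    k_total = total_kinetic_energy([(M_HE, V_HE), (M_TH, v_th)])
    print(f"v_Th = {v_th:.4g} m/s")
    print(f"K_total = {k_total:.4g} J")
```

Because the thorium atom is nearly 60 times more massive than the helium atom, it carries the same magnitude of momentum but only a small fraction of the kinetic energy.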





Another application of the principle of conservation of energy occurs when
a particle moves subject to an external force $F$. Corresponding to that external
force there is often a potential energy $U$, defined such that (for one-dimensional
motion)

$$ F = -\frac{dU}{dx} \tag{1.4} $$

The total energy $E$ is the sum of the kinetic and potential energies:

$$ E = K + U \tag{1.5} $$

As the particle moves, $K$ and $U$ may change, but $E$ remains constant. (In
Chapter 2, we find that the special theory of relativity gives us a new definition of
total energy.)
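
A short check of why $E$ stays constant follows from differentiating Eq. 1.5 with respect to time. The sketch below assumes Newton's second law in the form $F = m\,dv/dt$ and a potential energy $U$ that depends only on the position $x$; neither assumption is spelled out in the passage above.

$$
\frac{dE}{dt} = \frac{d}{dt}\left(\tfrac{1}{2} m v^2\right) + \frac{dU}{dx}\frac{dx}{dt}
= m v \frac{dv}{dt} - F v
= v\left(m\frac{dv}{dt} - F\right) = 0
$$

The second step uses Eq. 1.4 to replace $dU/dx$ by $-F$ and writes $dx/dt = v$; the final step uses Newton's second law.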


FIGURE 1.3 A particle of mass m, 
located with respect to the origin 
O by position vector r and moving 
with linear momentum p, has angular 
momentum L about O. 


When a particle moving with linear momentum p is at a displacement r from the 
origin O, its angular momentum L about the point O is defined (see Figure 1.3) by 


$$ \vec{L} = \vec{r} \times \vec{p} \tag{1.6} $$


There is a conservation law for angular momentum, just as with linear momentum. 
In practice this has many important applications. For example, when a charged 
particle moves near, and is deflected by, another charged particle, the total 
angular momentum of the system (the two particles) remains constant if no net 
external torque acts on the system. If the second particle is so much more massive 
than the first that its motion is essentially unchanged by the influence of the first 
particle, the angular momentum of the first particle remains constant (because 
the second particle acquires no angular momentum). Another application of the 
conservation of angular momentum occurs when a body such as a comet moves 
in the gravitational field of the Sun—the elliptical shape of the comet’s orbit is 
necessary to conserve angular momentum. In this case r and p of the comet must 
simultaneously change so that L remains constant. 


Velocity Addition 


Another important aspect of classical physics is the rule for combining velocities. 
For example, suppose a jet plane is moving at a velocity of $v_{PG} = 650$ m/s,
as measured by an observer on the ground. The subscripts on the velocity
mean “velocity of the plane relative to the ground.” The plane fires a missile
in the forward direction; the velocity of the missile relative to the plane is
$v_{MP} = 250$ m/s. According to the observer on the ground, the velocity of the
missile is: $v_{MG} = v_{MP} + v_{PG} = 250\ \text{m/s} + 650\ \text{m/s} = 900\ \text{m/s}$.

We can generalize this rule as follows. Let $\vec{v}_{AB}$ represent the velocity of A
relative to B, and let $\vec{v}_{BC}$ represent the velocity of B relative to C. Then the velocity
of A relative to C is

$$ \vec{v}_{AC} = \vec{v}_{AB} + \vec{v}_{BC} \tag{1.7} $$


This equation is written in vector form to allow for the possibility that the 
velocities might be in different directions; for example, the missile might be fired 
not in the direction of the plane’s velocity but in some other direction. This seems 
to be a very “common-sense” way of combining velocities, but we will see later 
in this chapter (and in more detail in Chapter 2) that this common-sense rule can 
lead to contradictions with observations when we apply it to speeds close to the 
speed of light. 

A common application of this rule (for speeds small compared with the 
speed of light) occurs in collisions, when we want to analyze conservation of 
momentum and energy in a frame of reference that is different from the one 
in which the collision is observed. For example, let’s analyze the collision of 
Example 1.1 in a frame of reference that is moving with the center of mass. 
Suppose the initial velocity of the He atom defines the positive x direction. 
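The velocity of the center of mass (relative to the laboratory) is then
$v_{CL} = (v_{\text{He}} m_{\text{He}} + v_{\text{N}} m_{\text{N}})/(m_{\text{He}} + m_{\text{N}}) = 3.374 \times 10^{5}$ m/s. We would like to find the
initial velocity of the He and N relative to the center of mass. If we start with
$v_{\text{He}L} = v_{\text{He}C} + v_{CL}$ and $v_{\text{N}L} = v_{\text{N}C} + v_{CL}$, then

$$
\begin{aligned}
v_{\text{He}C} &= v_{\text{He}L} - v_{CL} = 1.518 \times 10^{6}\ \text{m/s} - 3.374 \times 10^{5}\ \text{m/s} = 1.181 \times 10^{6}\ \text{m/s} \\
v_{\text{N}C} &= v_{\text{N}L} - v_{CL} = 0 - 3.374 \times 10^{5}\ \text{m/s} = -0.337 \times 10^{6}\ \text{m/s}
\end{aligned}
$$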
In a similar fashion we can calculate the final velocities of the He and N.
The resulting collision as viewed from this frame of reference is illustrated in 
Figure 1.4. There is a special symmetry in this view of the collision that is not 
apparent from the same collision viewed in the laboratory frame of reference 
(Figure 1.1); each velocity simply changes direction leaving its magnitude 
unchanged, and the atoms move in opposite directions. The angles in this view of 
the collision are different from those of Figure 1.1, because the velocity addition 
in this case applies only to the x components and leaves the y components 
unchanged, which means that the angles must change. 
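
The change of reference frame can be sketched in a few lines of Python. This is an illustration rather than part of the original discussion: the helper names are hypothetical, and the final laboratory velocity of the nitrogen atom is taken from the component values quoted in Example 1.1. The sketch subtracts the center-of-mass velocity from each laboratory velocity (Eq. 1.7 rearranged) and compares the speeds before and after the collision.

```python
import math

# Masses and laboratory-frame velocities from Example 1.1 (SI units).
M_HE = 6.6465e-27
M_N = 2.3253e-26
THETA_HE = math.radians(78.75)

HE_INITIAL = (1.518e6, 0.0)
N_INITIAL = (0.0, 0.0)
HE_FINAL = (1.199e6 * math.cos(THETA_HE), 1.199e6 * math.sin(THETA_HE))
N_FINAL = (3.6704e5, -3.3613e5)   # components quoted in Example 1.1


def center_of_mass_velocity(m1, v1, m2, v2):
    """Mass-weighted average of two velocity vectors given as (vx, vy)."""
    total = m1 + m2
    return tuple((m1 * a + m2 * b) / total for a, b in zip(v1, v2))


def relative_to(v, v_frame):
    """Velocity measured in a frame moving at v_frame: v - v_frame."""
    return tuple(a - b for a, b in zip(v, v_frame))


def speed(v):
    """Magnitude of a two-component velocity."""
    return math.hypot(*v)


if __name__ == "__main__":
    v_cl = center_of_mass_velocity(M_HE, HE_INITIAL, M_N, N_INITIAL)
    for label, v in [("He before", HE_INITIAL), ("N before", N_INITIAL),
                     ("He after", HE_FINAL), ("N after", N_FINAL)]:
        v_c = relative_to(v, v_cl)
        print(f"{label}: speed in CM frame = {speed(v_c):.4g} m/s")
```

In the center-of-mass frame the printed speed of each atom should be the same before and after the collision (to the rounding of the inputs), which is the symmetry described above.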


Electricity and Magnetism 


The electrostatic force (Coulomb force) exerted by a charged particle $q_1$ on
another charge $q_2$ has magnitude

$$ F = \frac{1}{4\pi\varepsilon_0} \frac{|q_1||q_2|}{r^2} \tag{1.8} $$

The direction of $\vec{F}$ is along the line joining the particles (Figure 1.5). In the SI
system of units, the constant $1/4\pi\varepsilon_0$ has the value

$$ \frac{1}{4\pi\varepsilon_0} = 8.988 \times 10^{9}\ \text{N} \cdot \text{m}^2/\text{C}^2 $$

The corresponding potential energy is

$$ U = \frac{1}{4\pi\varepsilon_0} \frac{q_1 q_2}{r} \tag{1.9} $$

In all equations derived from Eq. 1.8 or 1.9 as starting points, the quantity $1/4\pi\varepsilon_0$
must appear. In some texts and reference books, you may find electrostatic
quantities in which this constant does not appear. In such cases, the centimeter-
gram-second (cgs) system has probably been used, in which the constant $1/4\pi\varepsilon_0$
is defined to be 1. You should always be very careful in making comparisons
of electrostatic quantities from different references and check that the units are
identical.

An electrostatic potential difference $\Delta V$ can be established by a distribution of
charges. The most common example of a potential difference is that between the
two terminals of a battery. When a charge $q$ moves through a potential difference
$\Delta V$, the change in its electrical potential energy $\Delta U$ is

$$ \Delta U = q\,\Delta V \tag{1.10} $$

At the atomic or nuclear level, we usually measure charges in terms of the basic
charge of the electron or proton, whose magnitude is $e = 1.602 \times 10^{-19}$ C. If
such charges are accelerated through a potential difference $\Delta V$ that is a few volts,
the resulting loss in potential energy and corresponding gain in kinetic energy will
be of the order of $10^{-19}$ to $10^{-18}$ J. To avoid working with such small numbers,
it is common in the realm of atomic or nuclear physics to measure energies in
electron-volts (eV), defined to be the energy of a charge equal in magnitude to
that of the electron that passes through a potential difference of 1 volt:

$$ \Delta U = q\,\Delta V = (1.602 \times 10^{-19}\ \text{C})(1\ \text{V}) = 1.602 \times 10^{-19}\ \text{J} $$





FIGURE 1.4 The collision of Figure 1.1 viewed from a frame of reference
moving with the center of mass. (a) Before collision. (b) After collision.
In this frame the two particles always move in opposite directions,
and for elastic collisions the magnitude of each particle’s velocity is
unchanged.

FIGURE 1.5 Two charged particles experience equal and opposite
electrostatic forces along the line joining their centers. If the charges have the
same sign (both positive or both negative), the force is repulsive; if the signs
are different, the force is attractive.




FIGURE 1.6 (a) A circular current loop produces a magnetic field $\vec{B}$ at
its center. (b) A current loop with magnetic moment $\vec{\mu}$ in an external
magnetic field $\vec{B}_{\text{ext}}$. The field exerts a torque on the loop that will tend to
rotate it so that $\vec{\mu}$ lines up with $\vec{B}_{\text{ext}}$.


and thus

$$ 1\ \text{eV} = 1.602 \times 10^{-19}\ \text{J} $$

Some convenient multiples of the electron-volt are

keV = kilo electron-volt = $10^{3}$ eV
MeV = mega electron-volt = $10^{6}$ eV
GeV = giga electron-volt = $10^{9}$ eV

(In some older works you may find reference to the BeV, for billion electron-volts;
this is a source of confusion, for in the United States a billion is $10^{9}$ while in
Europe a billion is $10^{12}$.)

Often we wish to find the potential energy of two basic charges separated by 
typical atomic or nuclear dimensions, and we wish to have the result expressed 
in electron-volts. Here is a convenient way of doing this. First we express the 
quantity e? /4zre) in a more convenient form: 


2 


— = (8.988 x 10? N-m?/C7)(1.602 x 107!° C)? = 2.307 x 10778 N- m? 
ITE, 


1 10° 
= (2.307 x 10-28 N- m?) ) ( =) 
1.602 x 10-!9 J/eV m 
= 1.440eV-nm 





With this useful combination of constants it becomes very easy to calculate 
electrostatic potential energies. For two electrons separated by a typical atomic 
dimension of 1.00 nm, Eq. 1.9 gives 


1 2 e 1 
0 


= ) = 1.44eV 
Amey r 


1 
1.00 nm 
For calculations at the nuclear level, the femtometer is a more convenient unit of 
distance and MeV is a more appropriate energy unit: 


é Ret eer 10" fin’) (1MeV) _ 1 ago Mev-4 
47€ 9 oem \ 109 nm Im 10° eV 


It is remarkable (and convenient to remember) that the quantity $e^2/4\pi\varepsilon_0$ has the
same value of 1.440 whether we use typical atomic energies and sizes (eV·nm)
or typical nuclear energies and sizes (MeV·fm).
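
The unit conversion above can be checked numerically. The following is a minimal sketch that uses only the constants quoted in the text; the function names are illustrative choices, not part of the text.

```python
# Constants as quoted in the text.
COULOMB_CONSTANT = 8.988e9      # 1/(4*pi*eps0), in N*m^2/C^2
ELEMENTARY_CHARGE = 1.602e-19   # e in coulombs; numerically also joules per eV


def e2_over_4pi_eps0():
    """Return e^2/(4*pi*eps0) in eV*nm (numerically the same in MeV*fm)."""
    joule_meters = COULOMB_CONSTANT * ELEMENTARY_CHARGE**2  # N*m^2 = J*m
    ev_meters = joule_meters / ELEMENTARY_CHARGE            # J -> eV
    return ev_meters * 1e9                                  # m -> nm


def coulomb_energy(separation, z1=1, z2=1):
    """Potential energy of two charges z1*e and z2*e.

    The result is in eV when the separation is in nm,
    and in MeV when the separation is in fm.
    """
    return z1 * z2 * e2_over_4pi_eps0() / separation


print(f"e^2/(4 pi eps0) = {e2_over_4pi_eps0():.3f} eV*nm")
print(f"U for two electrons 1.00 nm apart = {coulomb_energy(1.00, -1, -1):.2f} eV")
```

Because the constant has the same numerical value in both unit systems, the same function serves for atomic separations in nm and nuclear separations in fm.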

A magnetic field $\vec{B}$ can be produced by an electric current i. For example, the
magnitude of the magnetic field at the center of a circular current loop of radius r
is (see Figure 1.6a)

$$B = \frac{\mu_0 i}{2r} \tag{1.11}$$

The SI unit for magnetic field is the tesla (T), which is equivalent to a newton per
ampere-meter. The constant $\mu_0$ is

$$\mu_0 = 4\pi \times 10^{-7}\ \text{N}\cdot\text{s}^2/\text{C}^2$$


Be sure to remember that i is in the direction of the conventional (positive) current, 
opposite to the actual direction of travel of the negatively charged electrons that 
typically produce the current in metallic wires. The direction of B is chosen 
according to the right-hand rule: if you hold the wire in the right hand with the
thumb pointing in the direction of the current, the fingers point in the direction of
the magnetic field. 
It is often convenient to define the magnetic moment $\vec{\mu}$ of a current loop:

$$|\vec{\mu}| = iA \tag{1.12}$$

where A is the geometrical area enclosed by the loop. The direction of $\vec{\mu}$ is
perpendicular to the plane of the loop, according to the right-hand rule.
When a current loop is placed in a uniform external magnetic field $\vec{B}_{\text{ext}}$ (as in
Figure 1.6b), there is a torque $\vec{\tau}$ on the loop that tends to line up $\vec{\mu}$ with $\vec{B}_{\text{ext}}$:

$$\vec{\tau} = \vec{\mu} \times \vec{B}_{\text{ext}} \tag{1.13}$$


Another way to describe this interaction is to assign a potential energy to the 
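magnetic moment $\vec{\mu}$ in the external field $\vec{B}_{\text{ext}}$:

$$U = -\vec{\mu} \cdot \vec{B}_{\text{ext}} \tag{1.14}$$

When the field $\vec{B}_{\text{ext}}$ is applied, $\vec{\mu}$ rotates so that its energy tends to a minimum
value, which occurs when $\vec{\mu}$ and $\vec{B}_{\text{ext}}$ are parallel.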

It is important for us to understand the properties of magnetic moments, 
because particles such as electrons or protons have magnetic moments. Although 
we don’t imagine these particles to be tiny current loops, their magnetic moments 
do obey Eqs. 1.13 and 1.14. 
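
As an illustration of Eqs. 1.12 to 1.14, the sketch below evaluates the torque and potential energy for three orientations of a magnetic moment. The loop current, loop area, and field strength are hypothetical values chosen only for the example.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as (x, y, z) tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a, b):
    """Dot product of two 3-vectors."""
    return sum(ai * bi for ai, bi in zip(a, b))


def torque(mu, b_ext):
    """Eq. 1.13: tau = mu x B_ext (N*m for mu in A*m^2 and B in T)."""
    return cross(mu, b_ext)


def potential_energy(mu, b_ext):
    """Eq. 1.14: U = -mu . B_ext (joules)."""
    return -dot(mu, b_ext)


# Hypothetical loop: i = 2.0 A and A = 0.010 m^2, so |mu| = iA (Eq. 1.12).
MU = 2.0 * 0.010
B_EXT = (0.0, 0.0, 0.50)  # hypothetical 0.50 T field along the z axis

orientations = {
    "parallel": (0.0, 0.0, MU),
    "perpendicular": (MU, 0.0, 0.0),
    "antiparallel": (0.0, 0.0, -MU),
}
for label, mu in orientations.items():
    print(label, "U =", potential_energy(mu, B_EXT), "J;",
          "torque =", torque(mu, B_EXT), "N*m")
```

The energy is lowest for the parallel orientation and the torque vanishes there, consistent with the statement that the moment tends to line up with the field.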

A particularly important aspect of electromagnetism is electromagnetic waves. 
In Chapter 3 we discuss some properties of these waves in more detail. Electro- 
magnetic waves travel in free space with speed c (the speed of light), which is 
related to the electromagnetic constants $\varepsilon_0$ and $\mu_0$:

$$c = (\varepsilon_0 \mu_0)^{-1/2} \tag{1.15}$$

The speed of light has the exact value of c = 299,792,458 m/s.
Electromagnetic waves have a frequency f and wavelength $\lambda$, which are
related by

$$c = \lambda f \tag{1.16}$$


The wavelengths range from the very short (nuclear gamma rays) to the very 
long (radio waves). Figure 1.7 shows the electromagnetic spectrum with the 
conventional names assigned to the different ranges of wavelengths. 
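
The relation between c and the electromagnetic constants can be verified with a short calculation. This sketch assumes the value of $1/4\pi\varepsilon_0$ quoted earlier in the section and derives $\varepsilon_0$ from it; the 100-MHz frequency is a hypothetical example.

```python
import math

COULOMB_CONSTANT = 8.988e9                        # 1/(4*pi*eps0), N*m^2/C^2
MU_0 = 4 * math.pi * 1e-7                         # N*s^2/C^2
EPSILON_0 = 1 / (4 * math.pi * COULOMB_CONSTANT)  # C^2/(N*m^2)


def speed_of_light():
    """Eq. 1.15: c = (eps0 * mu0)^(-1/2), in m/s."""
    return 1 / math.sqrt(EPSILON_0 * MU_0)


def wavelength(frequency_hz):
    """Eq. 1.16 solved for the wavelength: lambda = c / f, in meters."""
    return speed_of_light() / frequency_hz


print(f"c = {speed_of_light():.4e} m/s")
print(f"wavelength at 100 MHz = {wavelength(100e6):.2f} m")
```

The computed speed agrees with the exact value to the four significant figures carried by the input constants.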


FIGURE 1.7 The electromagnetic spectrum. The boundaries of the regions are not sharply defined.


Kinetic Theory of Matter


An example of the successful application of classical physics to the structure 
of matter is the understanding of the properties of gases at relatively low 
pressures and high temperatures (so that the gas is far from the region of pressure 
and temperature where it might begin to condense into a liquid). Under these 
conditions, most real gases can be modeled as ideal gases and are well described 
by the ideal gas equation of state 


PV = NkT (1.17) 


where P is the pressure, V is the volume occupied by the gas, N is the number of
molecules, T is the temperature, and k is the Boltzmann constant, which has the
value

$$k = 1.381 \times 10^{-23}\ \text{J/K}$$


In using this equation and most of the equations in this section, the temperature 
must be measured in units of kelvins (K). Be careful not to confuse the symbol K 
for the unit of temperature with the symbol K for kinetic energy. 

The ideal gas equation of state can also be expressed as 


PV =nRT (1.18) 


where n is the number of moles and R is the universal gas constant with a 
value of 


$$R = 8.315\ \text{J/mol}\cdot\text{K}$$


One mole of a gas is the quantity that contains a number of fundamental entities
(atoms or molecules) equal to Avogadro’s constant $N_A$, where

$$N_A = 6.022 \times 10^{23}\ \text{per mole}$$

That is, one mole of helium contains $N_A$ atoms of He, one mole of nitrogen
contains $N_A$ molecules of N$_2$ (and thus $2N_A$ atoms of N), and one mole of
water vapor contains $N_A$ molecules of H$_2$O (and thus $2N_A$ atoms of H and $N_A$
atoms of O).

Because $N = nN_A$ (number of molecules equals number of moles times
number of molecules per mole), the relationship between the Boltzmann constant
and the universal gas constant is

$$R = kN_A \tag{1.19}$$
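
A quick numerical check shows that the two forms of the equation of state agree once Eq. 1.19 is used. The sketch below uses the constants quoted in the text; the sample (one mole in 0.0240 m$^3$ at 293 K) is a hypothetical choice.

```python
BOLTZMANN = 1.381e-23                # k, in J/K
AVOGADRO = 6.022e23                  # N_A, per mole
GAS_CONSTANT = BOLTZMANN * AVOGADRO  # Eq. 1.19, in J/(mol*K)


def pressure_from_molecules(n_molecules, volume_m3, temperature_k):
    """Eq. 1.17 solved for P, in pascals."""
    return n_molecules * BOLTZMANN * temperature_k / volume_m3


def pressure_from_moles(n_moles, volume_m3, temperature_k):
    """Eq. 1.18 solved for P, in pascals."""
    return n_moles * GAS_CONSTANT * temperature_k / volume_m3


print(f"R = k * N_A = {GAS_CONSTANT:.3f} J/(mol*K)")
print(f"P from N = {pressure_from_molecules(AVOGADRO, 0.0240, 293):.4e} Pa")
print(f"P from n = {pressure_from_moles(1.0, 0.0240, 293):.4e} Pa")
```

The product of the rounded constants differs from the quoted R = 8.315 J/mol·K only in the fourth significant figure.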


The ideal gas model is very successful for describing the properties of many 
gases. It assumes that the molecules are of negligibly small volume (that is, the 
gas is mostly empty space) and move randomly throughout the volume of the 
container. The molecules make occasional collisions with one another and with 
the walls of the container. The collisions obey Newton’s laws and are elastic and 
of very short duration. The molecules exert forces on one another only during 
collisions. Under these assumptions, there is no potential energy so that kinetic 
energy is the only form of energy that must be considered. Because the collisions 
are elastic, there is no net loss or gain of kinetic energy during the collisions.
Individual molecules may speed up or slow down due to collisions, but the
average kinetic energy of all the molecules in the container does not change. The
average kinetic energy of a molecule in fact depends only on the temperature:

$$K_{\text{av}} = \tfrac{3}{2}kT \quad \text{(per molecule)} \tag{1.20}$$

For rough estimates, the quantity kT is often used as a measure of the mean
kinetic energy per particle. For example, at room temperature (20°C = 293 K),
the mean kinetic energy per particle is approximately $4 \times 10^{-21}$ J (about 1/40 eV),
while in the interior of a star where $T \sim 10^{7}$ K, the mean energy is approximately
$10^{-16}$ J (about 1000 eV).
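
These rough estimates are easy to reproduce. The following sketch evaluates kT in joules and electron-volts at the two temperatures mentioned above, using the constants quoted in the text; the function names are illustrative.

```python
BOLTZMANN = 1.381e-23       # k, in J/K
JOULES_PER_EV = 1.602e-19   # 1 eV in joules


def thermal_energy(temperature_k):
    """Return kT as a (joules, electron-volts) pair."""
    kt_joules = BOLTZMANN * temperature_k
    return kt_joules, kt_joules / JOULES_PER_EV


def mean_kinetic_energy_ev(temperature_k):
    """Eq. 1.20: average kinetic energy (3/2)kT per molecule, in eV."""
    return 1.5 * thermal_energy(temperature_k)[1]


for label, temperature in [("room temperature", 293.0), ("stellar interior", 1e7)]:
    joules, ev = thermal_energy(temperature)
    print(f"{label}: kT = {joules:.2e} J = {ev:.3g} eV")
```

The rule of thumb kT ≈ 1/40 eV at room temperature gives a convenient scale against which atomic and molecular energies can be compared.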

Sometimes it is also useful to discuss the average kinetic energy of a mole of 
the gas: 


average K per mole = average K per molecule x number of molecules per mole 


Using Eq. 1.19 to relate the Boltzmann constant to the universal gas constant, we 
find the average molar kinetic energy to be 


$$K_{\text{av}} = \tfrac{3}{2}RT \quad \text{(per mole)} \tag{1.21}$$


It should be apparent from the context of the discussion whether $K_{\text{av}}$ refers to the
average per molecule or the average per mole.


1.2 THE FAILURE OF CLASSICAL CONCEPTS 
OF SPACE AND TIME 









In 1905, Albert Einstein proposed the special theory of relativity, which is in 
essence a new way of looking at space and time, replacing the “classical” space 
and time that were the basis of the physical theories of Galileo and Newton. 
Einstein’s proposal was based on a “thought experiment,” but in subsequent years 
experimental data have clearly indicated that the classical concepts of space and 
time are incorrect. In this section we examine how experimental results support 
the need for a new approach to space and time. 


The Failure of the Classical Concept of Time 


In high-energy collisions between two protons, many new particles can be 
produced, one of which is a pi meson (also known as a pion). When the pions are 
produced at rest in the laboratory, they are observed to have an average lifetime (the 
time between the production of the pion and its decay into other particles) of 26.0 ns 
(nanoseconds, or $10^{-9}$ s). On the other hand, pions in motion are observed to have
a very different lifetime. In one particular experiment, pions moving at a speed of
$2.737 \times 10^{8}$ m/s (91.3% of the speed of light) showed a lifetime of 63.7 ns.

Let us imagine this experiment as viewed by two different observers
(Figure 1.8). Observer #1, at rest in the laboratory, sees the pion moving relative
to the laboratory at a speed of 91.3% of the speed of light and measures its
lifetime to be 63.7 ns. Observer #2 is moving relative to the laboratory at exactly
the same velocity as the pion, so according to observer #2 the pion is at rest and
has a lifetime of 26.0 ns. The two observers measure different values for the time
interval between the same two events: the formation of the pion and its decay.

FIGURE 1.8 (a) The pion experiment according to $O_1$. Markers A and B
respectively show the locations of the pion’s creation and decay. (b) The same
experiment as viewed by $O_2$, relative to whom the pion is at rest and the
laboratory moves with velocity $-v$.

According to Newton, time is the same for all observers. Newton’s laws are 
based on this assumption. The pion experiment clearly shows that time is not 
the same for all observers, which indicates the need for a new theory that relates 
time intervals measured by different observers who are in motion with respect to 
each other. 


The Failure of the Classical Concept of Space 


The pion experiment also leads to a failure of the classical ideas about space. 
Suppose observer #1 erects two markers in the laboratory, one where the pion 
is created and another where it decays. The distance $D_1$ between the two
markers is equal to the speed of the pion multiplied by the time interval from
its creation to its decay: $D_1 = (2.737 \times 10^{8}\ \text{m/s})(63.7 \times 10^{-9}\ \text{s}) = 17.4$ m. To
observer #2, traveling at the same velocity as the pion, the laboratory appears
to be rushing by at a speed of $2.737 \times 10^{8}$ m/s and the time between passing
the first and second markers, showing the creation and decay of the pion in the
laboratory, is 26.0 ns. According to observer #2, the distance between the markers
is $D_2 = (2.737 \times 10^{8}\ \text{m/s})(26.0 \times 10^{-9}\ \text{s}) = 7.11$ m. Once again, we have two
observers in relative motion measuring different values for the same interval, in 
this case the distance between the two markers in the laboratory. The physical 
theories of Galileo and Newton are based on the assumption that space is the 
same for all observers, and so length measurements should not depend on relative 
motion. The pion experiment again shows that this cornerstone of classical physics 
is not consistent with modern experimental data. 
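
The two distance calculations can be placed side by side in a few lines. This sketch uses only the speed and lifetimes quoted above; the variable names are illustrative.

```python
PION_SPEED = 2.737e8            # m/s, relative speed of pion and laboratory
LIFETIME_LAB = 63.7e-9          # s, measured by observer #1 (laboratory)
LIFETIME_PION_FRAME = 26.0e-9   # s, measured by observer #2 (moving with pion)


def distance(speed, time_interval):
    """Distance covered at constant speed, D = v * t, in meters."""
    return speed * time_interval


d1 = distance(PION_SPEED, LIFETIME_LAB)
d2 = distance(PION_SPEED, LIFETIME_PION_FRAME)

print(f"D1 (observer #1) = {d1:.1f} m")
print(f"D2 (observer #2) = {d2:.1f} m")
print(f"ratio of time intervals = {LIFETIME_LAB / LIFETIME_PION_FRAME:.2f}")
print(f"ratio of distances      = {d1 / d2:.2f}")
```

Both ratios come out the same (about 2.45), whereas classical physics would require each of them to be exactly 1.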


The Failure of the Classical Concept 
of Velocity 


Classical physics places no limit on the maximum velocity that a particle can 
reach. One of the basic equations of kinematics, $v = v_0 + at$, shows that if a
particle experiences an acceleration a for a long enough time t, velocities as large
as desired can be achieved, perhaps even exceeding the speed of light. For another
example, when an aircraft flying at a speed of 200 m/s relative to an observer
on the ground launches a missile at a speed of 250 m/s relative to the aircraft, a
ground-based observer would measure the missile to travel at a speed of 200 m/s +
250 m/s = 450 m/s, according to the classical velocity addition rule (Eq. 1.7). We
can apply that same reasoning to a spaceship moving at a speed of $2.0 \times 10^{8}$ m/s
(relative to an observer on a space station), which fires a missile at a speed of
$2.5 \times 10^{8}$ m/s relative to the spacecraft. We would expect that the observer on
the space station would measure a speed of $4.5 \times 10^{8}$ m/s for the missile. This
speed exceeds the speed of light ($3.0 \times 10^{8}$ m/s). Allowing speeds greater than
the speed of light leads to a number of conceptual and logical difficulties, such as 
the reversal of the normal order of cause and effect for some observers. 

Here again modern experimental results disagree with the classical ideas. Let’s 
go back again to our experiment with the pion, which is moving through the 
laboratory at a speed of $2.737 \times 10^{8}$ m/s. The pion decays into another particle,
called a muon, which is emitted in the forward direction (the direction of the
pion’s velocity) with a speed of $0.813 \times 10^{8}$ m/s relative to the pion. According to
Eq. 1.7, an observer in the laboratory should observe the muon to be moving with
a velocity of $2.737 \times 10^{8}$ m/s + $0.813 \times 10^{8}$ m/s = $3.550 \times 10^{8}$ m/s, exceeding
the speed of light. The observed velocity of the muon, however, is $2.846 \times
10^{8}$ m/s, below the speed of light. Clearly the classical rule for velocity addition
fails in this experiment.
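
The three classical predictions discussed in this subsection can be tabulated against the speed of light. This is a minimal sketch of the classical addition rule only, applied to the numbers quoted in the text.

```python
SPEED_OF_LIGHT = 299_792_458.0   # m/s (exact)
OBSERVED_MUON_SPEED = 2.846e8    # m/s, laboratory measurement quoted in the text


def classical_velocity_sum(v_frame, v_relative):
    """Classical addition rule for collinear velocities: simply add them."""
    return v_frame + v_relative


cases = {
    "aircraft + missile": (200.0, 250.0),
    "spaceship + missile": (2.0e8, 2.5e8),
    "pion + muon": (2.737e8, 0.813e8),
}
for label, (v_frame, v_relative) in cases.items():
    v = classical_velocity_sum(v_frame, v_relative)
    verdict = "exceeds c" if v > SPEED_OF_LIGHT else "below c"
    print(f"{label}: {v:.4g} m/s ({verdict})")

print(f"observed muon speed: {OBSERVED_MUON_SPEED:.4g} m/s")
```

Only the low-speed case stays below c; for the muon the classical prediction is both greater than c and in disagreement with the measured value.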

The properties of time and space and the rules for combining velocities are 
essential concepts of the classical physics of Newton. These concepts are derived 
from observations at low speeds, which were the only speeds available to Newton 
and his contemporaries. In Chapter 2, we shall discover how the special theory 
of relativity provides the correct procedure for comparing measurements of time, 
distance, and velocity by different observers and thereby removes the failures of 
classical physics at high speed (while reducing to the classical laws at low speed, 
where we know the Newtonian framework works very well). 


1.3 THE FAILURE OF THE CLASSICAL THEORY
OF PARTICLE STATISTICS





Thermodynamics and statistical mechanics were among the great triumphs of 
19th-century physics. Describing the behavior of complex systems of many 
particles was shown to be possible using a small number of aggregate or average 
properties—for example, temperature, pressure, and heat capacity. Perhaps the 
crowning achievement in this field was the development of relationships between 
macroscopic properties, such as temperature, and microscopic properties, such as 
the molecular kinetic energy. 

Despite these great successes, this statistical approach to understanding the 
behavior of gases and solids also showed a spectacular failure. Although the 
classical theory gave the correct heat capacities of gases at high temperatures, it 
failed miserably for many gases at low temperatures. In this section we summarize 
the classical theory and explain how it fails at low temperatures. This failure 
directly shows the inadequacy of classical physics and the need for an approach 
based on quantum theory, the second of the great theories of modern physics. 


The Distribution of Molecular Energies 


In addition to the average kinetic energy, it is also important to analyze the 
distribution of kinetic energies—that is, what fraction of the molecules in the 
container has kinetic energies between any two values $K_1$ and $K_2$. For a gas in
thermal equilibrium at absolute temperature T (in kelvins), the distribution of
molecular energies is given by the Maxwell-Boltzmann distribution:

$$N(E) = \frac{2N}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT} \tag{1.22}$$

In this equation, N is the total number of molecules (a pure number) while N(E)
is the distribution function (with units of energy$^{-1}$) defined so that N(E)dE is the
number of molecules dN in the energy interval dE at E (or, in other words, the
number of molecules with energies between E and E + dE):





dN = N(E)dE (1.23) 


The distribution N(E) is shown in Figure 1.9. The number dN is represented by the
area of the narrow strip between E and E + dE. If we divide the entire horizontal
axis into an infinite number of such small intervals and add the areas of all the
resulting narrow strips, we obtain the total number of molecules in the gas:

$$\int_0^\infty dN = \int_0^\infty N(E)\,dE = \int_0^\infty \frac{2N}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT}\,dE = N \tag{1.24}$$

The final step in this calculation involves the definite integral $\int_0^\infty x^{1/2} e^{-x}\,dx$,
which you can find in tables of integrals. Also using calculus techniques (see
Problem 8), you can show that the peak of the distribution function (the most
probable energy) is $\frac{1}{2}kT$.

FIGURE 1.9 The Maxwell-Boltzmann energy distribution function, shown for
one mole of gas at room temperature (300 K).

The average energy in this distribution of molecules can also be found by 
dividing the distribution into strips. To find the contribution of each strip to 
the energy of the gas, we multiply the number of molecules in each strip, 
dN = N(E)dE, by the energy E of the molecules in that strip, and then we add 
the contributions of all the strips by integrating over all energies. This calculation 
would give the total energy of the gas; to find the average we divide by the total 
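number of molecules N:

$$E_{\text{av}} = \frac{1}{N}\int_0^\infty E\,N(E)\,dE = \int_0^\infty \frac{2}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{3/2} e^{-E/kT}\,dE \tag{1.25}$$

Once again, the definite integral can be found in integral tables. The result of
carrying out the integration is

$$E_{\text{av}} = \tfrac{3}{2}kT \tag{1.26}$$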


Equation 1.26 gives the average energy of a molecule in the gas and agrees 
precisely with the result given by Eq. 1.20 for the ideal gas in which kinetic 
energy is the only kind of energy the gas can have. 
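
The normalization, the average energy, and the most probable energy can all be checked numerically instead of with integral tables. The sketch below works in the dimensionless variable x = E/kT and uses a simple midpoint rule; the cutoff at x = 50 and the step counts are assumptions made for the example, not values from the text.

```python
import math


def mb_density(x):
    """Maxwell-Boltzmann distribution per molecule in the variable x = E/kT.

    This is N(E) of Eq. 1.22 divided by N and multiplied by kT.
    """
    return 2 / math.sqrt(math.pi) * math.sqrt(x) * math.exp(-x)


def integrate(func, lower, upper, steps=200_000):
    """Midpoint-rule estimate of the integral of func from lower to upper."""
    width = (upper - lower) / steps
    return width * sum(func(lower + (i + 0.5) * width) for i in range(steps))


X_MAX = 50.0  # assumed cutoff in units of kT; the tail beyond it is negligible

# Eq. 1.24 divided by N: should be close to 1.
normalization = integrate(mb_density, 0.0, X_MAX)
# Eq. 1.25 divided by kT: should be close to 3/2 (Eq. 1.26).
mean_energy = integrate(lambda x: x * mb_density(x), 0.0, X_MAX)
# Peak of the distribution, located on a grid with spacing 0.001 kT.
most_probable = max((i / 1000 for i in range(1, 5001)), key=mb_density)

print(f"normalization     = {normalization:.5f}")
print(f"average energy    = {mean_energy:.5f} kT")
print(f"most probable E   = {most_probable:.3f} kT")
```

Working in units of kT makes the result independent of temperature, which is why a single calculation confirms Eq. 1.26 for any T.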

Occasionally we are interested in finding the number of molecules in our 
distribution with energies between any two values $E_1$ and $E_2$. If the interval
between $E_1$ and $E_2$ is very small, Eq. 1.23 can be used, with $dE = E_2 - E_1$ and with
N(E) evaluated at the midpoint of the interval. This approximation works very well
when the interval is small enough that N(E) is either approximately flat or linear
over the interval. If the interval is large enough that this approximation is not valid,
then it is necessary to integrate to find the number of molecules in the interval:

$$N(E_1\!:\!E_2) = \int_{E_1}^{E_2} N(E)\,dE = \int_{E_1}^{E_2} \frac{2N}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT}\,dE \tag{1.27}$$

This number is represented by the shaded area in Figure 1.9. This integral cannot
be evaluated directly and must be found numerically.




Example 1.3

(a) In one mole of a gas at a temperature of 650 K ($kT = 8.97 \times 10^{-21}$ J =
0.0560 eV), calculate the number of
molecules with energies between 0.0105 eV and 0.0135 eV.
(b) In this gas, calculate the fraction of the molecules with
energies in the range of ±2.5% of the most probable
energy ($\frac{1}{2}kT$).

FIGURE 1.10 Example 1.3.


Solution 

(a) Figure 1.10a shows the distribution N(E) in the region
between $E_1$ = 0.0105 eV and $E_2$ = 0.0135 eV. Because the
graph is very close to linear in this region, we can use Eq.
1.23 to find the number of molecules in this range. We
take dE to be the width of the range, $dE = E_2 - E_1$ =
0.0135 eV − 0.0105 eV = 0.0030 eV, and for E we use the
energy at the midpoint of the range (0.0120 eV):

$$\begin{aligned}
dN &= N(E)\,dE = \frac{2N}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT}\,dE \\
&= 2(6.022 \times 10^{23})(0.0120\ \text{eV})^{1/2}\,\pi^{-1/2}\,(0.0560\ \text{eV})^{-3/2}\, e^{-(0.0120\ \text{eV})/(0.0560\ \text{eV})}\,(0.0030\ \text{eV}) \\
&= 1.36 \times 10^{22}
\end{aligned}$$

(b) Figure 1.10b shows the distribution in this region. To
find the fraction of the molecules in this energy range, we
want dN/N. The most probable energy is $\frac{1}{2}kT$ or 0.0280 eV,
and ±2.5% of this value corresponds to ±0.0007 eV or
a range from 0.0273 eV to 0.0287 eV. The fraction is

$$\begin{aligned}
\frac{dN}{N} &= \frac{N(E)\,dE}{N} = \frac{2}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT}\,dE \\
&= 2(0.0280\ \text{eV})^{1/2}\,\pi^{-1/2}\,(0.0560\ \text{eV})^{-3/2}\, e^{-(0.0280\ \text{eV})/(0.0560\ \text{eV})}\,(0.0014\ \text{eV}) \\
&= 0.0121
\end{aligned}$$
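
The arithmetic of this example can be reproduced, and the small-interval approximation of Eq. 1.23 compared with a numerical evaluation of Eq. 1.27, with a short script. This is a sketch under the example's stated values; the function names and the choice of Simpson's rule with 1000 panels are illustrative assumptions.

```python
import math

AVOGADRO = 6.022e23   # molecules in one mole
KT_EV = 0.0560        # kT at 650 K, in eV


def n_of_e(energy_ev, n_total, kt_ev=KT_EV):
    """Eq. 1.22: number of molecules per eV at the given energy."""
    return (2 * n_total / math.sqrt(math.pi) * kt_ev**-1.5
            * math.sqrt(energy_ev) * math.exp(-energy_ev / kt_ev))


def count_midpoint(e1, e2, n_total):
    """Eq. 1.23 with N(E) evaluated at the midpoint of the interval."""
    return n_of_e((e1 + e2) / 2, n_total) * (e2 - e1)


def count_simpson(e1, e2, n_total, panels=1000):
    """Eq. 1.27 by composite Simpson's rule (panels must be even)."""
    h = (e2 - e1) / panels
    total = n_of_e(e1, n_total) + n_of_e(e2, n_total)
    for i in range(1, panels):
        total += (4 if i % 2 else 2) * n_of_e(e1 + i * h, n_total)
    return total * h / 3


# Part (a): number of molecules between 0.0105 eV and 0.0135 eV.
part_a_midpoint = count_midpoint(0.0105, 0.0135, AVOGADRO)
part_a_integral = count_simpson(0.0105, 0.0135, AVOGADRO)
# Part (b): setting n_total = 1 turns the count into a fraction.
part_b_fraction = count_midpoint(0.0273, 0.0287, 1.0)

print(f"(a) midpoint estimate : {part_a_midpoint:.3e}")
print(f"(a) Simpson integral  : {part_a_integral:.3e}")
print(f"(b) fraction          : {part_b_fraction:.4f}")
```

For an interval this narrow the midpoint estimate and the numerical integral differ by only about 0.1%, which supports the use of Eq. 1.23 in the solution.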





Note from these examples how we use a distribution function. We do not use 
Eq. 1.22 to calculate the number of molecules at a particular energy. In this way 
N(E) differs from many of the functions you have encountered previously in your
study of physics and mathematics. We always use the distribution function to 
calculate how many events occur in a certain interval of values rather than at an 
exact particular value. There are two reasons for this: (1) Asking the question in 
the form of how many molecules have a certain value of the energy implies that 
the energy is known exactly (to an infinite number of decimal places), and there 
is zero probability to find a molecule with that exact value of the energy. (2) Any 
measurement apparatus accepts a finite range of energies (or speeds) rather than 
a single exact value, and thus asking about intervals is a better representation of
what can be measured in the laboratory.

Note that N(E) has dimensions of energy$^{-1}$: it gives the number of molecules
per unit energy interval (for example, number of molecules per eV). To get an
actual number that can be compared with measurement, N(E) must be multiplied
by an energy interval. In our study of modern physics, we will encounter many
different types of distribution functions whose use and interpretation are similar
to that of N(E). These functions generally give a number or a probability per
some sort of unit interval (for example, probability per unit volume), and to use 
the distribution function to calculate an outcome we must always multiply by an 
appropriate interval (for example, an element of volume). Sometimes we will be 
able to deal with small intervals using a relationship similar to Eq. 1.23, as we did 
in Example 1.3, but in other cases we will find the need to evaluate an integral, as 
we did in Eq. 1.27. 


Polyatomic Molecules and the Equipartition 
of Energy 


So far we have been considering gases with only one atom per molecule 
(monatomic gases). For “point” molecules with no internal structure, only one 
form of energy is important: translational kinetic energy $\frac{1}{2}mv^2$. (We call this
“translational” kinetic energy because it describes motion as the gas particles 
move from one location to another. Soon we will also consider rotational kinetic 
energy.) 

Let’s rewrite Eq. 1.26 in a more instructive form by recognizing that, with
translational kinetic energy as the only form of energy, $E = K = \frac{1}{2}mv^2$. With
$v^2 = v_x^2 + v_y^2 + v_z^2$, we can write the energy as

$$E = \tfrac{1}{2}mv_x^2 + \tfrac{1}{2}mv_y^2 + \tfrac{1}{2}mv_z^2 \tag{1.28}$$

The average energy is then

$$\tfrac{1}{2}m(v_x^2)_{\text{av}} + \tfrac{1}{2}m(v_y^2)_{\text{av}} + \tfrac{1}{2}m(v_z^2)_{\text{av}} = \tfrac{3}{2}kT \tag{1.29}$$

For a gas molecule there is no difference between the x, y, and z directions, so
the three terms on the left are equal and each term is equal to $\frac{1}{2}kT$. The three
terms on the left represent three independent contributions to the energy of the
molecule; the motion in the x direction, for example, is not affected by the y or
z motions.

We define a degree of freedom of the gas as each independent contribution to 
the energy of a molecule, corresponding to one quadratic term in the expression 
for the energy. There are three quadratic terms in Eq. 1.28, so in this case there 
are three degrees of freedom. As you can see from Eq. 1.29, each of the three 
degrees of freedom of a gas molecule contributes an energy of $\frac{1}{2}kT$ to its average
energy. The relationship we have obtained in this special case is an example of 
the application of a general theorem, called the equipartition of energy theorem: 


When the number of particles in a system is large and Newtonian mechanics 
is obeyed, each molecular degree of freedom corresponds to an average 
energy of $\frac{1}{2}kT$.


The average energy per molecule is then the number of degrees of freedom times
$\frac{1}{2}kT$, and the total energy is obtained by multiplying the average energy per
molecule by the number of molecules N: $E_{\text{total}} = NE_{\text{av}}$. We will refer to this total
energy as the internal energy $E_{\text{int}}$ to indicate that it represents the random motions
of the gas molecules (in contrast, for example, to the energy involved with the
motion of the entire container of gas molecules). With only the three translational
degrees of freedom, the internal energy is

$$E_{\text{int}} = N\left(\tfrac{3}{2}kT\right) = \tfrac{3}{2}NkT = \tfrac{3}{2}nRT \quad \text{(translation only)} \tag{1.30}$$

where Eq. 1.19 has been used to express Eq. 1.30 in terms of either the number
of molecules or the number of moles.

The situation is different for a diatomic gas (two atoms per molecule), 
illustrated in Figure 1.11. There are still three degrees of freedom associated with 
the translational motion of the molecule, but now two additional forms of energy 
are permitted—rotational and vibrational. 

First we consider the rotational motion. The molecule shown in Figure 1.11 
can rotate about the x’ and y’ axes (but not about the z’ axis, because the rotational 
inertia about that axis is zero for diatomic molecules in which the atoms are treated 
as points). Using the general form of $\frac{1}{2}I\omega^2$ for the rotational kinetic energy, we
can write the energy of the molecule as

$$E = \tfrac{1}{2}mv_x^2 + \tfrac{1}{2}mv_y^2 + \tfrac{1}{2}mv_z^2 + \tfrac{1}{2}I_{x'}\omega_{x'}^2 + \tfrac{1}{2}I_{y'}\omega_{y'}^2 \tag{1.31}$$

Here we have 5 quadratic terms in the energy, and thus 5 degrees of freedom.
According to the equipartition theorem, the average total energy per molecule is
$5 \times \frac{1}{2}kT = \frac{5}{2}kT$, and the total internal energy of n moles of the gas is

$$E_{\text{int}} = \tfrac{5}{2}nRT \quad \text{(translation + rotation)} \tag{1.32}$$



If the molecule can also vibrate, we can imagine the rigid rod connecting the 
atoms in Figure 1.11 to be replaced by a spring. The two atoms can then vibrate 
in opposite directions along the z’ axis, with the center of mass of the molecule 
remaining fixed. The vibrational motion adds two quadratic terms to the energy, 
corresponding to the vibrational potential energy ($\frac{1}{2}kz'^2$) and the vibrational
kinetic energy ($\frac{1}{2}mv_{z'}^2$). Including the vibrational motion, there are now 7 degrees
of freedom, so that

$$E_{\text{int}} = \tfrac{7}{2}nRT \quad \text{(translation + rotation + vibration)} \tag{1.33}$$


Heat Capacities of an Ideal Gas


Now we examine where the classical molecular distribution theory, which gives 
a very good accounting of molecular behavior under most circumstances, fails to 
agree with one particular class of experiments. Suppose we have a container of 
gas with a fixed volume. We transfer energy to the gas, perhaps by placing the 
container in contact with a system at a higher temperature. All of this transferred 
energy increases the internal energy of the gas by an amount $\Delta E_{\text{int}}$, and there is
an accompanying increase in temperature $\Delta T$.
We define the molar heat capacity for this constant-volume process as

$$C_V = \frac{\Delta E_{\text{int}}}{n\,\Delta T} \tag{1.34}$$

(The subscript V reminds us that we are doing this measurement at constant 
volume.) From Eqs. 1.30, 1.32, and 1.33, we see that the molar heat capacity 
depends on the type of gas: 





$$\begin{aligned}
C_V &= \tfrac{3}{2}R \quad \text{(monatomic or nonrotating, nonvibrating diatomic ideal gas)} \\
C_V &= \tfrac{5}{2}R \quad \text{(rotating diatomic ideal gas)} \\
C_V &= \tfrac{7}{2}R \quad \text{(rotating and vibrating diatomic ideal gas)}
\end{aligned} \tag{1.35}$$

FIGURE 1.11 A diatomic molecule, with the origin at the center of mass.
Rotations can occur about the x′ and y′ axes, and vibrations can occur along
the z′ axis.



When we add energy to the gas, the equipartition theorem tells us that the added 
energy will on the average be distributed uniformly among all the possible forms 
of energy (corresponding to the number of degrees of freedom). However, only 
the translational kinetic energy contributes to the temperature (as shown by Eqs. 
1.20 and 1.21). Thus, if we add 7 units of energy to a diatomic gas with rotating 
and vibrating molecules, on the average only 3 units go into translational kinetic 
energy and so 3/7 of the added energy goes into increasing the temperature. 
(To measure the temperature rise, the gas molecules must collide with the 
thermometer, so energy in the rotational and vibrational motions is not recorded 
by the thermometer.) Put another way, to obtain the same temperature increase 
$\Delta T$, a mole of diatomic gas requires 7/3 times the energy that is needed for a
mole of monatomic gas. 
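
The classical predictions for the three cases follow directly from counting degrees of freedom. The sketch below evaluates the internal energy, the molar heat capacity, and the share of added energy that goes into translational motion; the dictionary labels and function names are illustrative.

```python
GAS_CONSTANT = 8.315  # R, in J/(mol*K)

DEGREES_OF_FREEDOM = {
    "translation only": 3,
    "translation + rotation": 5,
    "translation + rotation + vibration": 7,
}


def internal_energy(dof, n_moles, temperature_k):
    """Equipartition: E_int = (dof/2) n R T, in joules (Eqs. 1.30, 1.32, 1.33)."""
    return dof / 2 * n_moles * GAS_CONSTANT * temperature_k


def molar_heat_capacity(dof):
    """C_V = (dof/2) R, in J/(mol*K), from Eq. 1.34 applied to E_int."""
    return dof / 2 * GAS_CONSTANT


def translational_share(dof):
    """Fraction of added energy that goes into translational kinetic energy."""
    return 3 / dof


for label, dof in DEGREES_OF_FREEDOM.items():
    print(f"{label}: C_V = {molar_heat_capacity(dof):.1f} J/(mol*K), "
          f"translational share = {translational_share(dof):.3f}")
```

The ratio of the largest to the smallest heat capacity is 7/3, matching the factor quoted above for the energy needed to produce the same temperature rise.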


Comparison with Experiment How well do these heat capacity values 
agree with experiment? For monatomic gases, the agreement is very good. The 
equipartition theorem predicts a value of $C_V = 3R/2 = 12.5$ J/mol·K, which
should be the same for all monatomic gases and the same at all temperatures (as
long as the conditions of the ideal gas model are fulfilled). The heat capacity
of He gas is 12.5 J/mol·K at 100 K, 300 K (room temperature), and 1000 K, so
in this case our calculation is in perfect agreement with experiment. Other inert 
gases (Ne, Ar, Xe, etc.) have identical values, as do vapors of metals (Cu, Na, Pb, 
Bi, etc.) and the monatomic (dissociated) state of elements that normally form 
diatomic molecules (H, N, O, Cl, Br, etc.). So over a wide variety of different 
elements and a wide range of temperatures, classical statistical mechanics is in 
excellent agreement with experiment. 

The situation is much less satisfactory for diatomic molecules. For a rotating 
and vibrating diatomic molecule, the classical calculation gives $C_V = 7R/2 =
29.1$ J/mol·K. Table 1.1 shows some values of the heat capacities for different
diatomic gases over a range of temperatures. 


TABLE 1.1 Heat Capacities of Diatomic Gases

                          $C_V$ (J/mol·K)
Element       100 K         300 K         1000 K
H$_2$         18.7          20.5          [illegible]
N$_2$         20.8          20.8          24.4
O$_2$         20.8          21.1          26.5
F$_2$         20.8          23.0          28.8
Cl$_2$        21.0          25.6          29.1
Br$_2$        22.6          27.8          [illegible]
I$_2$         24.8          28.6          [illegible]
Sb$_2$                      28.1          29.0
Te$_2$                      [illegible]   29.0
Bi$_2$                      28.6          29.1

At high temperatures, many of the diatomic gases do indeed approach the
expected value of 7R/2, but at lower temperatures the values are much smaller.
For example, fluorine seems to behave as if it has 5 degrees of freedom
($C_V$ = 20.8 J/mol·K) at 100 K and 7 degrees of freedom ($C_V$ = 29.1 J/mol·K)
at 1000 K.

Hydrogen behaves as if it has 5 degrees of freedom at room temperature,
but at high enough temperature (3000 K), the heat capacity of H$_2$ approaches
29.1 J/mol·K, corresponding to 7 degrees of freedom, while at lower temperatures
(40 K) the heat capacity is 12.5 J/mol·K, corresponding to 3 degrees of freedom.
The temperature dependence of the heat capacity of H$_2$ is shown in Figure 1.12.
There are three plateaus in the graph, corresponding to heat capacities for 3, 5, and 
7 degrees of freedom. At the lowest temperatures, the rotational and vibrational 
motions are “frozen” and do not contribute to the heat capacity. At about 100 K, 
the molecules have enough energy to allow rotational motion to occur, and by 
about 300 K the heat capacity is characteristic of 5 degrees of freedom. Starting 
about 1000 K, the vibrational motion can occur, and by about 3000 K there are 
enough molecules above the vibrational threshold to allow 7 degrees of freedom. 
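
One way to read the measured values is to invert $C_V = (f/2)R$ and ask how many degrees of freedom f a gas appears to have at each temperature. The sketch below does this for three rows of Table 1.1; the helper name and the choice of rows are illustrative.

```python
GAS_CONSTANT = 8.315  # R, in J/(mol*K)


def effective_degrees_of_freedom(cv):
    """Invert C_V = (f/2) R to get the apparent number of degrees of freedom."""
    return 2 * cv / GAS_CONSTANT


# Molar heat capacities in J/(mol*K) from Table 1.1, keyed by temperature in K.
MEASURED_CV = {
    "N2": {100: 20.8, 300: 20.8, 1000: 24.4},
    "F2": {100: 20.8, 300: 23.0, 1000: 28.8},
    "Cl2": {100: 21.0, 300: 25.6, 1000: 29.1},
}

for gas, values in MEASURED_CV.items():
    summary = ", ".join(
        f"{temperature} K: f = {effective_degrees_of_freedom(cv):.2f}"
        for temperature, cv in values.items()
    )
    print(f"{gas}: {summary}")
```

Values of f between 5 and 7 correspond to temperatures at which only some of the molecules have enough energy to vibrate, so the apparent number of degrees of freedom is not an integer.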

What’s going on here? The classical calculation demands that $C_V$ should be
constant, independent of the type of gas or the temperature. The equipartition 
of energy theorem, which is very successful in predicting many thermodynamic 
properties, fails miserably in accounting for the heat capacities. This theorem 
requires that the energy added to a gas must on average be divided equally among 
all the different forms of energy, and classical physics does not permit a threshold 
energy for any particular type of motion. How is it possible for 2 degrees of 
freedom, corresponding to the rotational or vibrational motions, to be “turned on” 
as the temperature is increased? 

The solution to this dilemma can be found in quantum mechanics, according 
to which there is indeed a minimum or threshold energy for the rotational and 
vibrational motions. We discuss this behavior in Chapters 5 and 9. In Chapter 11 
we discuss the failure of the equipartition theorem to account for the heat capacities 
of solids and the corresponding need to replace the classical Maxwell-Boltzmann 
energy distribution function with a different distribution that is consistent with 
quantum mechanics.

FIGURE 1.12 The heat capacity of molecular hydrogen at different temperatures.
The data points disagree with the classical prediction.


1.4 THEORY, EXPERIMENT, LAW


When you first began to study science, perhaps in your elementary or high school 
years, you may have learned about the “scientific method,” which was supposed 
to be a sort of procedure by which scientific progress was achieved. The basic 
idea of the “scientific method” was that, on reflecting over some particular aspect 
of nature, the scientist would invent a hypothesis or theory, which would then be 
tested by experiment and if successful would be elevated to the status of law. This
procedure is meant to emphasize the importance of doing experiments as a way of 
testing hypotheses and rejecting those that do not pass the tests. For example, the 
ancient Greeks had some rather definite ideas about the motion of objects, such as 
projectiles, in the Earth’s gravity. Yet they tested none of these by experiment, so 
convinced were they that the power of logical deduction alone could be used to 
discover the hidden and mysterious laws of nature and that once logic had been 
applied to understanding a problem, no experiments were necessary. If theory 
and experiment were to disagree, they would argue, then there must be something 
wrong with the experiment! This dominance of analysis and faith was so pervasive 
that it was another 2000 years before Galileo, using an inclined plane and a crude 
timer (equipment surely within the abilities of the early Greeks to construct), dis- 
covered the laws of motion, which were later organized and analyzed by Newton. 

In the case of modern physics, none of the fundamental concepts is obvious 
from reason alone. Only by doing often difficult and necessarily precise experi- 
ments do we learn about these unexpected and fascinating effects associated with 
such modern physics topics as relativity and quantum physics. These experiments 
have been done to unprecedented levels of precision—of the order of one part 
in 10° or better—and it can certainly be concluded that modern physics was 
tested far better in the 20th century than classical physics was tested in all of the 
preceding centuries. 

Nevertheless, there is a persistent and often perplexing problem associated 
with modern physics, one that stems directly from your previous acquaintance 
with the “scientific method.” This concerns the use of the word “theory,” as in 
“theory of relativity” or “quantum theory,” or even “atomic theory” or “theory 
of evolution.” There are two contrasting and conflicting definitions of the word 
“theory” in the dictionary: 


1. A hypothesis or guess. 
2. An organized body of facts or explanations. 


The “scientific method” refers to the first kind of “theory,” while when we 
speak of the “theory of relativity” we refer to the second kind. Yet there is 
often confusion between the two definitions, and therefore relativity and quantum 
physics are sometimes incorrectly regarded as mere hypotheses, on which evidence 
is still being gathered, in the hope of someday submitting that evidence to some sort 
of international (or intergalactic) tribunal, which in turn might elevate the “theory” 
into a “law.” Thus the “theory of relativity” might someday become the “law of 
relativity,” like the “law of gravity.” Nothing could be further from the truth! 

The theory of relativity and the quantum theory, like the atomic theory or the 
theory of evolution, are truly “organized bodies of facts and explanations” and 
not “hypotheses.” There is no question of these “theories” becoming “laws”—the 
“facts” (experiments, observations) of relativity and quantum physics, like those 
of atomism or evolution, are accepted by virtually all scientists today. The 
experimental evidence for all of these processes is so compelling that no one who 
approaches them in the spirit of free and open inquiry can doubt the observational 
evidence or their inferences. Whether these collections of evidence are called 
theories or laws is merely a question of semantics and has nothing to do with their 
scientific merits. Like all scientific principles, they will continue to develop and 
change as new discoveries are made; that is the essence of scientific progress. 

Under what conditions can you apply the law of conservation 
of energy? Conservation of linear momentum? Conservation 
of angular momentum? 

Which of the conserved quantities are scalars and which are 
vectors? Is there a difference in how we apply conservation 
laws for scalar and vector quantities? 

What other conserved quantities (besides energy, linear 
momentum, and angular momentum) can you name? 

What is the difference between potential and potential 
energy? Do they have different dimensions? Different units? 

In Section 1.1 we defined the electric force between two 
charges and the magnetic field of a current. Use these quantities 
to define the electric field of a single charge and the 
magnetic force on a moving electric charge. 

Other than from the ranges of wavelengths shown in 
Figure 1.7, can you think of a way to distinguish radio 
waves from infrared waves? Visible from infrared? That is, 
could you design a radio that could be tuned to infrared 
waves? Could living beings “see” in the infrared region? 

Suppose we have a mixture of an equal number N of 
molecules of two different gases, whose molecular masses 
are $m_1$ and $m_2$, in complete thermal equilibrium at temperature 
T. How do the distributions of molecular energies of the 
two gases compare? How do their average kinetic energies 
per molecule compare? 

In most gases (as in the case of hydrogen) the rotational 
motion begins to occur at a temperature well below the 
temperature at which vibrational motion occurs. What does 
this tell us about the properties of the gas molecules? 

Suppose it were possible for a pitcher to throw a baseball 
faster than the speed of light. Describe how the flight of the 
ball from the pitcher’s hand to the catcher’s glove would 
look to the umpire standing behind the catcher. 




At low temperatures the molar heat capacity of carbon dioxide 
(CO$_2$) is about 5R/2, and it rises to about 7R/2 at 
room temperature. However, unlike the gases discussed in 
Section 1.3, the heat capacity of CO$_2$ continues to rise as the 
temperature increases, reaching 11R/2 at 1000 K. How can 
you explain this behavior? 

If we double the temperature of a gas, is the number of 
molecules in a narrow interval dE around the most probable 
energy about the same, double, or half what it was at the 
original temperature? 


1.1 Review of Classical Physics 


1. A hydrogen atom ($m = 1.674 \times 10^{-27}$ kg) is moving with 
a velocity of $1.1250 \times 10^{7}$ m/s. It collides elastically with 
a helium atom ($m = 6.646 \times 10^{-27}$ kg) at rest. After the 
collision, the hydrogen atom is found to be moving with 
a velocity of $-6.724 \times 10^{6}$ m/s (in a direction opposite to 
its original motion). Find the velocity of the helium atom 
after the collision in two different ways: (a) by applying 
conservation of momentum; (b) by applying conservation 
of energy. 

2. A helium atom ($m = 6.6465 \times 10^{-27}$ kg) collides elastically 
with an oxygen atom ($m = 2.6560 \times 10^{-26}$ kg) at rest. After 
the collision, the helium atom is observed to be moving with 
a velocity of $6.636 \times 10^{6}$ m/s in a direction at an angle of 
84.7° relative to its original direction. The oxygen atom is 
observed to move at an angle of −40.4°. (a) Find the speed 
of the oxygen atom. (b) Find the speed of the helium atom 
before the collision. 

3. A beam of helium-3 atoms (m = 3.016 u) is incident on a 
target of nitrogen-14 atoms (m = 14.003 u) at rest. During 
the collision, a proton from the helium-3 nucleus passes to 
the nitrogen nucleus, so that following the collision there 
are two atoms: an atom of “heavy hydrogen” (deuterium, 
m = 2.014 u) and an atom of oxygen-15 (m = 15.003 u). 
The incident helium atoms are moving at a velocity of 
$6.346 \times 10^{6}$ m/s. After the collision, the deuterium atoms 
are observed to be moving forward (in the same direction as 
the initial helium atoms) with a velocity of $1.531 \times 10^{7}$ m/s. 
(a) What is the final velocity of the oxygen-15 atoms? 
(b) Compare the total kinetic energies before and after the 
collision. 

4. An atom of beryllium (m = 8.00 u) splits into two atoms of 
helium (m = 4.00 u) with the release of 92.2 keV of energy. 
If the original beryllium atom is at rest, find the kinetic 
energies and speeds of the two helium atoms. 

5. A 4.15-volt battery is connected across a parallel-plate 
capacitor. Illuminating the plates with ultraviolet light causes 
electrons to be emitted from the plates with a speed of 
$1.76 \times 10^{6}$ m/s. (a) Suppose electrons are emitted near the 
center of the negative plate and travel perpendicular to that 
plate toward the opposite plate. Find the speed of the electrons 
when they reach the positive plate. (b) Suppose instead 
that electrons are emitted perpendicular to the positive plate. 
Find their speed when they reach the negative plate. 


1.2 The Failure of Classical Concepts of Space and Time 


6. Observer A, who is at rest in the laboratory, is studying a 
particle that is moving through the laboratory at a speed of 
0.624c and determines its lifetime to be 159 ns. (a) Observer 
A places markers in the laboratory at the locations where 
the particle is produced and where it decays. How far apart 
are those markers in the laboratory? (b) Observer B, who 
is traveling parallel to the particle at a speed of 0.624c, 
observes the particle to be at rest and measures its lifetime to 
be 124 ns. According to B, how far apart are the two markers 
in the laboratory? 


1.3 The Failure of the Classical Theory of Particle Statistics 


7. A sample of argon gas is in a container at 35.0°C and 
1.22 atm pressure. The radius of an argon atom (assumed 
spherical) is $0.710 \times 10^{-10}$ m. Calculate the fraction of the 
container volume actually occupied by the atoms. 

8. By differentiating the expression for the Maxwell-Boltzmann 
energy distribution, show that the peak of the 
distribution occurs at an energy of $\frac{1}{2}kT$. 

9. A container holds N molecules of nitrogen gas at T = 280 K. 
Find the number of molecules with kinetic energies between 
0.0300 eV and 0.0312 eV. 

10. A sample of 2.37 moles of an ideal diatomic gas experiences 
a temperature increase of 65.2 K at constant volume. 
(a) Find the increase in internal energy if only translational 
and rotational motions are possible. (b) Find the increase 
in internal energy if translational, rotational, and vibrational 
motions are possible. (c) How much of the energy calculated 
in (a) and (b) is translational kinetic energy? 


General Problems 


11. An atom of mass $m_1 = m$ moving in the x direction with 
speed $v_1 = v$ collides elastically with an atom of mass 
$m_2 = 3m$ at rest. After the collision the first atom moves in 
the y direction. Find the direction of motion of the second 
atom and the speeds of both atoms (in terms of v) after the 
collision. 

12. An atom of mass $m_1 = m$ moves in the positive x direction 
with speed $v_1 = v$. It collides with and sticks to an atom of 
mass $m_2 = 2m$ moving in the positive y direction with speed 
$v_2 = 2v/3$. Find the resultant speed and direction of motion 
of the combination, and find the kinetic energy lost in this 
inelastic collision. 

13. Suppose the beryllium atom of Problem 4 were not at rest, 
but instead moved in the positive x direction and had a 
kinetic energy of 40.0 keV. One of the helium atoms is 
found to be moving in the positive x direction. Find the 
direction of motion of the second helium, and find the velocity 
of each of the two helium atoms. Solve this problem in 
two different ways: (a) by direct application of conservation 
of momentum and energy; (b) by applying the results of 
Problem 4 to a frame of reference moving with the original 
beryllium atom and then switching to the reference frame in 
which the beryllium is moving. 

14. Suppose the beryllium atom of Problem 4 moves in the positive 
x direction and has kinetic energy 60.0 keV. One helium 
atom is found to move at an angle of 30° with respect to the 
x axis. Find the direction of motion of the second helium 
atom and find the velocity of each helium atom. Work this 
problem in two ways as you did the previous problem. (Hint: 
Consider one helium to be emitted with velocity components 
$v_x$ and $v_y$ in the beryllium rest frame. What is the relationship 
between $v_x$ and $v_y$? How do $v_x$ and $v_y$ change when we move 
in the x direction at speed v?) 

15. A gas cylinder contains argon atoms (m = 40.0 u). The temperature 
is increased from 293 K (20°C) to 373 K (100°C). 
(a) What is the change in the average kinetic energy per 
atom? (b) The container is resting on a table in the Earth’s 
gravity. Find the change in the vertical position of the container 
that produces the same change in the average energy 
per atom found in part (a). 

16. Calculate the fraction of the molecules in a gas that are 
moving with translational kinetic energies between 0.02kT 
and 0.04kT. 

17. For a molecule of O$_2$ at room temperature (300 K), calculate 
the average angular velocity for rotations about the x’ or y’ 
axes. The distance between the O atoms in the molecule is 
0.121 nm. 








THE SPECIAL THEORY OF 
RELATIVITY 


This 12-foot tall statue of Albert Einstein is located at the headquarters of the National 
Academy of Sciences in Washington DC. The page in his hand shows three equations that 
he discovered: the fundamental equation of general relativity, which revolutionized our 
understanding of gravity; the equation for the photoelectric effect, which opened the path to 
the development of quantum mechanics; and the equation for mass-energy equivalence, 
which is the cornerstone of his special theory of relativity. 

Einstein’s special theory of relativity and Planck’s quantum theory burst forth 
on the physics scene almost simultaneously during the first decade of the 20th 
century. Both theories caused profound changes in the way we view our universe 
at its most fundamental level. 

In this chapter we study the special theory of relativity.* This theory has 
a completely undeserved reputation as being so exotic that few people can 
understand it. On the contrary, special relativity is basically a system of kinematics 
and dynamics, based on a set of postulates that are different from those of classical 
physics. The resulting formalism is not much more complicated than Newton’s 
laws, but it does lead to several predictions that seem to go against our common 
sense. Even so, the special theory of relativity has been carefully and thoroughly 
tested by experiment and found to be correct in all its predictions. 

We first review the classical relativity of Galileo and Newton, and then we show 
why Einstein proposed to replace it. We then discuss the mathematical aspects of 
special relativity, the predictions of the theory, and finally the experimental tests. 


2.1 CLASSICAL RELATIVITY 


A “theory of relativity” is in effect a way for observers in different frames of 
reference to compare the results of their observations. For example, consider 
an observer in a car parked by a highway near a large rock. To this observer, 
the rock is at rest. Another observer, who is moving along the highway in a car, 
sees the rock rush past as the car drives by. To this observer, the rock appears 
to be moving. A theory of relativity provides the conceptual framework and 
mathematical tools that enable the two observers to transform a statement such 
as “rock is at rest” in one frame of reference to the statement “rock is in motion” 
in another frame of reference. More generally, relativity gives a means for 
expressing the laws of physics in different frames of reference. 

The mathematical basis for comparing the two descriptions is called a transformation. 
Figure 2.1 shows an abstract representation of the situation. Two observers 
O and O’ are each at rest in their own frames of reference but move relative to one 
another with constant velocity u. (O and O’ refer both to the observers and their 
reference frames or coordinate systems.) They observe the same event, which 
happens at a particular point in space and a particular time, such as a collision 
between two particles. According to O, the space and time coordinates of the 
event are x, y, z, t, while according to O’ the coordinates of the same event are x’, 
y’, z’, t’. The two observers use calibrated meter sticks and synchronized clocks, 
so any differences between the coordinates of the two events are due to their 
different frames of reference and not to the measuring process. We simplify the 
discussion by assuming that the relative velocity u always lies along the common 
xx’ direction, as shown in Figure 2.1, and we let u represent the velocity of O’ as 
measured by O (and thus O’ would measure velocity −u for O). 

FIGURE 2.1 Two observers O and O’ observe the same event. O’ moves relative 
to O with a constant velocity u. 

*The general theory of relativity, which is covered briefly in Chapter 15, deals with “curved” 
coordinate systems, in which gravity is responsible for the curvature. Here we discuss the special case 
of the more familiar “flat” coordinate systems. 

In this discussion we make a particular choice of the kind of reference frames 
inhabited by O and O’. We assume that each observer has the capacity to test 
Newton’s laws and finds them to hold in that frame of reference. For example, each 
observer finds that an object at rest or moving with a constant velocity remains 
in that state unless acted upon by an external force (Newton’s first law, the law 
of inertia). Such frames of reference are called inertial frames. An observer in 
interstellar space floating in a nonrotating rocket with the engines off would be in 
an inertial frame of reference. An observer at rest on the surface of the Earth is 
not in an inertial frame, because the Earth is rotating about its axis and orbiting 
about the Sun; however, the accelerations associated with those motions are so 
small that we can usually regard our reference frame as approximately inertial. 
(The noninertial reference frame at the Earth’s surface does produce important 
and often spectacular effects, such as the circulation of air around centers of high 
or low pressure.) An observer in an accelerating car, a rotating merry-go-round, 
or a descending roller coaster is not in an inertial frame of reference! 

We now derive the classical or Galilean transformation that relates the 
coordinates x, y, z, t to x’, y’, z’, t’. We assume as a postulate of classical physics 
that t = t’, that is, time is the same for all observers. We also assume for simplicity 
that the coordinate systems are chosen so that their origins coincide at t = 0. 
Consider an object in O’ at the coordinates x’, y’, z’ (Figure 2.2). According to O, 
the y and z coordinates are the same as those in O’. Along the x direction, O would 
observe the object at x = x’ + ut. We therefore have the Galilean coordinate 
transformation 

$$x' = x - ut \qquad y' = y \qquad z' = z \tag{2.1}$$

To find the velocities of the object as observed by O and O’, we take the derivatives 
of these expressions with respect to t’ on the left and with respect to t on the 
right (which we can do because we have assumed t’ = t). This gives the Galilean 
velocity transformation 

$$v'_x = v_x - u \qquad v'_y = v_y \qquad v'_z = v_z \tag{2.2}$$

In a similar fashion, we can take the derivatives of Eq. 2.2 with respect to time 
and obtain relationships between the accelerations 

$$a'_x = a_x \qquad a'_y = a_y \qquad a'_z = a_z \tag{2.3}$$

Equation 2.3 shows again that Newton’s laws hold for both observers. As long as 
u is constant (du/dt = 0), the observers measure identical accelerations and agree 
on the results of applying F = ma. 
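As a minimal sketch of Eqs. 2.1 and 2.2, the short Python fragment below applies the Galilean transformation to the coordinates of an event and to the velocity of an object. The names `Event`, `galilean_coordinates`, and `galilean_velocity`, and the numerical values, are illustrative assumptions and do not come from the text.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Event:
    """Space and time coordinates of one event in a given frame."""
    x: float
    y: float
    z: float
    t: float


def galilean_coordinates(event: Event, u: float) -> Event:
    """Eq. 2.1: coordinates in O' from those in O (u along the common x axis)."""
    return Event(event.x - u * event.t, event.y, event.z, event.t)


def galilean_velocity(v: tuple, u: float) -> tuple:
    """Eq. 2.2: velocity components in O' from those in O."""
    vx, vy, vz = v
    return (vx - u, vy, vz)


# Illustrative values (assumed): O' moves at u = 2 m/s relative to O.
u = 2.0
in_O = Event(x=10.0, y=2.0, z=3.0, t=4.0)
in_O_prime = galilean_coordinates(in_O, u)
print(in_O_prime)                             # x' = 10 - 2*4 = 2; y, z, t unchanged
print(galilean_velocity((5.0, 1.0, 0.0), u))  # only the x component changes
```

Only the x coordinate and the x component of velocity change, and the time is copied over unchanged, which is the classical postulate t’ = t.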
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FIGURE 2.2 An object or event at 
point P is at coordinates x’, y’, z’ with 
respect to O’. The x coordinate measured 
by O is x = x’ + ut. The y and z 
coordinates in O are the same as those 
in O’. 


Example 2.1 

Two cars are traveling at constant speed along a road in the 
same direction. Car A moves at 60 km/h and car B moves 
at 40 km/h, each measured relative to an observer on the 
ground (Figure 2.3a). What is the speed of car A relative 
to car B? 


Solution 

Let O be the observer on the ground, who observes car A 
to move at $v_x = 60$ km/h. Assume O’ to be moving with 
car B at u = 40 km/h. Then 

$$v'_x = v_x - u = 60\ \text{km/h} - 40\ \text{km/h} = 20\ \text{km/h}$$

Figure 2.3b shows the situation as observed by O’. 

FIGURE 2.3 Example 2.1. (a) As observed by O at rest on the 
ground. (b) As observed by O’ in car B. 


Example 2.2 





An airplane is flying due east relative to still air at a speed 
of 320km/h. There is a 65 km/h wind blowing toward 
the north, as measured by an observer on the ground. 
What is the velocity of the plane measured by the ground 
observer? 


Solution 

Let O be the observer on the ground, and let O’ be an 
observer who is moving with the wind, for example a 
balloonist (Figure 2.4). Then u = 65 km/h, and (because 
our equations are set up with u in the xx’ direction) we 
must choose the xx’ direction to be to the north. In this 
case we know the velocity with respect to O’; taking the y 
direction to the east, we have $v'_x = 0$ and $v'_y = 320$ km/h. 
Using Eq. 2.2 we obtain 

$$v_x = v'_x + u = 0 + 65\ \text{km/h} = 65\ \text{km/h}$$

$$v_y = v'_y = 320\ \text{km/h}$$

Relative to the ground, the plane flies in a direction 
determined by $\phi = \tan^{-1}[(65\ \text{km/h})/(320\ \text{km/h})] = 11.5^\circ$, 
or 11.5° north of east. 

FIGURE 2.4 Example 2.2. As observed by O at rest on the 
ground, the balloon drifts north with the wind, while the plane 
flies north of east. 
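The arithmetic of Examples 2.1 and 2.2 can be checked with a few lines of Python. This is only a sketch: the function names are hypothetical, and the ground speed of the plane is an extra quantity computed here that the examples do not ask for.

```python
import math


def velocity_in_O(v_prime, u):
    """Invert Eq. 2.2: ground-frame (O) components from those in O'.

    v_prime is (v'_x, v'_y); u is the velocity of O' along the x axis of O.
    """
    vx_prime, vy_prime = v_prime
    return (vx_prime + u, vy_prime)


def velocity_in_O_prime(v, u):
    """Eq. 2.2: components in O' from those in O."""
    vx, vy = v
    return (vx - u, vy)


# Example 2.1: car A at 60 km/h seen from car B (u = 40 km/h).
car_a_from_b = velocity_in_O_prime((60.0, 0.0), 40.0)

# Example 2.2: x axis points north (along the wind), y axis points east.
plane_vx, plane_vy = velocity_in_O((0.0, 320.0), 65.0)
ground_speed = math.hypot(plane_vx, plane_vy)
angle_north_of_east = math.degrees(math.atan2(plane_vx, plane_vy))

print(car_a_from_b[0])                 # relative speed in km/h
print(round(angle_north_of_east, 1))   # heading in degrees north of east
print(round(ground_speed, 1))          # ground speed in km/h
```

The first two printed values reproduce the 20 km/h and 11.5° found in the examples.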





Example 2.3 





A swimmer capable of swimming at a speed c in still water 
is swimming in a stream in which the current is u (which 
we assume to be less than c). Suppose the swimmer swims 
upstream a distance L and then returns downstream to the 
starting point. Find the time necessary to make the round 
trip, and compare it with the time to swim across the stream 
a distance L and return. 


Solution 

Let the frame of reference of O be the ground and 
the frame of reference of O’ be the water, moving at 
speed u (Figure 2.5a). The swimmer always moves at 
speed c relative to the water, and thus $v'_x = -c$ for the 
upstream swim. (Remember that u always defines the 
positive x direction.) According to Eq. 2.2, $v'_x = v_x - u$, 
so $v_x = v'_x + u = u - c$. (As expected, the velocity relative 
to the ground has magnitude smaller than c; it 
is also negative, since the swimmer is swimming in 
the negative x direction, so $|v_x| = c - u$.) Therefore, 
$t_{\rm up} = L/(c - u)$. For the downstream swim, $v'_x = c$, so 
$v_x = u + c$, $t_{\rm down} = L/(c + u)$, and the total time is 

$$t = \frac{L}{c+u} + \frac{L}{c-u} = \frac{2Lc}{c^2-u^2} = \frac{2L}{c}\,\frac{1}{1-u^2/c^2} \tag{2.4}$$

To swim directly across the stream, the swimmer’s efforts 
must be directed somewhat upstream to counter the effect 
of the current (Figure 2.5b). That is, in the frame of reference 
of O we would like to have $v_x = 0$, which requires 
$v'_x = -u$ according to Eq. 2.2. Since the speed relative to 
the water is always c, $\sqrt{v'^2_x + v'^2_y} = c$; thus $v'_y = \sqrt{c^2 - v'^2_x} 
= \sqrt{c^2 - u^2}$, and, because $v_y = v'_y$, the round-trip time is 

$$t = \frac{2L}{\sqrt{c^2-u^2}} = \frac{2L}{c}\,\frac{1}{\sqrt{1-u^2/c^2}} \tag{2.5}$$

Notice the difference in form between this result and the 
result for the upstream-downstream swim, Eq. 2.4. 

FIGURE 2.5 Example 2.3. The motion of a swimmer as seen 
by observer O at rest on the bank of the stream. Observer O’ 
moves with the stream at speed u. 
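To see how the two round-trip times of Example 2.3 differ in practice, the following Python sketch evaluates both for one set of numbers. The values of L, c, and u are assumed purely for illustration, and the function names are hypothetical.

```python
import math


def upstream_downstream_time(L, c, u):
    """Eq. 2.4: time to swim a distance L upstream and back downstream."""
    return L / (c - u) + L / (c + u)


def cross_stream_time(L, c, u):
    """Round trip straight across a stream of width L and back."""
    return 2 * L / math.sqrt(c**2 - u**2)


# Assumed illustrative values: L = 100 m, swimmer speed 1.0 m/s, current 0.6 m/s.
L, c, u = 100.0, 1.0, 0.6
t_along = upstream_downstream_time(L, c, u)
t_across = cross_stream_time(L, c, u)
print(t_along, t_across, t_along / t_across)
```

For any u smaller than c the upstream-downstream trip takes longer, and the ratio of the two times is $1/\sqrt{1-u^2/c^2}$, which depends only on u/c and not on L.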





2.2 THE MICHELSON-MORLEY EXPERIMENT 


We have seen how Newton’s laws remain valid with respect to a Galilean transfor- 
mation that relates the description of the motion of an object in one reference frame 
to that in another reference frame. It is then interesting to ask whether the same 
transformation rules apply to the motion of a light beam. According to the Galilean 
transformation, a light beam moving relative to observer O’ in the x’ direction at 
speed c = 299,792,458 m/s would have a speed of c + u relative to O. Direct 
high-precision measurements of the speed of light beams have become possible in recent 
years (as we discuss later in this chapter), but in the 19th century it was necessary 
to devise a more indirect measurement of the speed of light according to different 
observers in relative motion. 

Suppose the swimmer in Example 2.3 is replaced by a light beam. Observer 
O’ is in a frame of reference in which the speed of light is c, and the frame of 
reference of observer O’ is in motion relative to observer O. What is the speed 
of light as measured by observer O? If the Galilean transformation is correct, we 
should expect to see a difference between the speed of the light beam according 
to O and O’ and therefore a time difference between the upstream-downstream 
and cross-stream times, as in Example 2.3. 

Albert A. Michelson (1852-1931, United States). He spent 50 years 
doing increasingly precise experiments with light, for which he became 
the first U.S. citizen to win the Nobel Prize in physics (1907). 




























FIGURE 2.6 (Top) Beam diagram of 
Michelson interferometer. Light from 
source S is split at A by the half-silvered 
mirror; one part is reflected by the 
mirror at B and the other is reflected 
at C. The beams are then recom- 
bined for observation of the interfer- 
ence. (Bottom) Michelson’s appara- 
tus. To improve sensitivity, the beams 
were reflected to travel each leg of the 
apparatus eight times, rather than just 
twice. To reduce vibrations from the 
surroundings, the interferometer was 
mounted on a 1.5-m square stone slab 
floating in a pool of mercury. 


Physicists in the 19th century postulated just such a situation: a preferred frame 
of reference in which the speed of light has the precise value of c and other frames in 
relative motion in which the speed of light would differ, according to the Galilean 
transformation. The preferred frame, like that of observer O’ in Example 2.3, is 
one that is at rest with respect to the medium in which light propagates at c (like 
the water of that example). What is the medium of propagation for light waves? It 
was inconceivable to physicists of the 19th century that a wave disturbance could 
propagate without a medium (consider mechanical waves such as sound or seismic 
waves, for example, which propagate due to mechanical forces in the medium). 
They postulated the existence of an invisible, massless medium, called the ether, 
which filled all space, was undetectable by any mechanical means, and existed solely 
for the propagation of light waves. It seemed reasonable then to obtain evidence 
for the ether by measuring the velocity of the Earth moving through the ether. This 
could be done in the geometry of Figure 2.5 by measuring the difference between 
the upstream-downstream and cross-stream times for a light wave. The calculation 
based on Galilean relativity would then give the relative velocity u between O (in 
the Earth’s frame of reference) and the ether. 

The first detailed and precise search for the preferred frame was performed 
in 1887 by the American physicist Albert A. Michelson and his associate 
E. W. Morley. Their apparatus consisted of a specially designed Michelson 
interferometer, illustrated in Figure 2.6. A monochromatic beam of light is split 
in two; the two beams travel different paths and are then recombined. Any 
phase difference between the combining beams causes bright and dark bands or 
“fringes” to appear, corresponding, respectively, to constructive and destructive 
interference, as shown in Figure 2.7. 

There are two contributions to the phase difference between the beams. The 
first contribution comes from the path difference AB — AC; one of the beams may 
travel a longer distance. The second contribution, which would still be present 
even if the path lengths were equal, comes from the time difference between the 
upstream-downstream and cross-stream paths (as in Example 2.3) and indicates 
the motion of the Earth through the ether. Michelson and Morley used a clever 
method to isolate this second contribution—they rotated the entire apparatus by 
90°! The rotation doesn’t change the first contribution to the phase difference 
(because the lengths AB and AC don’t change), but the second contribution 
changes sign, because what was an upstream-downstream path before the rotation 
becomes a cross-stream path after the rotation. As the apparatus is rotated through 
90°, the fringes should change from bright to dark and back again as the phase 
difference changes. Each change from bright to dark represents a phase change of 
180° (a half cycle), which corresponds to a time difference of a half period (about 
$10^{-15}$ s for visible light). Counting the number of fringe changes thus gives a 
measure of the time difference between the paths, which in turn gives the relative 
velocity u. (See Problem 3.) 
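A rough numerical sketch of this reasoning, using the round-trip times of Example 2.3, is given below in Python. The arm length (11 m of effective optical path), the wavelength (550 nm), and the speed of the Earth (30 km/s, roughly its orbital speed) are assumed illustrative values that are not given in the text, and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s, as quoted in the text


def time_difference(arm_length, u, c=C):
    """Upstream-downstream minus cross-stream round-trip time (Example 2.3)."""
    t_along = 2 * arm_length * c / (c**2 - u**2)
    t_across = 2 * arm_length / math.sqrt(c**2 - u**2)
    return t_along - t_across


def fringe_shift(arm_length, u, wavelength, c=C):
    """Expected shift, in fringes, when the apparatus is rotated by 90 degrees.

    The rotation swaps the two arms, so the time difference changes sign and
    the total change is twice time_difference; one fringe is one period.
    """
    period = wavelength / c
    return 2 * time_difference(arm_length, u, c) / period


def speed_for_shift(shift, arm_length, wavelength, c=C):
    """Speed giving a chosen shift, in the small-u limit shift = 2 L u^2 / (wavelength c^2)."""
    return c * math.sqrt(shift * wavelength / (2 * arm_length))


# Assumed illustrative values, not taken from the text:
ARM_LENGTH = 11.0        # m, effective optical path per arm
WAVELENGTH = 550e-9      # m, mid-visible light
ORBITAL_SPEED = 3.0e4    # m/s, approximate orbital speed of the Earth

expected = fringe_shift(ARM_LENGTH, ORBITAL_SPEED, WAVELENGTH)
limit = speed_for_shift(0.01, ARM_LENGTH, WAVELENGTH)
print(f"expected shift: {expected:.2f} fringe")
print(f"speed for a 0.01-fringe shift: {limit / 1000:.1f} km/s")
```

Under these assumptions the expected shift is a sizable fraction of a fringe, while a shift of 0.01 fringe corresponds to a speed of a few kilometers per second.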

When Michelson and Morley performed their experiment, there was no 
observable change in the fringe pattern—they deduced a shift of less than 0.01 
fringe, corresponding to a speed of the Earth through the ether of at most 5 km/s. 
As a last resort, they reasoned that perhaps the orbital motion of the Earth just 
happened to cancel out the overall motion through the ether. If this were true, 
six months later (when the Earth would be moving in its orbit in the opposite 
direction) the cancellation should not occur. When they repeated the experiment 
six months later, they again obtained a null result. In no experiment were 
Michelson and Morley able to detect the motion of the Earth through the ether. 

In summary, we have seen that there is a direct chain of reasoning that leads 
from Galileo’s principle of inertia, through Newton’s laws with their implicit 
assumptions about space and time, ending with the failure of the Michelson- 
Morley experiment to observe the motion of the Earth relative to the ether. 
Although several explanations were offered for the unobservability of the ether 
and the corresponding failure of the upstream-downstream and cross-stream 
velocities to add in the expected way, the most novel, revolutionary, and ultimately 
successful explanation is given by Einstein’s special theory of relativity, which 
requires a serious readjustment of our traditional concepts of space and time, and 
therefore alters some of the very foundations of physics. 


2.3 EINSTEIN’S POSTULATES 


The special theory of relativity is based on two postulates proposed by Albert 
Einstein in 1905: 
The principle of relativity: The laws of physics are the same in all inertial 
reference frames. 
The principle of the constancy of the speed of light: The speed of light in 
free space has the same value c in all inertial reference frames. 


The first postulate declares that the laws of physics are absolute, universal, and 
the same for all inertial observers. Laws that hold for one inertial observer cannot 
be violated for any inertial observer. 

The second postulate is more difficult to accept because it seems to go against 
our “common sense,” which is based on the Galilean kinematics we observe in 
everyday experiences. Consider three observers A, B, and C. Observer B is at rest, 
while A and C move away from B in opposite directions each at a speed of c/4. B 
fires a light beam in the direction of A. According to the Galilean transformation, 
if B measures a speed of c for the light beam, then A measures a speed of 
c—c/4 = 3c/4, while C measures a speed of c + c/4 = 5c/4. Einstein’s second 
postulate, on the other hand, requires all three observers to measure the same 
speed of c for the light beam! This postulate immediately explains the failure of 
the Michelson-Morley experiment—the upstream-downstream and cross-stream 
speeds are identical (both are equal to c), so there is no phase difference between the 
two beams. 
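The contrast between the two predictions for observers A, B, and C can be laid out in a short Python sketch. Speeds are expressed in units of c, and the function names are hypothetical.

```python
from fractions import Fraction

C = Fraction(1)  # work in units where the speed of light measured by B is 1


def galilean_speed_of_beam(observer_velocity, beam_velocity=C):
    """Galilean prediction (Eq. 2.2) for the beam's velocity seen by a moving observer."""
    return beam_velocity - observer_velocity


def einstein_speed_of_beam(observer_velocity):
    """Second postulate: every inertial observer measures c, whatever their velocity."""
    return C


# A moves in the beam's direction at c/4; C moves the opposite way at c/4.
observers = (("A", Fraction(1, 4)), ("B", Fraction(0)), ("C", Fraction(-1, 4)))
for name, velocity in observers:
    print(name, galilean_speed_of_beam(velocity), einstein_speed_of_beam(velocity))
```

The Galilean column gives 3/4, 1, and 5/4 (in units of c) for A, B, and C, while the second postulate gives 1 for all three.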

The two postulates also allow us to dispose of the ether hypothesis. The first 
postulate does not permit a preferred frame of reference (all inertial frames are 
equivalent), and the second postulate does not permit only a single frame of 
reference in which light moves at speed c, because light moves at speed c in all 
frames. The ether, as a preferred reference frame in which light has a unique 
speed, is therefore unnecessary.

FIGURE 2.7 Interference fringes as observed with the Michelson interferometer
of Figure 2.6. When the path length ACA changes by one-half wavelength relative
to ABA, all light areas turn dark and all dark areas turn light.

Albert Einstein (1879-1955, Germany-United States). A gentle philosopher and
pacifist, he was the intellectual leader of two generations of theoretical
physicists and left his imprint on nearly every field of modern physics.

FIGURE 2.8 The clock ticks at intervals $\Delta t_0$ determined by the time for
a light flash to travel the distance $2L_0$ from the light source S to the mirror
M and back to the source, where it is detected. (We assume the emission
and detection occur at the same location, so the beam travels perpendicular
to the mirror.)











2.4 CONSEQUENCES OF EINSTEIN’S POSTULATES 


Among their many consequences, Einstein’s postulates require a new considera- 
tion of the fundamental nature of time and space. In this section we discuss how 
the postulates affect measurements of time and length intervals by observers in 
different frames of reference. 


The Relativity of Time 


To demonstrate the relativity of time, we use the timing device illustrated in
Figure 2.8. It consists of a flashing light source S that is a distance $L_0$ from a
mirror M. A flash of light from the source is reflected by the mirror, and when
the light returns to S the clock ticks and triggers another flash. The time interval
between ticks is the distance $2L_0$ (assuming the light travels perpendicular to the
mirror) divided by the speed c:

$$\Delta t_0 = \frac{2L_0}{c} \qquad (2.6)$$

This is the time interval that is measured when the clock is at rest with respect to 
the observer. 

We consider two observers: O is at rest on the ground, and O’ moves with speed
u. Each observer carries a timing device. Figure 2.9 shows a sequence of events that
O observes for the clock carried by O’. According to O, the flash is emitted when
the clock of O’ is at A, reflected when it is at B, and detected at C. In this interval
$\Delta t$, O observes the clock to move forward a distance of $u\,\Delta t$ from the point at which
the flash was emitted, and O concludes that the light beam travels a distance $2L$,
where $L = \sqrt{L_0^2 + (u\,\Delta t/2)^2}$, as shown in Figure 2.9. Because O observes the light
beam to travel at speed c (as required by Einstein’s second postulate), the time
interval measured by O is

$$\Delta t = \frac{2L}{c} = \frac{2\sqrt{L_0^2 + (u\,\Delta t/2)^2}}{c} \qquad (2.7)$$

Substituting for $L_0$ from Eq. 2.6 and solving Eq. 2.7 for $\Delta t$, we obtain

$$\Delta t = \frac{\Delta t_0}{\sqrt{1 - u^2/c^2}} \qquad (2.8)$$

FIGURE 2.9 In the frame of reference of O, the clock carried by O’ moves with speed
u. The dashed line, of length 2L, shows the path of the light beam according to O.
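
The step from Eq. 2.7 to Eq. 2.8 is short but easy to get wrong, so a sketch of the algebra may help. It uses only $L_0 = c\,\Delta t_0/2$ from Eq. 2.6 and introduces no new symbols.

```latex
\begin{align*}
c\,\Delta t &= 2\sqrt{L_0^2 + (u\,\Delta t/2)^2} \\
c^2\,\Delta t^2 &= 4L_0^2 + u^2\,\Delta t^2 = c^2\,\Delta t_0^2 + u^2\,\Delta t^2 \\
\Delta t^2\,(c^2 - u^2) &= c^2\,\Delta t_0^2 \\
\Delta t &= \frac{\Delta t_0}{\sqrt{1 - u^2/c^2}}
\end{align*}
```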




According to Eq. 2.8, observer O measures a longer time interval than O’ measures.
This is a general result of special relativity, which is known as time dilation. An
observer O’ is at rest relative to a device that produces a time interval $\Delta t_0$. For this
observer, the beginning and end of the time interval occur at the same location,
and so the interval $\Delta t_0$ is known as the proper time. An observer O, relative to
whom O’ is in motion, measures a longer time interval $\Delta t$ for the same device.
The dilated time interval $\Delta t$ is always longer than the proper time interval $\Delta t_0$,
no matter what the magnitude or direction of u.

This is a real effect that applies not only to clocks based on light beams but 
also to time itself; all clocks run more slowly according to an observer in relative 
motion, biological clocks included. Even the growth, aging, and decay of living 
systems are slowed by the time dilation effect. However, note that under normal
circumstances ($u \ll c$), there is no measurable difference between $\Delta t$ and $\Delta t_0$,
so we don’t notice the effect in our everyday activities. Time dilation has been
verified experimentally with decaying elementary particles as well as with precise 
atomic clocks carried aboard aircraft. Some experimental tests are discussed in 
the last section of this chapter. 
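
To get a feeling for how small the effect is at ordinary speeds, the following sketch evaluates the fractional excess of the dilated interval over the proper interval, that is, the factor in Eq. 2.8 minus one. The listed speeds and the function name `dilation_excess` are illustrative assumptions, not values taken from the text.

```python
import math

C = 2.998e8  # speed of light in m/s (rounded)

def dilation_excess(u):
    """Return 1/sqrt(1 - u^2/c^2) - 1 for a speed u in m/s.

    log1p/expm1 keep full precision when u is tiny compared with c.
    """
    beta2 = (u / C) ** 2
    return math.expm1(-0.5 * math.log1p(-beta2))

# Illustrative speeds (assumed values, not from the text)
speeds = {
    "car (30 m/s)": 30.0,
    "airliner (250 m/s)": 250.0,
    "low Earth orbit (7.8 km/s)": 7.8e3,
    "0.5c": 0.5 * C,
}

for label, u in speeds.items():
    print(f"{label:28s} fractional excess = {dilation_excess(u):.3e}")
```

For the everyday speeds the excess is far below one part in a billion, which is why extremely precise clocks are needed to detect it.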


| Example 2.4 





Muons are elementary particles with a (proper) lifetime
of 2.2 μs. They are produced with very high
speeds in the upper atmosphere when cosmic rays
(high-energy particles from space) collide with air molecules.
Take the height $L_0$ of the atmosphere to be 100 km in the
reference frame of the Earth, and find the minimum speed
that enables the muons to survive the journey to the surface
of the Earth.


Solution 

The birth and decay of the muon can be considered as the
“ticks” of a clock. In the frame of reference of the Earth
(observer O) this clock is moving, and therefore its ticks are
slowed by the time dilation effect. If the muon is moving at
a speed that is close to c, the time necessary for it to travel
from the top of the atmosphere to the surface of the Earth is

$$\Delta t = \frac{L_0}{c} = \frac{100\ \text{km}}{3.00 \times 10^8\ \text{m/s}} = 333\ \mu\text{s}$$

If the muon is to be observed at the surface of the Earth,
it must live for at least 333 μs in the Earth’s frame of
reference. In the muon’s frame of reference, the interval
between its birth and decay is a proper time interval of
2.2 μs. The time intervals are related by Eq. 2.8:

$$333\ \mu\text{s} = \frac{2.2\ \mu\text{s}}{\sqrt{1 - u^2/c^2}}$$

Solving, we find

$$u = 0.999978c$$
If it were not for the time dilation effect, muons would 
not survive to reach the Earth’s surface. The observation 


of these muons is a direct verification of the time dilation 
effect of special relativity. 
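
The arithmetic of Example 2.4 can be checked with a few lines of code. This is only a sketch: the variable names are ours, and it uses the same rounded value of c as the example.

```python
import math

C = 3.00e8        # speed of light, m/s (value used in Example 2.4)
L0 = 100e3        # height of the atmosphere in the Earth frame, m
TAU0 = 2.2e-6     # proper lifetime of the muon, s

dt_earth = L0 / C                       # travel time in the Earth frame, assuming u ~ c
gamma = dt_earth / TAU0                 # dilation factor required by Eq. 2.8
beta = math.sqrt(1.0 - 1.0 / gamma**2)  # u/c, from gamma = 1/sqrt(1 - beta^2)

# Without time dilation the muon would cover only about c * TAU0
classical_range = C * TAU0

print(f"dt = {dt_earth * 1e6:.0f} us, gamma = {gamma:.1f}, u/c = {beta:.6f}")
print(f"range without time dilation: about {classical_range:.0f} m")
```

The last line shows why the observation is such a direct test: without time dilation a muon would travel well under a kilometer before decaying.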





The Relativity of Length 


For this discussion, the moving timing device of O’ is turned sideways, so that
the light travels parallel to the direction of motion of O’. Figure 2.10 shows the
sequence of events that O observes for the moving clock. According to O, the
length of the clock (distance between the light source and the mirror) is L; as we
shall see, this length is different from the length $L_0$ measured by O’, relative to
whom the clock is at rest.

FIGURE 2.10 Here the clock carried by O’ emits its light flash in the direction of
motion.

The flash of light is emitted when the clock of O’ is at A and reaches the mirror
(position B) at time $\Delta t_1$ later. In this time interval, the light travels a distance
$c\,\Delta t_1$, equal to the length L of the clock plus the additional distance $u\,\Delta t_1$ that the
mirror moves forward in this interval. That is,

$$c\,\Delta t_1 = L + u\,\Delta t_1 \qquad (2.9)$$

The flash of light travels from the mirror to the detector in a time $\Delta t_2$ and covers
a distance of $c\,\Delta t_2$, equal to the length L of the clock less the distance $u\,\Delta t_2$ that
the clock moves forward in this interval:

$$c\,\Delta t_2 = L - u\,\Delta t_2 \qquad (2.10)$$

Solving Eqs. 2.9 and 2.10 for $\Delta t_1$ and $\Delta t_2$, and adding to find the total time
interval, we obtain

$$\Delta t = \frac{L}{c - u} + \frac{L}{c + u} = \frac{2L}{c}\,\frac{1}{1 - u^2/c^2} \qquad (2.11)$$
FIGURE 2.11 Some length-contracted objects. Notice that the shortening occurs only in the direction of motion.

From Eq. 2.8,

$$\Delta t = \frac{\Delta t_0}{\sqrt{1 - u^2/c^2}} = \frac{2L_0}{c}\,\frac{1}{\sqrt{1 - u^2/c^2}} \qquad (2.12)$$

Setting Eqs. 2.11 and 2.12 equal to one another and solving, we obtain

$$L = L_0\sqrt{1 - u^2/c^2} \qquad (2.13)$$

Equation 2.13 summarizes the effect known as length contraction. Observer O’,
who is at rest with respect to the object, measures the rest length $L_0$ (also known
as the proper length, in analogy with the proper time). All observers relative to
whom O’ is in motion measure a shorter length, but only along the direction of
motion; length measurements transverse to the direction of motion are unaffected
(Figure 2.11).

For ordinary speeds ($u \ll c$), the effects of length contraction are too small to
be observed. For example, a rocket of length 100 m traveling at the escape speed
from Earth (11.2 km/s) would appear to an observer on Earth to contract by only
about $7 \times 10^{-8}$ m, a few hundred atomic diameters!

Length contraction suggests that objects in motion are measured to have a
shorter length than they do at rest. The objects do not actually shrink; there is
merely a difference in the length measured by different observers. For example,
to observers on Earth a high-speed rocket ship would appear to be contracted
along its direction of motion (Figure 2.12a), but to an observer on the ship it is
the passing Earth that appears to be contracted (Figure 2.12b).

These representations of length-contracted objects are somewhat idealized.
The actual appearance of a rapidly moving object is determined by the time at
which light leaves the various parts of the object and enters the eye or the camera.
The result is that the object appears distorted in shape and slightly rotated.

FIGURE 2.12 (a) The Earth views the passing contracted rocket. (b) From
the rocket’s frame of reference, the Earth appears contracted.

| Example 2.5





Consider the point of view of an observer who is moving
toward the Earth at the same velocity as the muon. In
this reference frame, what is the apparent thickness of the
Earth’s atmosphere?

Solution
In this observer’s reference frame, the muon is at rest and
the Earth is rushing toward it at a speed of u = 0.999978c,
as we found in Example 2.4. To an observer on the Earth,
the height of the atmosphere is its rest length $L_0$ of 100 km.
To the observer in the muon’s rest frame, the moving Earth
has an atmosphere of height given by Eq. 2.13:

$$L = L_0\sqrt{1 - u^2/c^2} = (100\ \text{km})\sqrt{1 - (0.999978)^2} = 0.66\ \text{km} = 660\ \text{m}$$

This distance is small enough for the muons to reach the
Earth’s surface within their lifetime.
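
As a rough numerical check of Eq. 2.13, the sketch below evaluates the contracted atmosphere of Example 2.5 and the shortening of a 100-m rocket moving at 11.2 km/s. The function name `contracted_length` is ours, and the low-speed estimate uses the approximation $L_0 u^2/2c^2$, which is an assumption valid only for $u \ll c$.

```python
import math

C = 2.998e8  # speed of light, m/s

def contracted_length(L0, u):
    """Length measured for an object of rest length L0 moving at speed u (Eq. 2.13)."""
    return L0 * math.sqrt(1.0 - (u / C) ** 2)

# Atmosphere of rest thickness 100 km as measured in the muon's frame (Example 2.5)
u_muon = 0.999978 * C
L_atm = contracted_length(100e3, u_muon)
transit_time = L_atm / u_muon   # time for the atmosphere to rush past the muon

# Rocket of rest length 100 m at the escape speed from Earth, 11.2 km/s.
# For u << c the shortening is approximately L0 * u^2 / (2 c^2).
rocket_shortening = 100.0 * (11.2e3 / C) ** 2 / 2.0

print(f"atmosphere in muon frame: {L_atm:.0f} m, transit time {transit_time * 1e6:.1f} us")
print(f"rocket shortening: {rocket_shortening:.1e} m")
```

The transit time comes out close to the muon's 2.2-μs proper lifetime, so the two frames agree that the muon can just reach the surface.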





Note that what appears as a time dilation in one frame of reference (the observer
on Earth) can be regarded as a length contraction in another frame of reference
(the observer traveling with the muon). For another example of this effect, let’s
review again the example of the pion decay discussed in Section 1.2. A pion at
rest has a lifetime of 26.0 ns. According to observer $O_1$, at rest in the laboratory
frame of reference, a pion moving through the laboratory at a speed of 0.913c
has a longer lifetime, which can be calculated to be 63.7 ns (using Eq. 2.8 for the
time dilation). According to observer $O_2$, who is traveling through the laboratory
at the same velocity as the pion, the pion appears to be at rest and has its proper
lifetime of 26.0 ns. Thus $O_1$ sees a time dilation effect.

$O_1$ erects two markers in the laboratory, at the locations where the pion is
created and decays. To $O_1$, the distance between those markers is the pion’s speed
times its lifetime, which works out to be 17.4 m. Suppose $O_1$ places a stick of
length 17.4 m in the laboratory connecting the two markers. That stick is at rest
in the laboratory reference frame and so has its proper length in that frame. In
the reference frame of $O_2$, the stick is moving at a speed of 0.913c and has a
shorter length of 7.1 m, which we can find using the length contraction formula
(Eq. 2.13). So $O_2$ measures a distance of 7.1 m between the locations in the
laboratory where the pion was created and where it decayed.

Note that $O_1$ measures the proper length and the dilated time, while $O_2$
measures the proper time and the contracted length. The proper time and proper
length must always be referred to specific observers, who might not be in the same
reference frame. The proper time is always measured by an observer according to
whom the beginning of the time interval and the end of the time interval occur at
the same location. If the time interval is the lifetime of the pion, then $O_2$ (relative
to whom the pion does not move) sees its creation and decay at the same location
and thus measures the proper time interval. The proper length, on the other hand,
is always measured by an observer according to whom the measuring stick is at
rest ($O_1$ in this case).
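
The pion numbers quoted above can be reproduced with the short sketch below. The variable names are ours; the final line is an extra consistency check, in which the distance seen in the pion's frame is computed a second way, as the speed of the passing laboratory times the proper lifetime.

```python
import math

C = 2.998e8      # speed of light, m/s
TAU0 = 26.0e-9   # proper lifetime of the pion, s
BETA = 0.913     # pion speed as a fraction of c

gamma = 1.0 / math.sqrt(1.0 - BETA**2)

tau_lab = gamma * TAU0        # dilated lifetime in the laboratory frame (Eq. 2.8)
d_lab = BETA * C * tau_lab    # distance between the markers in the laboratory frame
d_pion = d_lab / gamma        # contracted distance in the pion's frame (Eq. 2.13)

# Second route: the markers pass the pion at speed BETA*C during its proper lifetime
d_pion_alt = BETA * C * TAU0

print(f"lab lifetime {tau_lab * 1e9:.1f} ns, lab distance {d_lab:.1f} m")
print(f"pion-frame distance {d_pion:.1f} m (alternative route {d_pion_alt:.1f} m)")
```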


| Example 2.6 


An observer O is standing on a platform of length
$D_0$ = 65 m on a space station. A rocket passes at a relative
speed of 0.80c moving parallel to the edge of the platform.
The observer O notes that the front and back of the rocket
simultaneously line up with the ends of the platform at a
particular instant (Figure 2.13a). (a) According to O, what
is the time necessary for the rocket to pass a particular
point on the platform? (b) What is the rest length $L_0$ of the
rocket? (c) According to an observer O’ on the rocket, what
is the length D of the platform? (d) According to O’, how
long does it take for observer O to pass the entire length
of the rocket? (e) According to O, the ends of the rocket
simultaneously line up with the ends of the platform. Are
these events simultaneous to O’?

FIGURE 2.13 Example 2.6. (a) From the reference frame of O
at rest on the platform, the passing rocket lines up simultaneously
with the front and back of the platform. (b, c) From the
reference frame of O’ in the rocket, the passing platform lines up
first with the front of the rocket and later with the rear. Note
the differing effects of length contraction in the two reference
frames.


Solution 

(a) According to O, the length L of the rocket matches the
length $D_0$ of the platform. The time for the rocket to pass a
particular point is measured by O to be

$$\Delta t_0 = \frac{L}{0.80c} = \frac{65\ \text{m}}{2.40 \times 10^8\ \text{m/s}} = 0.27\ \mu\text{s}$$

This is a proper time interval, because O measures the
interval between two events that occur at the same point in
the frame of reference of O (the front of the rocket passes
a point, and then the back of the rocket passes the same
point).

(b) O measures the contracted length L of the rocket. We
can find its proper length $L_0$ using Eq. 2.13:

$$L_0 = \frac{L}{\sqrt{1 - u^2/c^2}} = \frac{65\ \text{m}}{\sqrt{1 - (0.80)^2}} = 108\ \text{m}$$

(c) According to O the platform is at rest, so 65 m is its
proper length $D_0$. According to O’, the contracted length of
the platform is therefore

$$D = D_0\sqrt{1 - u^2/c^2} = (65\ \text{m})\sqrt{1 - (0.80)^2} = 39\ \text{m}$$


(d) For O to pass the entire length of the rocket, O’ concludes
that O must move a distance equal to its rest length,
or 108 m. The time needed to do this is

$$\Delta t' = \frac{108\ \text{m}}{0.80c} = 0.45\ \mu\text{s}$$

Note that this is not a proper time interval for O’, who determines
this time interval using one clock at the front of the
rocket to measure the time at which O passes the front of the
rocket, and another clock on the rear of the rocket to measure
the time at which O passes the rear of the rocket. The
two events therefore occur at different points in O’ and so
cannot be separated by a proper time in O’. The corresponding
time interval measured by O for the same two events,
which we calculated in part (a), is a proper time interval for
O, because the two events do occur at the same point in O.
The time intervals measured by O and O’ should be related
by the time dilation formula, as you should verify.


(e) According to O’, the rocket has a rest length of $L_0$ =
108 m and the platform has a contracted length of D = 39 m.
There is thus no way that O’ could observe the two ends
of both to align simultaneously. The sequence of events
according to O’ is illustrated in Figures 2.13b and c. The time
interval $\Delta t'$ in O’ between the two events that are simultaneous
in O can be calculated by noting that, according to O’,
the time interval between the situations shown in Figures
2.13b and c must be that necessary for the platform to move
a distance of 108 m - 39 m = 69 m, which takes a time

$$\Delta t' = \frac{69\ \text{m}}{0.80c} = 0.29\ \mu\text{s}$$

This result illustrates the relativity of simultaneity: two 
events at different locations that are simultaneous to O (the 
lining up of the two ends of the rocket with the two ends of 
the platform) cannot be simultaneous to O’. 
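
The five parts of Example 2.6 can be checked together, including the time dilation relation between parts (a) and (d) that the example leaves to the reader. The sketch below uses our own variable names and the same rounded value of c as the example.

```python
import math

C = 3.00e8          # speed of light, m/s (value used in the example)
U = 0.80 * C        # relative speed of rocket and platform
D0 = 65.0           # proper length of the platform, m
root = math.sqrt(1.0 - (U / C) ** 2)   # the factor sqrt(1 - u^2/c^2)

dt_a = D0 / U            # (a) proper time for the rocket to pass a point, per O
L0 = D0 / root           # (b) rest length of the rocket
D = D0 * root            # (c) contracted platform length, per O'
dt_d = L0 / U            # (d) time for O to pass the whole rocket, per O'
dt_e = (L0 - D) / U      # (e) interval in O' between the two alignment events

print(f"(a) {dt_a * 1e6:.2f} us  (b) {L0:.0f} m  (c) {D:.0f} m  "
      f"(d) {dt_d * 1e6:.2f} us  (e) {dt_e * 1e6:.2f} us")

# Time dilation check: the interval in (d) times sqrt(1 - u^2/c^2) should equal (a)
print("parts (a) and (d) consistent:", math.isclose(dt_d * root, dt_a))
```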





Relativistic Velocity Addition 


The timing device is now modified as shown in Figure 2.14. A source P emits
particles that travel at speed $v'$ according to an observer O’ at rest with respect
to the device. The flashing bulb F is triggered to flash when a particle reaches it.
The flash of light makes the return trip to the detector D, and the clock ticks. The
time interval $\Delta t_0$ between ticks measured by O’ is composed of two parts: one for
the particle to travel the distance $L_0$ at speed $v'$ and another for the light to travel
the same distance at speed c:

$$\Delta t_0 = \frac{L_0}{v'} + \frac{L_0}{c} \qquad (2.14)$$

FIGURE 2.14 In this timing device, a particle is emitted by P at a speed
$v'$. When the particle reaches F, it triggers the emission of a flash of light
that travels to the detector D.

According to observer O, relative to whom O’ moves at speed u, the sequence of
events is similar to that shown in Figure 2.10. The emitted particle, which travels
at speed v according to O, reaches F in a time interval $\Delta t_1$ after traveling the
distance $v\,\Delta t_1$, equal to the (contracted) length L plus the additional distance $u\,\Delta t_1$
moved by the clock in that interval:

$$v\,\Delta t_1 = L + u\,\Delta t_1 \qquad (2.15)$$

In the interval $\Delta t_2$, the light beam travels a distance $c\,\Delta t_2$ equal to the length L
less the distance $u\,\Delta t_2$ moved by the clock in that interval:

$$c\,\Delta t_2 = L - u\,\Delta t_2 \qquad (2.16)$$

We now solve Eqs. 2.15 and 2.16 for $\Delta t_1$ and $\Delta t_2$, add to find the total interval
$\Delta t$ between ticks according to O, use the time dilation formula, Eq. 2.8, to relate
this result to $\Delta t_0$ from Eq. 2.14, and finally use the length contraction formula,
Eq. 2.13, to relate L to $L_0$. After doing the algebra, we find the result

$$v = \frac{v' + u}{1 + v'u/c^2} \qquad (2.17)$$
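
The algebra leading to the velocity addition result, Eq. 2.17, is not shown in the text. One possible route, using only Eqs. 2.8 and 2.13 through 2.16, is sketched here; other orderings of the steps work equally well.

```latex
\begin{align*}
\Delta t &= \Delta t_1 + \Delta t_2 = \frac{L}{v - u} + \frac{L}{c + u} \\
\Delta t &= \frac{\Delta t_0}{\sqrt{1 - u^2/c^2}}
  = \frac{L_0}{\sqrt{1 - u^2/c^2}}\left(\frac{1}{v'} + \frac{1}{c}\right),
  \qquad L = L_0\sqrt{1 - u^2/c^2} \\
\left(1 - \frac{u^2}{c^2}\right)\left(\frac{1}{v - u} + \frac{1}{c + u}\right)
  &= \frac{1}{v'} + \frac{1}{c} \\
\frac{1 - u^2/c^2}{v - u} &= \frac{1}{v'} + \frac{u}{c^2}
  \qquad\text{since}\quad \frac{1 - u^2/c^2}{c + u} = \frac{1}{c} - \frac{u}{c^2} \\
v &= u + \frac{v'\,(1 - u^2/c^2)}{1 + v'u/c^2} = \frac{v' + u}{1 + v'u/c^2}
\end{align*}
```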
Equation 2.17 is the relativistic velocity addition law for velocity components
that are in the direction of u. Later in this chapter we use a different method to
derive the corresponding results for motion in other directions.

We can also regard Eq. 2.17 as a velocity transformation, enabling us to convert
a velocity $v'$ measured by O’ to a velocity v measured by O. The corresponding
classical law was given by Eq. 2.2: $v = v' + u$. The difference between the
classical and relativistic results is the denominator of Eq. 2.17, which reduces to
1 in cases when the speeds are small compared with c. Example 2.7 shows how
this factor prevents the measured speeds from exceeding c.

Equation 2.17 gives an important result when O’ observes a light beam.
For $v' = c$,

$$v = \frac{c + u}{1 + cu/c^2} = c \qquad (2.18)$$

That is, when $v' = c$, then $v = c$, independent of the value of u. All observers
measure the same speed c for light, exactly as required by Einstein’s second
postulate.

| Example 2.7





A spaceship moving away from the Earth at a speed of 
0.80c fires a missile parallel to its direction of motion 
(Figure 2.15). The missile moves at a speed of 0.60c rela- 
tive to the ship. What is the speed of the missile as measured 
by an observer on the Earth? 


FIGURE 2.15 Example 2.7. A spaceship moves away from
Earth at a speed of u = 0.80c. An observer O’ on the spaceship
fires a missile and measures its speed to be $v'$ = 0.60c relative to
the ship.


Solution 
Here O’ is on the ship and O is on Earth; O’ moves with
a speed of u = 0.80c relative to O. The missile moves at
speed $v'$ = 0.60c relative to O’, and we seek its speed v
relative to O. Using Eq. 2.17, we obtain

$$v = \frac{v' + u}{1 + v'u/c^2} = \frac{0.60c + 0.80c}{1 + (0.60c)(0.80c)/c^2} = \frac{1.40c}{1.48} = 0.95c$$





According to classical kinematics (the numerator of 
Eq. 2.17), an observer on the Earth would see the mis- 
sile moving at 0.60c + 0.80c = 1.40c, thereby exceeding 
the maximum relative speed of c permitted by relativ- 
ity. You can see how Eq. 2.17 brings about this speed 
limit. Even if v’ were 0.9999. ..c and u were 0.9999 ...c, 
the relative speed v measured by O would remain less 
than c. 
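
The behavior of Eq. 2.17 described above is easy to explore numerically. In this sketch the function name `add_velocities` is ours and velocities are expressed as fractions of c; it evaluates the missile of Example 2.7, a light beam, two speeds very close to c, and a low-speed case in which the classical sum is recovered.

```python
def add_velocities(v_prime, u, c=1.0):
    """Relativistic velocity addition, Eq. 2.17, for motion along the direction of u.

    v_prime: velocity measured by O'; u: velocity of O' relative to O.
    Velocities are in units where c = 1 unless another c is passed.
    """
    return (v_prime + u) / (1.0 + v_prime * u / c**2)

v_missile = add_velocities(0.60, 0.80)        # Example 2.7
v_light = add_velocities(1.0, 0.80)           # light beam observed by O'
v_extreme = add_velocities(0.9999, 0.9999)    # both speeds very close to c
v_slow = add_velocities(1e-6, 2e-6)           # low speeds: nearly the classical sum

print(f"missile: {v_missile:.2f}c")
print(f"light:   {v_light:.2f}c")
print(f"extreme: {v_extreme:.10f}c")
print(f"slow:    {v_slow:.3e}c")
```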





The Relativistic Doppler Effect 


In the classical Doppler effect, an observer moving relative to
a source of waves (sound, for example) detects a frequency different from that
emitted by the source. The frequency $f'$ heard by the observer O is related to the
frequency f emitted by the source S according to

$$f' = f\,\frac{v \pm v_O}{v \mp v_S} \qquad (2.19)$$

where v is the speed of the waves in the medium (such as still air, in the case of
sound waves), $v_S$ is the speed of the source relative to the medium, and $v_O$ is the
speed of the observer relative to the medium. The upper signs in the numerator
and denominator are chosen whenever S moves toward O or O moves toward S,
while the lower signs apply whenever O and S move away from one another.

The classical Doppler shift for motion of the source differs from that for 
motion of the observer. For example, suppose the source emits sound waves at 
f = 1000 Hz. If the source moves at 30 m/s toward the observer who is at rest 
in the medium (which we take to be air, in which sound moves at v = 340 m/s), 
then f’ = 1097 Hz, while if the source is at rest in the medium and the observer 
moves toward the source at 30 m/s, the frequency is 1088 Hz. Other possibilities 
in which the relative speed between S and O is 30 m/s, such as each moving 
toward the other at 15 m/s, give still different frequencies. 
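
The three cases just described can be evaluated directly from the classical formula, Eq. 2.19. The sketch below replaces the upper and lower signs by a signed-speed convention, which is our own choice for the code rather than the notation of the text.

```python
def classical_doppler(f, v, v_observer=0.0, v_source=0.0):
    """Classical Doppler shift, Eq. 2.19, with a sign convention chosen for this sketch:
    a positive v_observer or v_source means motion toward the other party,
    a negative value means motion away. Speeds are relative to the medium.
    """
    return f * (v + v_observer) / (v - v_source)

F, V = 1000.0, 340.0   # emitted frequency (Hz) and speed of sound in air (m/s)

f_source_moves = classical_doppler(F, V, v_source=30.0)
f_observer_moves = classical_doppler(F, V, v_observer=30.0)
f_both_move = classical_doppler(F, V, v_observer=15.0, v_source=15.0)

print(f"source moving:   {f_source_moves:.0f} Hz")
print(f"observer moving: {f_observer_moves:.0f} Hz")
print(f"both moving:     {f_both_move:.0f} Hz")
```

All three cases have the same relative speed of 30 m/s, yet the three frequencies differ.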

Here we have a situation in which it is not the relative speed of the source and 
observer that determines the Doppler shift—it is the speed of each with respect 
to the medium. This cannot occur for light waves, since there is no medium
(no “ether”) and no preferred reference frame by Einstein’s first postulate. We
therefore require a different approach to the Doppler effect for light waves, an 
approach that does not distinguish between source motion and observer motion, 
but involves only the relative motion between the source and the observer. 

Consider a source of waves that is at rest in the reference frame of observer
O. Observer O’ moves relative to the source at speed u. We consider the situation
from the frame of reference of O’, as shown in Figure 2.16. Suppose O observes
the source to emit N waves at frequency f. According to O, it takes an interval
$\Delta t_0 = N/f$ for these N waves to be emitted; this is a proper time interval in the
frame of reference of O. The corresponding time interval to O’ is $\Delta t'$, during
which O moves a distance $u\,\Delta t'$. The wavelength according to O’ is the total
length interval occupied by these waves divided by the number of waves:

$$\lambda' = \frac{c\,\Delta t' + u\,\Delta t'}{N} = \frac{c\,\Delta t' + u\,\Delta t'}{f\,\Delta t_0} \qquad (2.20)$$

The frequency according to O’ is $f' = c/\lambda'$, so

$$f' = f\,\frac{\Delta t_0}{\Delta t'}\,\frac{1}{1 + u/c} \qquad (2.21)$$

and using the time dilation formula, Eq. 2.8, to relate $\Delta t'$ and $\Delta t_0$, we obtain

$$f' = f\,\frac{\sqrt{1 - u^2/c^2}}{1 + u/c} = f\sqrt{\frac{1 - u/c}{1 + u/c}} \qquad (2.22)$$

This is the formula for the relativistic Doppler shift, for the case in which the
waves are observed in a direction parallel to u. Note that, unlike the classical
formula, it does not distinguish between source motion and observer motion; the
relativistic Doppler effect depends only on the relative speed u between the source
and observer.

Equation 2.22 assumes that the source and observer are separating. If the
source and observer are approaching one another, replace u by -u in the formula.

FIGURE 2.16 A source of waves, in the reference frame of O, moves
at speed u away from observer O’. In the time $\Delta t'$ (according to O’), O
moves a distance $u\,\Delta t'$ and emits N waves.

| Example 2.8 





A distant galaxy is moving away from the Earth at such 
high speed that the blue hydrogen line at a wavelength 
of 434nm is recorded at 600 nm, in the red range of the 
spectrum. What is the speed of the galaxy relative to the 
Earth? 


Solution 
Using Eq. 2.22 with $f = c/\lambda$ and $f' = c/\lambda'$, we obtain

$$\frac{c}{\lambda'} = \frac{c}{\lambda}\sqrt{\frac{1 - u/c}{1 + u/c}}$$

$$\frac{c}{600\ \text{nm}} = \frac{c}{434\ \text{nm}}\sqrt{\frac{1 - u/c}{1 + u/c}}$$

Solving, we find

$$u/c = 0.31$$

Thus the galaxy is moving away from Earth at a speed
of $0.31c = 9.4 \times 10^7$ m/s. Evidence obtained in this way
indicates that nearly all the galaxies we observe are moving
away from us. This suggests that the universe is expanding,
and is usually taken to provide evidence in favor of the Big
Bang theory of cosmology (see Chapter 15).
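
The "solving" step of Example 2.8 can be made explicit. Squaring Eq. 2.22 written in terms of wavelengths gives $(\lambda/\lambda')^2 = (1 - u/c)/(1 + u/c)$, which is linear in u/c. The sketch below, with function names of our own choosing, solves for u/c and then substitutes back as a check.

```python
import math

def recession_beta(lam_emitted, lam_observed):
    """Solve Eq. 2.22 for u/c, given emitted and observed wavelengths (same units).

    With f = c/lambda, (lam_emitted/lam_observed)^2 = (1 - beta)/(1 + beta).
    """
    r2 = (lam_emitted / lam_observed) ** 2
    return (1.0 - r2) / (1.0 + r2)

def doppler_factor(beta):
    """Ratio f'/f from Eq. 2.22 for a separating source and observer."""
    return math.sqrt((1.0 - beta) / (1.0 + beta))

beta = recession_beta(434.0, 600.0)   # Example 2.8, wavelengths in nm
speed = beta * 3.00e8                 # m/s

print(f"u/c = {beta:.2f}, u = {speed:.1e} m/s")
# Substituting back: f'/f should reproduce the wavelength ratio 434/600
print(f"f'/f = {doppler_factor(beta):.4f}, 434/600 = {434.0 / 600.0:.4f}")
```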





2.5 THE LORENTZ TRANSFORMATION


We have seen that the Galilean transformation of coordinates, time, and velocity 
is not consistent with Einstein’s postulates. Although the Galilean transformation 
agrees with our “common-sense” experience at low speeds, it does not agree with 
experiment at high speeds. We therefore need a new set of transformation equations 
that replaces the Galilean set and that is capable of predicting such relativistic effects 
as time dilation, length contraction, velocity addition, and the Doppler shift. 

As before, we seek a transformation that enables observers O and O’ in 
relative motion to compare their measurements of the space and time coordinates 
of the same event. The transformation equations relate the measurements of O 
(namely, x, y, z, t) to those of O’ (namely, x’, y’, z’, t’). This new transformation
must have several properties: It must be linear (depending only on the first power 
of the space and time coordinates), which follows from the homogeneity of space 
and time; it must be consistent with Einstein’s postulates; and it must reduce to the 
Galilean transformation when the relative speed between O and O’ is small. We 
again assume that the velocity of O’ relative to O is in the positive xx’ direction. 

This new transformation consistent with special relativity is called the Lorentz
transformation*. Its equations are

$$x' = \frac{x - ut}{\sqrt{1 - u^2/c^2}} \qquad (2.23a)$$

$$y' = y \qquad (2.23b)$$

$$z' = z \qquad (2.23c)$$

$$t' = \frac{t - (u/c^2)x}{\sqrt{1 - u^2/c^2}} \qquad (2.23d)$$

It is often useful to write these equations in terms of intervals of space and time
by replacing each coordinate by the corresponding interval (replace x by $\Delta x$, $x'$
by $\Delta x'$, t by $\Delta t$, $t'$ by $\Delta t'$).

These equations are written assuming that O’ moves away from O in the xx’
direction. If O’ moves toward O, replace u with -u in the equations.

The first three equations reduce directly to the Galilean transformation for
space coordinates, Eqs. 2.1, when $u \ll c$. The fourth equation, which links the
time coordinates, reduces to $t' = t$, which is a fundamental postulate of the
Galilean-Newtonian world.

*H. A. Lorentz (1853-1928) was a Dutch physicist who shared the 1902 Nobel Prize for his work
on the influence of magnetic fields on light. In an unsuccessful attempt to explain the failure of the
Michelson-Morley experiment, Lorentz developed the transformation equations that are named for
him in 1904, a year before Einstein published his special theory of relativity. For a derivation of the
Lorentz transformation, see R. Resnick and D. Halliday, Basic Concepts in Relativity (New York,
Macmillan, 1992).

We now use the Lorentz transformation equations to derive some of the 
predictions of special relativity. The problems at the end of the chapter guide you 
in some other derivations. The results derived here are identical with those we 
obtained previously using Einstein’s postulates, which shows that the equations of 
the Lorentz transformation are consistent with the postulates of special relativity. 
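
Before working through the derivations, it can help to see the transformation act on specific numbers. The sketch below implements the x and t equations of the Lorentz transformation; the function name `lorentz` and the sample events are illustrative assumptions. It checks that transforming with u and then with -u returns the original event, and that at low speed the result approaches the Galilean values.

```python
import math

def lorentz(x, t, u, c=1.0):
    """Lorentz transformation of an event (x, t) from O to O', Eqs. 2.23a and 2.23d.

    O' moves at velocity u along the common x axis; y and z are unchanged.
    """
    root = math.sqrt(1.0 - u**2 / c**2)
    return (x - u * t) / root, (t - u * x / c**2) / root

# Round trip in units with c = 1 (illustrative values)
x, t, u = 3.0, 5.0, 0.6
xp, tp = lorentz(x, t, u)
x_back, t_back = lorentz(xp, tp, -u)

# Low-speed limit in SI units: the result approaches x' = x - ut, t' = t
xs, ts = lorentz(100.0, 2.0, 30.0, c=2.998e8)

print(f"O' coordinates: x' = {xp:.3f}, t' = {tp:.3f}")
print(f"back in O:      x  = {x_back:.3f}, t  = {t_back:.3f}")
print(f"low speed:      x' = {xs:.6f} m, t' = {ts:.6f} s")
```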





Length Contraction 


A rod of length $L_0$ is at rest in the reference frame of observer O’. The rod extends
along the x’ axis from $x'_1$ to $x'_2$; that is, O’ measures the proper length $L_0 = x'_2 - x'_1$.
Observer O, relative to whom the rod is in motion, measures the ends of the rod
to be at coordinates $x_1$ and $x_2$. For O to determine the length of the moving rod,
O must make a simultaneous determination of $x_1$ and $x_2$, and then the length is
$L = x_2 - x_1$. Suppose the first event is O’ setting off a flash bulb at one end of the
rod at $x'_1$ and $t'_1$, which O observes at $x_1$ and $t_1$, and the second event is O’ setting
off a flash bulb at the other end at $x'_2$ and $t'_2$, which O observes at $x_2$ and $t_2$. The
equations of the Lorentz transformation relate these coordinates, specifically,

$$x'_1 = \frac{x_1 - ut_1}{\sqrt{1 - u^2/c^2}} \qquad x'_2 = \frac{x_2 - ut_2}{\sqrt{1 - u^2/c^2}} \qquad (2.24)$$

Subtracting these equations, we obtain

$$x'_2 - x'_1 = \frac{x_2 - x_1}{\sqrt{1 - u^2/c^2}} - \frac{u(t_2 - t_1)}{\sqrt{1 - u^2/c^2}} \qquad (2.25)$$

O’ must arrange to set off the flash bulbs so that the flashes appear to be
simultaneous to O. (They will not be simultaneous to O’, as we discuss later in this
section.) This enables O to make a simultaneous determination of the coordinates
of the endpoints of the rod. If O observes the flashes to be simultaneous, then
$t_2 = t_1$, and Eq. 2.25 reduces to

$$x'_2 - x'_1 = \frac{x_2 - x_1}{\sqrt{1 - u^2/c^2}} \qquad (2.26)$$

With $x'_2 - x'_1 = L_0$ and $x_2 - x_1 = L$, this becomes

$$L = L_0\sqrt{1 - u^2/c^2} \qquad (2.27)$$

which is identical with Eq. 2.13, which we derived earlier using Einstein’s
postulates.

FIGURE 2.17 A flash of light, emitted from a point midway between the two
clocks (clock 1 at x = 0, clock 2 at x = L), starts the two clocks simultaneously
according to O. Observer O’ sees clock 2 start ahead of clock 1.


Velocity Transformation 


If O observes a particle to travel with velocity v (components $v_x$, $v_y$, $v_z$), what
velocity $v'$ does O’ observe for the particle? The relationship between the velocities
measured by O and O’ is given by the Lorentz velocity transformation:

$$v'_x = \frac{v_x - u}{1 - v_x u/c^2} \qquad (2.28a)$$

$$v'_y = \frac{v_y\sqrt{1 - u^2/c^2}}{1 - v_x u/c^2} \qquad (2.28b)$$

$$v'_z = \frac{v_z\sqrt{1 - u^2/c^2}}{1 - v_x u/c^2} \qquad (2.28c)$$

By solving Eq. 2.28a for $v_x$, you can show that it is identical to Eq. 2.17, a result
we derived previously based on Einstein’s postulates. Note that, in the limit of
low speeds ($u \ll c$), the Lorentz velocity transformation reduces to the Galilean
velocity transformation, Eq. 2.2. Note also that $v'_y \neq v_y$, even though $y' = y$. This
occurs because of the way the Lorentz transformation handles the time coordinate.

We can derive these transformation equations for velocity from the Lorentz
coordinate transformation. By way of example, we derive the velocity
transformation for $v'_y = dy'/dt'$. Differentiating the coordinate transformation $y' = y$,
we obtain $dy' = dy$. Similarly, differentiating the time coordinate transformation
(Eq. 2.23d), we obtain

$$dt' = \frac{dt - (u/c^2)\,dx}{\sqrt{1 - u^2/c^2}}$$

So

$$v'_y = \frac{dy'}{dt'} = \frac{dy}{[dt - (u/c^2)\,dx]/\sqrt{1 - u^2/c^2}} = \sqrt{1 - u^2/c^2}\,\frac{dy}{dt - (u/c^2)\,dx}$$

$$= \sqrt{1 - u^2/c^2}\,\frac{dy/dt}{1 - (u/c^2)\,dx/dt} = \frac{v_y\sqrt{1 - u^2/c^2}}{1 - u v_x/c^2}$$

Similar methods can be used to obtain the transformation equations for $v'_x$ and
$v'_z$. These derivations are left as exercises (Problem 14).
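
A small numerical experiment illustrates two features of the velocity transformation. In the sketch below (function name and test values are our own), a light beam that O sees traveling along the y axis acquires an x’ component for O’ but still has speed c, and replacing u by -u in the x equation reproduces the result of Example 2.7.

```python
import math

def transform_velocity(vx, vy, vz, u, c=1.0):
    """Lorentz velocity transformation from O to O', Eqs. 2.28a-c.

    O' moves at speed u along the x axis of O; velocities are in units of c by default.
    """
    root = math.sqrt(1.0 - u**2 / c**2)
    denom = 1.0 - vx * u / c**2
    return (vx - u) / denom, vy * root / denom, vz * root / denom

u = 0.80

# Light travelling along y according to O: components change, the speed does not
lx, ly, lz = transform_velocity(0.0, 1.0, 0.0, u)
light_speed = math.sqrt(lx**2 + ly**2 + lz**2)

# Inverse transformation (u replaced by -u) applied to the missile of Example 2.7:
# 0.60c in the ship frame should give about 0.95c in the Earth frame
vx_earth, _, _ = transform_velocity(0.60, 0.0, 0.0, -u)

print(f"light in O': ({lx:.2f}, {ly:.2f}, {lz:.2f}), speed {light_speed:.3f}c")
print(f"missile speed in Earth frame: {vx_earth:.3f}c")
```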





Simultaneity and Clock Synchronization 


Under ordinary circumstances, synchronizing one clock with another is a simple 
matter. But for scientific work, where timekeeping at a precision below the 
nanosecond range is routine, clock synchronization can present some significant 
challenges. At the very least, we need to correct for the time that it takes for the signal
showing the reading on one clock to be transmitted to the other clock. However, 
for observers who are in motion with respect to each other, special relativity gives 
yet another way that clocks may appear to be out of synchronization. 

Consider the device shown in Figure 2.17. Two clocks are located at x = 0 and
x = L. A flash lamp is located at x = L/2, and the clocks are set running when they
receive the flash of light from the lamp. The light takes the same interval of time
to reach the two clocks, so the clocks start together precisely at a time L/2c after 
the flash is emitted, and the clocks are exactly synchronized. 

Now let us examine the same situation from the point of view of the moving 
observer O’. In the frame of reference of O, two events occur: the receipt of a 
light signal by clock 1 at $x_1 = 0$, $t_1 = L/2c$ and the receipt of a light signal by
clock 2 at $x_2 = L$, $t_2 = L/2c$. Using Eq. 2.23d, we find that O’ observes clock 1 to
receive its signal at

$$t'_1 = \frac{t_1 - (u/c^2)x_1}{\sqrt{1 - u^2/c^2}} = \frac{L/2c}{\sqrt{1 - u^2/c^2}} \tag{2.29}$$

while clock 2 receives its signal at

$$t'_2 = \frac{t_2 - (u/c^2)x_2}{\sqrt{1 - u^2/c^2}} = \frac{L/2c - (u/c^2)L}{\sqrt{1 - u^2/c^2}} \tag{2.30}$$


Thus $t'_2$ is smaller than $t'_1$ and clock 2 appears to receive its signal earlier than
clock 1, so that the clocks start at times that differ by

$$\Delta t' = t'_1 - t'_2 = \frac{uL/c^2}{\sqrt{1 - u^2/c^2}} \tag{2.31}$$

according to O’. Keep in mind that this is not a time dilation effect. Time
dilation comes from the first term of the Lorentz transformation (Eq. 2.23d) for t’,
while the lack of synchronization arises from the second term. O’ observes both
clocks to run slow, due to time dilation; O’ also observes clock 2 to be ahead of
clock 1.

We therefore reach the following conclusion: two events that are simultaneous
in one reference frame are not simultaneous in another reference frame moving
with respect to the first, unless the two events occur at the same point in space.
(If L = 0, Eq. 2.31 shows that the clocks are synchronized in all reference
frames.) Clocks that appear to be synchronized in one frame of reference will not
necessarily be synchronized in another frame of reference in relative motion.

It is important to note that this clock synchronization effect does not depend
on the location of observer O’ but only on the velocity of O’. In Figure 2.17, the
location of O’ could have been drawn far to the left side of clock 1 or far to the
right side of clock 2, and the result would be the same. In those different locations,
the propagation time of the light signal showing clock 1 starting will differ from
the propagation time of the light signal showing clock 2 starting. However, O’
is assumed to be an “intelligent” observer who is aware of the locations where
the light signals showing the two clocks starting are received relative to the
locations of the clocks. O’ corrects for this time difference, which is due only to
the propagation time of the light signals, and even after making that correction
the clocks still do not appear to be synchronized!

Although the location of O’ does not appear in Eq. 2.31, the direction of
the velocity of O’ is important: if O’ is moving in the opposite direction, the
observed starting order of the two clocks is reversed.
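The synchronization offset can be checked by transforming the two reception events directly. This is a hedged sketch: the helper `t_prime`, the clock separation L = 600 m, and the speed u = 0.60c are hypothetical values chosen for illustration and do not come from the text.

```python
import math

def t_prime(t, x, u, c):
    """Time coordinate of event (x, t) in the frame of O' (Eq. 2.23d)."""
    return (t - u * x / c**2) / math.sqrt(1.0 - u**2 / c**2)

c = 3.00e8        # m/s
L = 600.0         # m, hypothetical clock separation
u = 0.60 * c      # hypothetical speed of O' relative to O

t1p = t_prime(L / (2 * c), 0.0, u, c)   # clock 1 (x = 0) receives the flash
t2p = t_prime(L / (2 * c), L, u, c)     # clock 2 (x = L) receives the flash

delta = t1p - t2p
formula = (u * L / c**2) / math.sqrt(1.0 - u**2 / c**2)   # Eq. 2.31

print("t1' - t2' =", delta, "s")
print("Eq. 2.31  =", formula, "s")
```

Reversing the sign of `u` reverses the sign of `delta`, which corresponds to the reversed starting order for an observer moving in the opposite direction.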
Example 2.9

Two rockets are leaving their space station along perpendicular
paths, as measured by an observer on the space
station. Rocket 1 moves at 0.60c and rocket 2 moves at
0.80c, both measured relative to the space station. What is
the velocity of rocket 2 as observed by rocket 1?

Solution
Observer O is the space station, observer O’ is rocket 1
(moving at u = 0.60c), and each observes rocket 2, moving
(according to O) in a direction perpendicular to rocket 1.
We take this to be the y direction of the reference frame of
O. Thus O observes rocket 2 to have velocity components
$v_x = 0$, $v_y = 0.80c$, as shown in Figure 2.18a.

We can find $v'_x$ and $v'_y$ using the Lorentz velocity transformation:

$$v'_x = \frac{v_x - u}{1 - v_x u/c^2} = \frac{0 - 0.60c}{1 - 0(0.60c)/c^2} = -0.60c$$

$$v'_y = \frac{v_y\sqrt{1 - u^2/c^2}}{1 - v_x u/c^2} = \frac{0.80c\sqrt{1 - (0.60c)^2/c^2}}{1 - 0(0.60c)/c^2} = 0.64c$$

Thus, according to O’, the situation looks like Figure 2.18b.
The speed of rocket 2 according to O’ is
$\sqrt{(0.60c)^2 + (0.64c)^2} = 0.88c$, less than c. According to
the Galilean transformation, $v'_y$ would be identical with $v_y$,
and thus the speed would be $\sqrt{(0.60c)^2 + (0.80c)^2} = c$.
Once again, the Lorentz transformation prevents relative
speeds from reaching or exceeding the speed of light.

FIGURE 2.18 Example 2.9. (a) As viewed from the reference
frame of O. (b) As viewed from the reference frame of O’.

Example 2.10

In Example 2.6, two events that were simultaneous to O
(the lining up of the front and back of the rocket ship with
the ends of the platform) were not simultaneous to O’. Find
the time interval between these events according to O’.

Solution
According to O, the two simultaneous events are separated
by a distance of L = 65 m. For u = 0.80c, Eq. 2.31 gives

$$\Delta t' = \frac{uL/c^2}{\sqrt{1 - u^2/c^2}} = \frac{(0.80)(65\ \text{m})/(3.00 \times 10^8\ \text{m/s})}{\sqrt{1 - (0.80)^2}} = 0.29\ \mu\text{s}$$

which agrees with the result calculated in part (e) of
Example 2.6.





2.6 THE TWIN PARADOX 


We now turn briefly to what has become known as the twin paradox. Suppose there 
is a pair of twins on Earth. One, whom we shall call Casper, remains on Earth, while 
his twin sister Amelia sets off in a rocket ship on a trip to a distant planet. Casper, 
based on his understanding of special relativity, knows that his sister’s clocks will
run slow relative to his own and that therefore she should be younger than he when
she returns, as our discussion of time dilation would suggest. However, recalling 
that discussion, we know that for two observers in relative motion, each thinks the 
other’s clocks are running slow. We could therefore study this problem from the 
point of view of Amelia, according to whom Casper and the Earth (accompanied 
by the solar system and galaxy) make a round-trip journey away from her and 
back again. Under such circumstances, she will think it is her brother’s clocks 
(which are now in motion relative to her own) that are running slow, and will 
therefore expect her brother to be younger than she when they meet again. While 
it is possible to disagree over whose clocks are running slow relative to his or her 
own, which is merely a problem of frames of reference, when Amelia returns to 
Earth (or when the Earth returns to Amelia), all observers must agree as to which 
twin has aged less rapidly. This is the paradox—each twin expects the other to be 
younger. 

The resolution of this paradox lies in considering the asymmetric role of the 
two twins. The laws of special relativity apply only to inertial frames, those 
moving relative to one another at constant velocity. We may supply Amelia’s 
rockets with sufficient thrust so that they accelerate for a very short length of time, 
bringing the ship to a speed at which it can coast to the planet, and thus during 
her outward journey Amelia spends all but a negligible amount of time in a frame 
of reference moving at constant speed relative to Casper. However, in order to 
return to Earth, she must decelerate and reverse her motion. Although this also 
may be done in a very short time interval, Amelia’s return journey occurs in a 
completely different inertial frame than her outward journey. It is Amelia’s jump 
from one inertial frame to another that causes the asymmetry in the ages of the 
twins. Only Amelia has the necessity of jumping to a new inertial frame to return, 
and therefore all observers will agree that it is Amelia who is “really” in motion, 
and that it is her clocks that are “really” running slow; therefore she is indeed the 
younger twin on her return. 

Let us make this discussion more quantitative with a numerical example. We 
assume, as discussed above, that the acceleration and deceleration take negligible 
time intervals, so that all of Amelia’s aging is done during the coasting. For 
simplicity, we assume the distant planet is at rest relative to the Earth; this does 
not change the problem, but it avoids the need to introduce yet another frame of 
reference. Suppose the planet to be 6 light-years distant from Earth, and suppose 
Amelia travels at a speed of 0.6c. Then according to Casper it takes his sister 
10 years (10 years × 0.6c = 6 light-years) to reach the planet and 10 years to
return, and therefore she is gone for a total of 20 years. (However, Casper doesn’t 
know his sister has reached the planet until the light signal carrying news of her 
arrival reaches Earth. Since light takes 6 years to make the journey, it is 16 years 
after her departure when Casper sees his sister’s arrival at the planet. Four years 
later she returns to Earth.) From the frame of reference of Amelia aboard the 
rocket, the distance to the planet is contracted by a factor of $\sqrt{1 - (0.6)^2} = 0.8$,
and is therefore 0.8 × 6 light-years = 4.8 light-years. At a speed of 0.6c, Amelia
will measure 8 years for the trip to the planet, for a total round trip time of 16 years. 
Thus Casper ages 20 years while Amelia ages only 16 years and is indeed the 
younger on her return. 
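The arithmetic of this example is easy to reproduce. The following is a small sketch under the same idealization as the text (negligible acceleration time); the variable names are illustrative.

```python
import math

distance_ly = 6.0   # Earth-planet distance in Casper's frame (light-years)
beta = 0.6          # Amelia's speed as a fraction of c

# Round-trip time on Casper's clock.
casper_years = 2 * distance_ly / beta

# In Amelia's frame the distance is length-contracted.
contraction = math.sqrt(1.0 - beta**2)
amelia_years = 2 * distance_ly * contraction / beta

print("Casper ages", casper_years, "years")
print("Amelia ages", amelia_years, "years")
```

Changing `beta` shows how quickly the age difference grows as the speed approaches c.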

We can confirm this analysis by having Casper send a light signal to his sister
each year on his birthday. We know that the frequency of the signal as received
by Amelia will be Doppler shifted. During the outward journey, she will receive
signals at the rate of

$$(1/\text{year})\sqrt{\frac{1 - u/c}{1 + u/c}} = 0.5/\text{year}$$

During the return journey, the Doppler-shifted rate will be

$$(1/\text{year})\sqrt{\frac{1 + u/c}{1 - u/c}} = 2/\text{year}$$

Thus for the first 8 years, during Amelia’s trip to the planet, she receives 4 signals,
and during the return trip of 8 years, she receives 16 signals, for a total of 20. She
receives 20 signals, indicating her brother has celebrated 20 birthdays during her
16-year journey.

FIGURE 2.19 A spacetime diagram.

FIGURE 2.20 Casper’s spacetime diagram, showing his worldline and Amelia’s.
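The count of birthday signals follows directly from the two Doppler-shifted rates. A minimal sketch, assuming as in the text that each leg lasts 8 years on Amelia’s clock (variable names are illustrative):

```python
import math

beta = 0.6         # Amelia's speed as a fraction of c
leg_years = 8.0    # duration of each leg on Amelia's clock
sent_rate = 1.0    # Casper sends one signal per year

# Relativistic Doppler factors for a receding and an approaching source.
outbound_rate = sent_rate * math.sqrt((1 - beta) / (1 + beta))
return_rate = sent_rate * math.sqrt((1 + beta) / (1 - beta))

signals = outbound_rate * leg_years + return_rate * leg_years

print("outbound rate:", outbound_rate, "per year")
print("return rate:  ", return_rate, "per year")
print("signals received:", signals)
```

The total matches the 20 years that elapse on Casper’s clock, even though only 16 years elapse for Amelia.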


Spacetime Diagrams 


A particularly helpful way of visualizing the journeys of Casper and Amelia uses 
a spacetime diagram. Figure 2.19 shows an example of a spacetime diagram for 
motion that involves only one spatial direction. 

In your introductory physics course, you probably became familiar with 
plotting motion on a graph in which distance appeared on the vertical axis and 
time on the horizontal axis. On such a graph, a straight line represents motion 
at constant velocity; the slope of the line is equal to the velocity. Note that the 
axes of the spacetime diagram are switched from the traditional graph of particle 
motion, with time on the vertical axis and space on the horizontal axis. 

On a spacetime diagram, the graph that represents the motion of a particle is 
called its worldline. The inverse of the slope of the particle’s worldline gives its 
velocity. Equivalently, the velocity is given by the tangent of the angle that the 
worldline makes with the vertical axis (rather than with the horizontal axis, as 
would be the case with a conventional plot of distance vs. time). Usually, the units 
of x and t are chosen so that motion at the speed of light is represented by a line
with a 45° slope. A vertical line represents a particle that is at the same spatial
location at all times, that is, a particle at rest. Permitted motions with constant
velocity are then represented by straight lines between the vertical and the 45° 
line representing the maximum velocity. 

Let’s draw the worldlines of Casper and Amelia according to Casper’s frame 
of reference. Casper’s worldline is a vertical line, because he is at rest in this frame 
(Figure 2.20). In Casper’s frame of reference, 20 years pass between Amelia’s 
departure and her return, so we can follow Casper’s vertical worldline for 
20 years. 

Amelia is traveling at a speed of 0.6c, so her worldline makes an angle with 
the vertical whose tangent is 0.6 (31°). In Casper’s frame of reference, the planet 
visited by Amelia is 6 light-years from Earth. Amelia travels a distance of 6 
light-years in a time of 10 years (according to Casper) so that v = 6 light-years/10 
years = 0.6c. 

The birthday signals that Casper sends to Amelia at the speed of light are 
represented by the series of 45° lines in Figure 2.20. Amelia receives 4 birthday 
signals during her outbound journey (the 4th arrives just as she reaches the planet) 
and 16 birthday signals during her return journey (the 16th is sent and received 
just as she returns to Earth). 


It is left as an exercise (Problems 22 and 24) to consider the situation if it is 
Amelia who is sending the signals. 


2.7 RELATIVISTIC DYNAMICS 


We have seen how Einstein’s postulates have led to a new “relative” interpretation 
of such previously absolute concepts as length and time, and that the classical 
concept of absolute velocity is not valid. It is reasonable then to ask how far this 
revolution is to go in changing our interpretation of physical concepts. Dynamical 
quantities, such as momentum and kinetic energy, depend on length, time, and 
velocity. Do classical laws of momentum and energy conservation remain valid 
in Einstein’s relativity? 

Let’s test the conservation laws by examining the collision shown in
Figure 2.21a. Two particles collide elastically as observed in the reference
frame of O’. Particle 1 of mass $m_1 = 2m$ is initially at rest, and particle 2 of
mass $m_2 = m$ is moving in the negative x direction with an initial velocity of
$v'_{2i} = -0.750c$. Using the classical law of momentum conservation to analyze
this collision, O’ would calculate the particles to be moving with final velocities
$v'_{1f} = -0.500c$ and $v'_{2f} = +0.250c$. According to O’, the total initial and final
momenta of the particles would be:

$$p'_i = m_1 v'_{1i} + m_2 v'_{2i} = (2m)(0) + (m)(-0.750c) = -0.750mc$$

$$p'_f = m_1 v'_{1f} + m_2 v'_{2f} = (2m)(-0.500c) + (m)(0.250c) = -0.750mc$$


The initial and final momenta are equal according to O’, demonstrating that 
momentum is conserved. 

FIGURE 2.21 (a) A collision between two particles as observed from the reference frame of O’.
(b) The same collision observed from the reference frame of O.

Suppose that the reference frame of O’ moves at a velocity of u = +0.550c in
the x direction relative to observer O, as in Figure 2.21b. How would observer
O analyze this collision? We can find the initial and final velocities of the two
particles according to O using the velocity transformation of Eq. 2.17, which gives
the initial velocities shown in the figure ($v_{1i} = +0.550c$ and $v_{2i} = -0.340c$) and
the final velocities $v_{1f} = +0.069c$ and $v_{2f} = +0.703c$. Observer O can now calculate
the initial and final values of the total momentum of the two particles:

$$p_i = m_1 v_{1i} + m_2 v_{2i} = (2m)(+0.550c) + (m)(-0.340c) = +0.760mc$$

$$p_f = m_1 v_{1f} + m_2 v_{2f} = (2m)(+0.069c) + (m)(+0.703c) = +0.841mc$$


Momentum is therefore not conserved according to observer O. 

This collision experiment has shown that the law of conservation of linear
momentum, with momentum defined as p = mv, does not satisfy Einstein’s first 
postulate (the law must be the same in all inertial frames). We cannot have a 
law that is valid for some observers but not for others. Therefore, if we are to 
retain the conservation of momentum as a general law consistent with Einstein’s 
first postulate, we must find a new definition of momentum. This new definition 
of momentum must have two properties: (1) It must yield a law of conservation 
of momentum that satisfies the principle of relativity; that is, if momentum is 
conserved according to an observer in one inertial frame, then it is conserved 
according to observers in all inertial frames. (2) At low speeds, the new definition 
must reduce to p = mv, which we know works perfectly well in the nonrelativistic 
case. 

These requirements are satisfied by defining the relativistic momentum for a 
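particle of mass m moving with velocity $\vec{v}$ as

$$\vec{p} = \frac{m\vec{v}}{\sqrt{1 - v^2/c^2}} \tag{2.32}$$

In terms of components, we can write Eq. 2.32 as

$$p_x = \frac{mv_x}{\sqrt{1 - v^2/c^2}} \quad\text{and}\quad p_y = \frac{mv_y}{\sqrt{1 - v^2/c^2}} \tag{2.33}$$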


The velocity v that appears in the denominator of these expressions is always the 
velocity of the particle as measured in a particular inertial frame. It is not the 
velocity of an inertial frame. The velocity in the numerator can be any of the 
components of the velocity vector. 

We can now reanalyze the collision shown in Figure 2.21 using the relativistic 
definition of momentum. The initial relativistic momentum according to O’ is 


$$p'_i = \frac{m_1 v'_{1i}}{\sqrt{1 - v'^2_{1i}/c^2}} + \frac{m_2 v'_{2i}}{\sqrt{1 - v'^2_{2i}/c^2}} = \frac{(2m)(0)}{\sqrt{1 - 0^2}} + \frac{(m)(-0.750c)}{\sqrt{1 - (0.750)^2}} = -1.134mc$$

The final velocities according to O’ are $v'_{1f} = -0.585c$ and $v'_{2f} = +0.294c$, and
the total final momentum is

$$p'_f = \frac{m_1 v'_{1f}}{\sqrt{1 - v'^2_{1f}/c^2}} + \frac{m_2 v'_{2f}}{\sqrt{1 - v'^2_{2f}/c^2}} = \frac{(2m)(-0.585c)}{\sqrt{1 - (0.585)^2}} + \frac{(m)(0.294c)}{\sqrt{1 - (0.294)^2}} = -1.134mc$$

Thus $p'_i = p'_f$, and observer O’ concludes that momentum is conserved. According
to O, the initial relativistic momentum is

$$p_i = \frac{m_1 v_{1i}}{\sqrt{1 - v^2_{1i}/c^2}} + \frac{m_2 v_{2i}}{\sqrt{1 - v^2_{2i}/c^2}} = \frac{(2m)(+0.550c)}{\sqrt{1 - (0.550)^2}} + \frac{(m)(-0.340c)}{\sqrt{1 - (0.340)^2}} = +0.956mc$$

Using the velocity transformation, the final velocities measured by O are $v_{1f} = -0.051c$
and $v_{2f} = +0.727c$, and so O calculates the final momentum to be

$$p_f = \frac{m_1 v_{1f}}{\sqrt{1 - v^2_{1f}/c^2}} + \frac{m_2 v_{2f}}{\sqrt{1 - v^2_{2f}/c^2}} = \frac{(2m)(-0.051c)}{\sqrt{1 - (0.051)^2}} + \frac{(m)(+0.727c)}{\sqrt{1 - (0.727)^2}} = +0.956mc$$

Observer O also concludes that $p_i = p_f$ and that the law of conservation of
momentum is valid. Defining momentum according to Eq. 2.32 gives conservation
of momentum in all reference frames, as required by the principle of relativity.
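The comparison between the classical and relativistic definitions can be repeated numerically. This is a hedged sketch, not the author’s calculation: it takes the final velocities quoted in the text for the frame of O’, transforms them to the frame of O with Eq. 2.17, and totals the momentum with each definition. The helper names and the units (c = 1, masses in units of m) are assumptions. Because the quoted velocities are rounded to three digits, the relativistic totals agree only to about that precision.

```python
import math

def to_frame_O(v_prime, u):
    """Velocity in O from velocity in O' (Eq. 2.17), speeds in units of c."""
    return (v_prime + u) / (1.0 + v_prime * u)

def p_classical(m, v):
    return m * v

def p_relativistic(m, v):
    return m * v / math.sqrt(1.0 - v**2)   # Eq. 2.32 with c = 1

masses = (2.0, 1.0)          # m1 = 2m, m2 = m, in units of m
u = 0.550                    # speed of O' relative to O
initial_Op = (0.0, -0.750)   # initial velocities in O'

def totals(final_Op, p):
    """Total (initial, final) momentum in O for a given momentum definition."""
    vi = [to_frame_O(v, u) for v in initial_Op]
    vf = [to_frame_O(v, u) for v in final_Op]
    return (sum(p(m, v) for m, v in zip(masses, vi)),
            sum(p(m, v) for m, v in zip(masses, vf)))

# Final velocities in O' as quoted in the text for each analysis.
classical = totals((-0.500, 0.250), p_classical)
relativistic = totals((-0.585, 0.294), p_relativistic)

print("classical    (p_i, p_f) in units of mc:", classical)
print("relativistic (p_i, p_f) in units of mc:", relativistic)
```

The classical totals differ by roughly 0.08 mc, while the relativistic totals agree to within the rounding of the input velocities.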


Example 2.11

What is the momentum of a proton moving at a speed of
v = 0.86c?

Solution
Using Eq. 2.32, we obtain

$$p = \frac{mv}{\sqrt{1 - v^2/c^2}} = \frac{(1.67 \times 10^{-27}\ \text{kg})(0.86)(3.00 \times 10^8\ \text{m/s})}{\sqrt{1 - (0.86)^2}} = 8.44 \times 10^{-19}\ \text{kg}\cdot\text{m/s}$$

The units of kg·m/s are generally not convenient in solving
problems of this type. Instead, we manipulate Eq. 2.32
to obtain

$$pc = \frac{mvc}{\sqrt{1 - v^2/c^2}} = \frac{mc^2(v/c)}{\sqrt{1 - v^2/c^2}} = \frac{(938\ \text{MeV})(0.86)}{\sqrt{1 - (0.86)^2}} = 1580\ \text{MeV}$$

Here we have used the proton’s rest energy mc², which
is defined later in this section. The momentum is obtained
from this result by dividing by the symbol c (not its
numerical value), which gives

p = 1580 MeV/c

The units of MeV/c for momentum are often used in relativistic
calculations because, as we show later, the quantity
pc often appears in these calculations. You should be able
to convert MeV/c to kg·m/s and show that the two results
obtained for p are equivalent.
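The equivalence of the two results can be confirmed with a short calculation. In this sketch the conversion factor 1 MeV = 1.602 × 10⁻¹³ J is an assumption that is not stated in the example, and the variable names are illustrative.

```python
import math

c = 3.00e8                 # m/s
m_kg = 1.67e-27            # proton mass in kg
rest_energy_mev = 938.0    # proton rest energy mc^2 in MeV
beta = 0.86                # v/c
mev_to_joule = 1.602e-13   # 1 MeV in joules (assumed conversion factor)

gamma = 1.0 / math.sqrt(1.0 - beta**2)

p_si = gamma * m_kg * beta * c              # kg*m/s, directly from Eq. 2.32
pc_mev = gamma * rest_energy_mev * beta     # pc in MeV
p_si_from_mev = pc_mev * mev_to_joule / c   # MeV/c converted to kg*m/s

print("p  =", p_si, "kg*m/s")
print("pc =", pc_mev, "MeV")
print("p converted from MeV/c =", p_si_from_mev, "kg*m/s")
```

The two SI values differ only at the level of the rounding in the input constants.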








Relativistic Kinetic Energy 


Like the classical definition of momentum, the classical definition of kinetic 
energy also causes difficulties when we try to compare the interpretations of 
different observers. According to O’, the initial and final kinetic energies in the 
collision shown in Figure 2.21a are: 

$$K'_i = \tfrac{1}{2}m_1 v'^2_{1i} + \tfrac{1}{2}m_2 v'^2_{2i} = (0.5)(2m)(0)^2 + (0.5)(m)(-0.750c)^2 = 0.281mc^2$$

$$K'_f = \tfrac{1}{2}m_1 v'^2_{1f} + \tfrac{1}{2}m_2 v'^2_{2f} = (0.5)(2m)(-0.500c)^2 + (0.5)(m)(0.250c)^2 = 0.281mc^2$$

and so energy is conserved according to O’. The initial and final kinetic energies
observed from the reference frame of O (as in Figure 2.21b) are

$$K_i = \tfrac{1}{2}m_1 v^2_{1i} + \tfrac{1}{2}m_2 v^2_{2i} = (0.5)(2m)(0.550c)^2 + (0.5)(m)(-0.340c)^2 = 0.360mc^2$$

$$K_f = \tfrac{1}{2}m_1 v^2_{1f} + \tfrac{1}{2}m_2 v^2_{2f} = (0.5)(2m)(0.069c)^2 + (0.5)(m)(0.703c)^2 = 0.252mc^2$$

Thus energy is not conserved in the reference frame of O if we use the classical 
formula for kinetic energy. This leads to a serious inconsistency—an elastic 
collision for one observer would not be elastic for another observer. As in the 
case of momentum, if we want to preserve the law of conservation of energy for 
all observers, we must replace the classical formula for kinetic energy with an 
expression that is valid in the relativistic case (but that reduces to the classical 
formula for low speeds). 

We can derive the relativistic expression for the kinetic energy of a particle 
using essentially the same procedure used to derive the classical expression, 
starting with the particle form of the work-energy theorem (see Problem 28). The 
result of this calculation is 


$$K = \frac{mc^2}{\sqrt{1 - v^2/c^2}} - mc^2 \tag{2.34}$$


Using Eq. 2.34, you can show that both O and O’ will conclude that kinetic energy 
is conserved. In fact, all observers will agree on the applicability of the energy 
conservation law using the relativistic definition for kinetic energy. 

Equation 2.34 looks very different from the classical result $K = \tfrac{1}{2}mv^2$, but, as
you should show (see Problem 32), Eq. 2.34 reduces to the classical expression in
the limit of low speeds (v ≪ c).

The classical expression for kinetic energy also violates the second relativity 
postulate by allowing speeds in excess of the speed of light. There is no limit 
(in either classical or relativistic dynamics) to the energy we can give to a 
particle. Yet, if we allow the kinetic energy to increase without limit, the classical 
expression $K = \tfrac{1}{2}mv^2$ implies that the velocity must correspondingly increase
without limit, thereby violating the second postulate. You can also see from the
first term of Eq. 2.34 that K → ∞ as v → c. Thus we can increase the relativistic
kinetic energy of a particle without limit, and its speed will not exceed c. 
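Both limits of Eq. 2.34 can be seen numerically. The sketch below works in units where the kinetic energy is expressed as a multiple of mc²; the function names and the sample speeds are illustrative assumptions.

```python
import math

def kinetic_relativistic(beta):
    """K / (m c^2) from Eq. 2.34, with beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta**2) - 1.0

def kinetic_classical(beta):
    """K / (m c^2) from the classical formula (1/2) m v^2."""
    return 0.5 * beta**2

# At low speed the two agree; near c the relativistic value grows without bound.
for beta in (0.01, 0.1, 0.5, 0.9, 0.99, 0.999):
    print(beta, kinetic_classical(beta), kinetic_relativistic(beta))
```

The classical value never exceeds 0.5 mc² for v < c, whereas the relativistic value increases without limit as v approaches c.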


Relativistic Total Energy and Rest Energy 


We can also express Eq. 2.34 as

$$K = E - E_0 \tag{2.35}$$

where the relativistic total energy E is defined as

$$E = \frac{mc^2}{\sqrt{1 - v^2/c^2}} \tag{2.36}$$

and the rest energy $E_0$ is defined as

$$E_0 = mc^2 \tag{2.37}$$


The rest energy is in effect the relativistic total energy of a particle measured in a 
frame of reference in which the particle is at rest. 
Sometimes m in Eq. 2.37 is called the rest mass $m_0$ and is distinguished from
the “relativistic mass,” which is defined as $m_0/\sqrt{1 - v^2/c^2}$. We choose not to use
relativistic mass, because it can be a misleading concept. Whenever we refer to
mass, we always mean rest mass.

Equation 2.37 suggests that mass can be expressed in units of energy divided
by c², such as MeV/c². For example, a proton has a rest energy of 938 MeV and
thus a mass of 938 MeV/c². Just like expressing momentum in units of MeV/c,
expressing mass in units of MeV/c² turns out to be very useful in calculations.

The relativistic total energy is given by Eq. 2.35 as 


$$E = K + E_0 \tag{2.38}$$


Collisions of particles at high energies often result in the production of new 
particles, and thus the final rest energy may not be equal to the initial rest energy 
(see Example 2.18). Such collisions must be analyzed using conservation of total 
relativistic energy E; kinetic energy will not be conserved when the rest energy 
changes in a collision. In the special example of the elastic collision considered 
in this section, the identities of the particles did not change, and so kinetic energy 
was conserved. In general, collisions do not conserve kinetic energy—it is the 
relativistic total energy that is conserved in collisions. 

Manipulation of Eqs. 2.32 and 2.36 gives a useful relationship among the total 
energy, momentum, and rest energy: 


$$E = \sqrt{(pc)^2 + (mc^2)^2} \tag{2.39}$$


Figure 2.22 shows a useful mnemonic device for remembering this relationship, 
which has the form of the Pythagorean theorem for the sides of a right triangle. 
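
The manipulation that leads to Eq. 2.39 is short. As a sketch of one possible route (the text does not spell out the steps), square Eq. 2.36 and the magnitude of Eq. 2.32 multiplied by c, then subtract:

```latex
\begin{aligned}
E^2 - (pc)^2 &= \frac{m^2c^4}{1 - v^2/c^2} - \frac{m^2v^2c^2}{1 - v^2/c^2} \\
             &= \frac{m^2c^4\,(1 - v^2/c^2)}{1 - v^2/c^2} \\
             &= (mc^2)^2
\end{aligned}
```

Rearranging gives $E^2 = (pc)^2 + (mc^2)^2$, which is Eq. 2.39 after taking the positive square root.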

When a particle travels at a speed close to the speed of light (say, v > 0.99c),
which often occurs in high-energy particle accelerators, the particle’s kinetic
energy is much greater than its rest energy; that is, $K \gg E_0$. In this case, Eq. 2.39
can be written, to a very good approximation,

$$E \cong pc \tag{2.40}$$

This is called the extreme relativistic approximation and is often useful for
simplifying calculations. As v approaches c, the angle in Figure 2.22 between the
bottom leg of the triangle (representing mc²) and the hypotenuse (representing E)
approaches 90°. Imagine in this case a very tall triangle, in which the vertical leg
(pc) and the hypotenuse (E) are nearly the same length.

For massless particles (such as photons), Eq. 2.39 becomes exactly

$$E = pc \tag{2.41}$$


All massless particles travel at the speed of light; otherwise, by Eqs. 2.34 and 
2.36 their kinetic and total energies would be zero. 
FIGURE 2.22 A useful mnemonic
device for recalling the relationships
among $E_0$, p, K, and E. Note that to
put all variables in energy units, the
quantity pc must be used.
Example 2.12

What are the kinetic and relativistic total energies of a
proton ($E_0$ = 938 MeV) moving at a speed of v = 0.86c?

Solution
In Example 2.11 we found the momentum of this particle
to be p = 1580 MeV/c. The total energy can be found from
Eq. 2.39:

$$E = \sqrt{(pc)^2 + (mc^2)^2} = \sqrt{(1580\ \text{MeV})^2 + (938\ \text{MeV})^2} = 1837\ \text{MeV}$$

The kinetic energy follows from Eq. 2.35:

$$K = E - E_0 = 1837\ \text{MeV} - 938\ \text{MeV} = 899\ \text{MeV}$$

We also could have solved this problem by finding the
kinetic energy directly from Eq. 2.34.

Example 2.13





Find the velocity and momentum of an electron ($E_0$ =
0.511 MeV) with a kinetic energy of 10.0 MeV.

Solution
The total energy is $E = K + E_0$ = 10.0 MeV + 0.511 MeV
= 10.51 MeV. We then can find the momentum from
Eq. 2.39:

$$p = \frac{1}{c}\sqrt{E^2 - (mc^2)^2} = \frac{1}{c}\sqrt{(10.51\ \text{MeV})^2 - (0.511\ \text{MeV})^2} = 10.5\ \text{MeV}/c$$

Note that in this problem we could have used the
extreme relativistic approximation, $p \cong E/c$, from Eq. 2.40.
The error we would make in this case would be
only 0.1%.

The velocity can be found by solving Eq. 2.36 for v.

$$\frac{v}{c} = \sqrt{1 - \left(\frac{mc^2}{E}\right)^2} = \sqrt{1 - \left(\frac{0.511\ \text{MeV}}{10.51\ \text{MeV}}\right)^2} = 0.9988 \tag{2.42}$$
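The steps of this example generalize to any particle whose kinetic and rest energies are known. The following is a minimal sketch; the function name `momentum_and_speed` is hypothetical, and energies are assumed to be given in MeV.

```python
import math

def momentum_and_speed(kinetic_mev, rest_mev):
    """Return (pc in MeV, v/c) for a particle of given kinetic and rest energy."""
    total = kinetic_mev + rest_mev                   # Eq. 2.38
    pc = math.sqrt(total**2 - rest_mev**2)           # Eq. 2.39 solved for pc
    beta = math.sqrt(1.0 - (rest_mev / total)**2)    # Eq. 2.36 solved for v/c
    return pc, beta

# Electron with 10.0 MeV of kinetic energy.
pc, beta = momentum_and_speed(10.0, 0.511)
print("pc  =", pc, "MeV")
print("v/c =", beta)
```

Comparing `pc` with the total energy shows how small the error of the extreme relativistic approximation is for this electron.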





Example 2.14

In the Stanford Linear Collider electrons are accelerated to a
kinetic energy of 50 GeV. Find the speed of such an electron
as (a) a fraction of c, and (b) a difference from c. The rest
energy of the electron is 0.511 MeV = 0.511 × 10⁻³ GeV.

Solution
(a) First we solve Eq. 2.34 for v, obtaining

$$v = c\sqrt{1 - \frac{1}{(1 + K/mc^2)^2}} \tag{2.43}$$

and thus

$$v = c\sqrt{1 - \frac{1}{[1 + (50\ \text{GeV})/(0.511 \times 10^{-3}\ \text{GeV})]^2}} = 0.999\,999\,999\,948c$$

Calculators cannot be trusted to 12 significant digits. Here
is a way to avoid this difficulty. We can write Eq. 2.43
as $v = c(1 + x)^{1/2}$, where $x = -1/(1 + K/mc^2)^2$. Because
$K \gg mc^2$, we have $|x| \ll 1$, and we can use the binomial
expansion to write $v \cong c(1 + \tfrac{1}{2}x)$, or

$$v \cong c\left[1 - \frac{1}{2(1 + K/mc^2)^2}\right]$$

which gives

$$v = c(1 - 5.2 \times 10^{-11})$$

This leads to the same value of v given above.
(b) From the above result, we have

$$c - v = 5.2 \times 10^{-11}c = 0.016\ \text{m/s} = 1.6\ \text{cm/s}$$
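The same precision problem appears in floating-point code, and the same remedy applies. The sketch below is one way to do it, not the text’s method: it computes 1 − v/c both from the first-order binomial expansion and from an algebraically equivalent form (multiplying by the conjugate) that avoids subtracting two nearly equal numbers. Here `eps` is the negative of the text’s x; all names are illustrative.

```python
import math

c = 3.00e8            # m/s
K = 50.0              # kinetic energy in GeV
rest = 0.511e-3       # electron rest energy in GeV

gamma = 1.0 + K / rest
eps = 1.0 / gamma**2  # v/c = sqrt(1 - eps); the text's x equals -eps

# 1 - sqrt(1 - eps) rewritten as eps / (1 + sqrt(1 - eps)) to avoid cancellation.
deficit_exact = eps / (1.0 + math.sqrt(1.0 - eps))

# First-order binomial expansion used in the example.
deficit_binomial = eps / 2.0

print("1 - v/c (conjugate form):", deficit_exact)
print("1 - v/c (binomial):      ", deficit_binomial)
print("c - v =", deficit_exact * c, "m/s")
```

For this electron the two forms agree to far better than the two significant figures quoted in the example.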





Example 2.15

At a distance equal to the radius of the Earth’s orbit
(1.5 × 10¹¹ m), the Sun’s radiation has an intensity of
about 1.4 × 10³ W/m². Find the rate at which the mass of
the Sun is decreasing.

Solution
If we assume that the Sun’s radiation is distributed uniformly
over the surface area 4πr² of a sphere of radius
1.5 × 10¹¹ m, then the total radiative power emitted by the
Sun is

$$4\pi(1.5 \times 10^{11}\ \text{m})^2(1.4 \times 10^3\ \text{W/m}^2) = 4.0 \times 10^{26}\ \text{W} = 4.0 \times 10^{26}\ \text{J/s}$$

By conservation of energy, we know that the energy lost
by the Sun through radiation must be accounted for by
a corresponding loss in its rest energy. The change in
mass Δm corresponding to a change in rest energy $\Delta E_0$ of
4.0 × 10²⁶ J each second is

$$\Delta m = \frac{\Delta E_0}{c^2} = \frac{4.0 \times 10^{26}\ \text{J}}{9.0 \times 10^{16}\ \text{m}^2/\text{s}^2} = 4.4 \times 10^9\ \text{kg}$$

The Sun loses mass at a rate of about 4 billion kilograms
per second! If this rate were to remain constant, the Sun
(with a present mass of 2 × 10³⁰ kg) would shine “only”
for another 10¹³ years.
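The estimate can be reproduced in a few lines. This sketch uses the values from the example; the length of a year in seconds (3.156 × 10⁷ s) is an assumption added here to convert the lifetime, and the variable names are illustrative.

```python
import math

r = 1.5e11                   # m, radius of the Earth's orbit
intensity = 1.4e3            # W/m^2 at that distance
c = 3.00e8                   # m/s
sun_mass = 2e30              # kg
seconds_per_year = 3.156e7   # assumed length of a year in seconds

power = 4.0 * math.pi * r**2 * intensity   # total radiated power, W
mass_rate = power / c**2                   # kg lost per second
lifetime_years = sun_mass / mass_rate / seconds_per_year

print("power     =", power, "W")
print("mass loss =", mass_rate, "kg/s")
print("lifetime  =", lifetime_years, "years at constant rate")
```

The lifetime figure is only an upper-bound style estimate, since it assumes the entire mass could be radiated at the present rate.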





2.8 CONSERVATION LAWS IN RELATIVISTIC DECAYS AND COLLISIONS





In all decays and collisions, we must apply the law of conservation of momentum. 
The only difference between applying this law for collisions at low speed (as we 
did in Example 1.1) and at high speed is the use of the relativistic expression for 
momentum (Eq. 2.32) instead of Eq. 1.2. The law of conservation of momentum 
for relativistic motion can be stated in exactly the same way as for classical motion: 


In an isolated system of particles, the total linear momentum remains constant.

In the classical case, kinetic energy is the only form of energy that is present in 
elastic collisions, so conservation of energy is equivalent to conservation of kinetic 
energy. In inelastic collisions or decay processes, the kinetic energy does not 
remain constant. Total energy is conserved in classical inelastic collisions, but we 
did not account for the other forms of energy that might be important. This missing 
energy is usually stored in the particles, perhaps as atomic or nuclear energy. 
In the relativistic case, the internal stored energy contributes to the rest energy
of the particles. Usually rest energy and kinetic energy are the only two forms 
of energy that we consider in atomic or nuclear processes (later we'll add the 
energy of radiation to this balance). A loss of kinetic energy in a collision is thus 
accompanied by a gain in rest energy, but the total relativistic energy (kinetic 
energy + rest energy) of all the particles involved in the process doesn’t change. 
For example, in a reaction in which new particles are produced, the loss in kinetic 
energy of the original reacting particles gives the increase in rest energy of the 
product particles. On the other hand, in a nuclear decay process such as alpha 
decay, the initial nucleus gives up some rest energy to account for the kinetic 
energy carried by the decay products. 

The law of energy conservation in the relativistic case is: 


In an isolated system of particles, the relativistic total energy (kinetic energy
plus rest energy) remains constant. 


In applying this law to relativistic collisions, we don’t have to worry whether the 
collision is elastic or inelastic, because the inclusion of the rest energy accounts 
for any loss in kinetic energy. 

The following examples illustrate applications of the conservation laws for 
relativistic momentum and energy. 


Example 2.16

A neutral K meson (mass 497.7 MeV/c²) is moving with a
kinetic energy of 77.0 MeV. It decays into a pi meson (mass
139.6 MeV/c²) and another particle of unknown mass. The
pi meson is moving in the direction of the original K meson
with a momentum of 381.6 MeV/c. (a) Find the momentum
and total relativistic energy of the unknown particle.
(b) Find the mass of the unknown particle.

Solution
(a) The total energy and momentum of the K meson are

$$E_K = K_K + m_K c^2 = 77.0\ \text{MeV} + 497.7\ \text{MeV} = 574.7\ \text{MeV}$$

$$p_K = \frac{1}{c}\sqrt{E_K^2 - (m_K c^2)^2} = \frac{1}{c}\sqrt{(574.7\ \text{MeV})^2 - (497.7\ \text{MeV})^2} = 287.4\ \text{MeV}/c$$

and for the pi meson

$$E_\pi = \sqrt{(cp_\pi)^2 + (m_\pi c^2)^2} = \sqrt{(381.6\ \text{MeV})^2 + (139.6\ \text{MeV})^2} = 406.3\ \text{MeV}$$

Conservation of relativistic momentum ($p_\text{initial} = p_\text{final}$)
gives $p_K = p_\pi + p_x$ (where x represents the unknown particle), so

$$p_x = p_K - p_\pi = 287.4\ \text{MeV}/c - 381.6\ \text{MeV}/c = -94.2\ \text{MeV}/c$$

and conservation of total relativistic energy ($E_\text{initial} = E_\text{final}$)
gives $E_K = E_\pi + E_x$, so

$$E_x = E_K - E_\pi = 574.7\ \text{MeV} - 406.3\ \text{MeV} = 168.4\ \text{MeV}$$

(b) We can find the mass by solving Eq. 2.39 for mc²:

$$m_x c^2 = \sqrt{E_x^2 - (cp_x)^2} = \sqrt{(168.4\ \text{MeV})^2 - (94.2\ \text{MeV})^2} = 139.6\ \text{MeV}$$

Thus the unknown particle has a mass of 139.6 MeV/c²,
and its mass shows that it is another pi meson.
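The bookkeeping of this decay can be scripted. The following is a hedged sketch using the numbers of the example; the helper `pc_from` and the variable names are illustrative. Because the script carries unrounded intermediate values, the mass it prints can differ from 139.6 MeV/c² in the last digit or so.

```python
import math

def pc_from(total, rest):
    """Momentum times c (MeV) from total and rest energy, Eq. 2.39."""
    return math.sqrt(total**2 - rest**2)

m_K, m_pi = 497.7, 139.6   # rest energies in MeV
K_K = 77.0                 # kinetic energy of the K meson in MeV
pc_pi = 381.6              # pi meson momentum times c in MeV

E_K = K_K + m_K
pc_K = pc_from(E_K, m_K)
E_pi = math.sqrt(pc_pi**2 + m_pi**2)

pc_x = pc_K - pc_pi        # momentum conservation along the line of flight
E_x = E_K - E_pi           # conservation of total relativistic energy
m_x = math.sqrt(E_x**2 - pc_x**2)   # rest energy of the unknown particle

print("p_x c =", pc_x, "MeV")
print("E_x   =", E_x, "MeV")
print("m_x c^2 =", m_x, "MeV")
```

The negative sign of `pc_x` indicates that the unknown particle moves opposite to the direction of the original K meson.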
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| Example 2.17 





In the reaction $K^- + p \to \Lambda^0 + \pi^0$, a charged K 
meson (mass 493.7 MeV/c²) collides with a proton 
(938.3 MeV/c²) at rest, producing a lambda particle 
(1115.7 MeV/c²) and a neutral pi meson (135.0 MeV/c²), 
as represented in Figure 2.23. The initial kinetic energy 
of the K meson is 152.4 MeV. After the interaction, the 
pi meson has a kinetic energy of 254.8 MeV. (a) Find the 
kinetic energy of the lambda. (b) Find the directions of 
motion of the lambda and the pi meson. 

















FIGURE 2.23 Example 2.17. (a) A $K^-$ meson collides with a 
proton at rest. (b) After the collision, a $\pi^0$ meson and a $\Lambda^0$ are 
produced. 


Solution 
(a) The initial and final total energies are 


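$$E_{\text{initial}} = E_K + E_p = K_K + m_K c^2 + m_p c^2$$

$$E_{\text{final}} = E_\Lambda + E_\pi = K_\Lambda + m_\Lambda c^2 + K_\pi + m_\pi c^2$$

In these two equations, the value of every quantity is known 
except the kinetic energy of the lambda. Using conservation 
of total relativistic energy, we set $E_{\text{initial}} = E_{\text{final}}$ and 
solve for $K_\Lambda$: 

$$\begin{aligned}
K_\Lambda &= K_K + m_K c^2 + m_p c^2 - m_\Lambda c^2 - K_\pi - m_\pi c^2 \\
&= 152.4\ \text{MeV} + 493.7\ \text{MeV} + 938.3\ \text{MeV} - 1115.7\ \text{MeV} - 254.8\ \text{MeV} - 135.0\ \text{MeV} \\
&= 78.9\ \text{MeV}
\end{aligned}$$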


(b) To find the directional information we must apply 
conservation of momentum. The initial momentum is just 
that of the K meson. From its total energy, 
$E_K = K_K + m_K c^2 = 152.4\ \text{MeV} + 493.7\ \text{MeV} = 646.1\ \text{MeV}$, we can 
find the momentum: 

$$p_{\text{initial}} = p_K = \frac{1}{c}\sqrt{E_K^2 - (m_K c^2)^2} = \frac{1}{c}\sqrt{(646.1\ \text{MeV})^2 - (493.7\ \text{MeV})^2} = 416.8\ \text{MeV}/c$$


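A similar procedure applied to the two final particles 
gives $p_\Lambda = 426.9$ MeV/c and $p_\pi = 365.7$ MeV/c. The 
total momentum of the two final particles is 
$p_{x,\text{final}} = p_\Lambda \cos\theta + p_\pi \cos\phi$ and 
$p_{y,\text{final}} = p_\Lambda \sin\theta - p_\pi \sin\phi$. Conservation 
of momentum in the x and y directions gives 

$$p_\Lambda \cos\theta + p_\pi \cos\phi = p_{\text{initial}} \quad \text{and} \quad p_\Lambda \sin\theta - p_\pi \sin\phi = 0$$

Here we have two equations with two unknowns 
($\theta$ and $\phi$). We can eliminate $\theta$ by writing the first equation as 
$p_\Lambda \cos\theta = p_{\text{initial}} - p_\pi \cos\phi$, then squaring both equations 
and adding them. The resulting equation can be solved 
for $\phi$: 

$$\begin{aligned}
\phi &= \cos^{-1}\left(\frac{p_{\text{initial}}^2 + p_\pi^2 - p_\Lambda^2}{2 p_\pi p_{\text{initial}}}\right) \\
&= \cos^{-1}\left(\frac{(416.8\ \text{MeV}/c)^2 + (365.7\ \text{MeV}/c)^2 - (426.9\ \text{MeV}/c)^2}{2(365.7\ \text{MeV}/c)(416.8\ \text{MeV}/c)}\right) \\
&= 65.7^\circ
\end{aligned}$$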


From the conservation of momentum equation for the y 
components, we have 

$$\theta = \sin^{-1}\left(\frac{p_\pi \sin\phi}{p_\Lambda}\right) = \sin^{-1}\left(\frac{(365.7\ \text{MeV}/c)(\sin 65.7^\circ)}{426.9\ \text{MeV}/c}\right) = 51.3^\circ$$
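The two-step solution of Example 2.17 (energy conservation for the kinetic energy, then momentum conservation for the angles) can be sketched in Python as follows. This is only a numerical check under the same unit convention as before (MeV, MeV/c, MeV/c²); the helper `momentum` and all variable names are hypothetical.

```python
import math

# Rest energies and kinetic energies in MeV, taken from Example 2.17 (c = 1 numerically).
m_K, m_p, m_L, m_pi = 493.7, 938.3, 1115.7, 135.0
K_K, K_pi = 152.4, 254.8

def momentum(K, m):
    """Momentum (MeV/c) of a particle with kinetic energy K and rest energy m."""
    E = K + m
    return math.sqrt(E**2 - m**2)

# (a) Conservation of total relativistic energy gives the lambda's kinetic energy.
K_L = K_K + m_K + m_p - m_L - K_pi - m_pi

# (b) Momenta of the incident K meson and of the two final particles.
p_init = momentum(K_K, m_K)
p_L = momentum(K_L, m_L)
p_pi = momentum(K_pi, m_pi)

# Eliminating theta from the x and y momentum equations leaves a
# law-of-cosines relation for phi; theta then follows from the y equation.
phi = math.degrees(math.acos((p_init**2 + p_pi**2 - p_L**2) / (2 * p_pi * p_init)))
theta = math.degrees(math.asin(p_pi * math.sin(math.radians(phi)) / p_L))

print(f"K_Lambda = {K_L:.1f} MeV, phi = {phi:.1f} deg, theta = {theta:.1f} deg")
```

Because the script keeps full precision in the intermediate momenta, its angles can differ from the rounded values in the worked solution by a few hundredths of a degree.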
| Example 2.18 





The discovery of the antiproton $\bar{p}$ (a particle with the same 
rest energy as a proton, 938 MeV, but with the opposite 
electric charge) took place in 1955 through the following 
reaction: 

$$p + p \to p + p + p + \bar{p}$$


in which accelerated protons were incident on a target of 
protons at rest in the laboratory. The minimum incident 
kinetic energy needed to produce the reaction is called the 
threshold kinetic energy, for which the final particles move 
together as if they were a single unit (Figure 2.24). Find 
the threshold kinetic energy to produce antiprotons in this 
reaction. 

























FIGURE 2.24 Example 2.18. (a) A proton moving with veloc- 
ity v collides with another proton at rest. (b) The reaction 
produces three protons and an antiproton, which move together 
as a unit. 


Solution 

This problem can be solved by a straightforward application 
of energy and momentum conservation. Let $E_1$ and $p_1$ 
represent the total energy and momentum of the incident 
proton. Thus the initial total energy of the two protons is 
$E_1 + m_p c^2$. Let $E_1'$ and $p_1'$ represent the total energy and 
momentum of each of the four final particles (which move 
together and thus have the same energy and momentum). 
We can then apply conservation of total energy: 

$$E_1 + m_p c^2 = 4E_1'$$

and conservation of momentum: 

$$p_1 = 4p_1'$$

We can write the momentum equation as 
$\sqrt{E_1^2 - (m_p c^2)^2} = 4\sqrt{E_1'^2 - (m_p c^2)^2}$, so now we have two equations in two 
unknowns ($E_1$ and $E_1'$). We eliminate $E_1'$, for example 
by solving the energy conservation equation for $E_1'$ and 
substituting into the momentum equation. The result is 

$$E_1 = 7m_p c^2$$

from which we can calculate the kinetic energy of the 
incident proton: 

$$K_1 = E_1 - m_p c^2 = 6m_p c^2 = 6(938\ \text{MeV}) = 5628\ \text{MeV} = 5.628\ \text{GeV}$$


The Bevatron accelerator at the Lawrence Berkeley Labo- 
ratory was designed with this experiment in mind, so that 
it could produce a beam of protons whose energy exceeded 
5.6 GeV. The discovery of the antiproton in this reaction 
was honored with the award of the 1959 Nobel Prize to the 
experimenters, Emilio Segrè and Owen Chamberlain. 
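The threshold result of Example 2.18 can be cross-checked in a few lines of Python. The sketch below uses a general threshold expression that is not derived in the example: the quantity $E^2 - (pc)^2$ for the whole system is the same before and after the collision, and at threshold it equals the square of the summed final rest energies, which gives $K_{\text{th}} = [(\sum m_f c^2)^2 - (m_1 c^2 + m_2 c^2)^2]/(2 m_2 c^2)$ for a target $m_2$ at rest. The function name and its arguments are hypothetical.

```python
import math

m_p = 938.0  # proton rest energy in MeV, as used in Example 2.18

def threshold_kinetic_energy(m_beam, m_target, final_rest_energies):
    """Threshold kinetic energy (MeV) of a beam particle striking a target at rest.

    All arguments are rest energies in MeV. At threshold the final particles
    move together, so the invariant E^2 - (pc)^2 equals (sum of final rest energies)^2.
    """
    M_final = sum(final_rest_energies)
    return (M_final**2 - (m_beam + m_target)**2) / (2.0 * m_target)

K_th = threshold_kinetic_energy(m_p, m_p, [m_p] * 4)

# Check the two conservation equations written in the example.
E_1 = K_th + m_p                      # total energy of the incident proton
E_1_prime = (E_1 + m_p) / 4.0         # energy of each final particle
p_1 = math.sqrt(E_1**2 - m_p**2)      # incident momentum (MeV/c)
p_1_prime = math.sqrt(E_1_prime**2 - m_p**2)

print(f"K_th = {K_th:.0f} MeV, E_1 / (m_p c^2) = {E_1 / m_p:.1f}")
print(f"p_1 = {p_1:.1f} MeV/c, 4 p_1' = {4 * p_1_prime:.1f} MeV/c")
```

The same function applies to other fixed-target reactions by changing the list of final rest energies, which is why the general form is used here instead of hard-coding the factor of 7.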





2.9 EXPERIMENTAL TESTS OF SPECIAL RELATIVITY 


Because special relativity provided such a radical departure from the notions of 
space and time in classical physics, it is important to perform detailed experimental 
tests that can clearly distinguish between the predictions of special relativity and 
those of classical physics. Many tests of increasing precision have been done 
since the theory was originally presented, and in every case the predictions of 
special relativity are upheld. Here we discuss a few of these tests. 
Universality of the Speed of Light 


The second relativity postulate asserts that the speed of light has the same value 
c for all observers. This leads to several types of experimental tests, of which we 
discuss two: (1) Does the speed of light change with the direction of travel? (2) 
Does the speed of light change with relative motion between source and observer? 

The Michelson-Morley experiment provides a test of the first type. This 
experiment compared the upstream-downstream and cross-stream speeds of light 
and concluded that they were equal within the experimental error. Equivalently, 
we may say that the experiment showed that there is no preferred reference frame 
(no ether) relative to which the speed of light must be measured. If there is an 
ether, the speed of the Earth through the ether is less than 5 km/s, which is much 
smaller than the Earth’s orbital speed about the Sun, 30 km/s. We can express 
their result as a difference $\Delta c$ between the upstream-downstream and cross-stream 
speeds; the experiment showed that $\Delta c/c < 3 \times 10^{-10}$. 

To reconcile the result of the Michelson-Morley experiment with classical 
physics, Lorentz proposed the “ether drag” hypothesis, according to which 
the motion of the Earth through the ether caused an electromagnetic drag that 
contracted the arm of the interferometer in the direction of motion. This contraction 
was just enough to compensate for the difference in the upstream-downstream 
and cross-stream times predicted by the Galilean transformation. This hypothesis 
succeeds only when the two arms of the interferometer are of the same length. 
To test this hypothesis, a similar experiment was done in 1932 by Kennedy and 
Thorndike; in their experiment, the lengths of the interferometer arms differed by 
about 16 cm, the maximum distance over which light sources available at that time 
could remain coherent. The Kennedy-Thorndike experiment in effect tests the 
second question, whether the speed of light changes due to relative motion. Their 
result was $\Delta c/c < 3 \times 10^{-8}$, which excludes the Lorentz contraction hypothesis 
as an explanation for the Michelson-Morley experiment. 

In recent years, these fundamental experiments have been repeated with 
considerably improved precision using lasers as light sources. Experimenters 
working at the Joint Institute for Laboratory Astrophysics in Boulder, Colorado, 
built an apparatus that consisted of two He-Ne lasers on a rotating granite 
platform. By electronically stabilizing the lasers, they improved the sensitivity 
of their apparatus by several orders of magnitude. Again expressing the result 
as a difference between the speeds along the two arms of the apparatus, this 
experiment corresponds to $\Delta c/c < 8 \times 10^{-15}$, an improvement of about 5 orders 
of magnitude over the original Michelson-Morley experiment. In a similar 
repetition of the Kennedy-Thorndike experiment using He-Ne lasers, they obtained 
$\Delta c/c < 1 \times 10^{-10}$, an improvement over the original experiment by a factor of 
300. [See A. Brillet and J. L. Hall, Physical Review Letters 42, 549 (1979); D. 
Hils and J. L. Hall, Physical Review Letters 64, 1697 (1990).] A considerable 
improvement in the Kennedy-Thorndike type of experiment has been made 
possible by comparing the oscillation frequency of a crystal with the frequency 
of a hydrogen maser (a maser is similar to a laser, but it uses microwaves rather 
than visible light). The experimenters measured for nearly one year, looking for a 
change in the relative frequencies as the Earth’s velocity changed. No effect was 
observed, leading to a limit of $\Delta c/c < 2 \times 10^{-12}$. [See P. Wolf et al., Physical 
Review Letters 90, 060402 (2003).] 

Another way of testing the second question is to measure the speed of a light 
beam emitted by a source in motion. Suppose we observe this beam along the 
direction of motion of the moving source, which might be moving toward us or 
away from us. In the rest frame of the source, the emitted light travels at speed c. 
We can express the speed of light in our reference frame as $c' = c + \Delta c$, where 
$\Delta c$ is zero according to special relativity ($c' = c$) or is $\pm u$ according to classical 
physics ($c' = c \pm u$ in the Galilean transformation, depending on whether the 
motion is toward or away from the observer). 

In one experiment of this type, the decay of pi mesons (pions) into gamma 
rays (a form of electromagnetic waves traveling at c) was observed. When pions 
(produced in laboratories with large accelerators) emit these gamma rays, they are 
traveling at speeds close to the speed of light, relative to the laboratory. Thus if 
Galilean relativity were valid, we should expect to find gamma rays emitted in the 
direction of motion of the decaying pions traveling at a speed c’ in the laboratory of 
nearly 2c, rather than always with c as predicted by special relativity. The observed 
laboratory speed of these gamma rays in one experiment was $(2.9977 \pm 0.0004) 
\times 10^8$ m/s when the decaying pions were moving at $u/c = 0.99975$. These results 
give $\Delta c/c < 2 \times 10^{-4}$, and thus $c' = c$ as expected from special relativity. This 
experiment shows directly that an object moving at a speed of nearly c relative to 
the laboratory emits “light” that travels at a speed of c relative to both the object 
and the laboratory, giving direct evidence for Einstein’s second postulate. [See T. 
Alvager et al., Physics Letters 12, 260 (1964).] 

Another experiment of this type is to study the X rays emitted by a binary 
pulsar, a rapidly pulsating source of X rays in orbit about another star, which 
would eclipse the pulsar as it rotated in its orbit. If the speed of light (in this case, 
X rays) were to change as the pulsar moved first toward and later away from the 
Earth in its orbit, the beginning and end of the eclipse would not be equally spaced 
in time from the midpoint of the eclipse. No such effect is observed, and from 
these observations it is concluded that $\Delta c/c < 2 \times 10^{-12}$, in agreement with 
predictions of special relativity. These experiments were done at $u/c = 10^{-3}$. 
[See K. Brecher, Physical Review Letters 39, 1051 (1977).] 

A different type of test of the limit by which the speed of light changes with 
direction of travel can be done using the clocks carried aboard the network of Earth 
satellites that make up the Global Positioning System (GPS). By comparing the 
readings of clocks on the GPS satellites with clocks on the ground at different times 
of day (as the satellites move relative to the ground stations), it is possible to test 
whether the change in the direction of travel affects the apparent synchronization 
of the clocks. No effect was observed, and the experimenters were able to set a 
limit of $\Delta c/c < 5 \times 10^{-9}$ for the difference between the one-way and round-trip 
speeds of light. [See P. Wolf and G. Petit, Physical Review A 56, 4405 (1997).] 








Time Dilation 


We have already discussed the time dilation effect on the decay of muons produced 
by cosmic rays. Muon decay can also be studied in the laboratory. Muons can be 
produced following collisions in high-energy accelerators, and the decay of the 
muons can be followed by observing their decay products (ordinary electrons). 
These muons can either be trapped and decay at rest, or they can be placed 
in a beam and decay in flight. When muons are observed at rest, their decay 
lifetime is 2.198 μs. (As we discuss in Chapter 12, decays generally follow an 
exponential law. The lifetime is the time after which a fraction 1/e = 0.368 of 
the original muons remain.) This is the proper lifetime, measured in a frame of 
reference in which the muon is at rest. In one particular experiment, muons were 
trapped in a ring and circulated at a momentum of p = 3094 MeV/c. The decays 
in flight occurred with a lifetime of 64.37 μs (measured in the laboratory frame 
of reference). For muons of this momentum, Eq. 2.8 gives a dilated lifetime of 
(see Problem 43) 64.38 μs, which is in excellent agreement with the measured 
value and confirms the time dilation effect. [See J. Bailey et al., Nature 268, 301 
(1977).] 

Another similar experiment was done with pions. The proper lifetime, measured 
for pions at rest, is known to be 26.0 ns. In one experiment, pions were observed 
in flight at $u/c = 0.913$, and their lifetime was measured to be 63.7 ns. (Pions 
decay to muons, so we can follow the exponential radioactive decay of the pions 
by observing the muons emitted as a result of the decay.) For pions moving at 
this speed, the expected dilated lifetime is in exact agreement with the measured 
value, once again confirming the time dilation effect. [See D. S. Ayres et al., 
Physical Review D 3, 1051 (1971).] 
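The dilated lifetimes quoted for the muon and pion experiments follow from multiplying the proper lifetime by the Lorentz factor. A minimal Python sketch of that arithmetic is given below. The muon rest energy of 105.66 MeV is an assumed value that is not stated in this section, and the two helper functions are hypothetical names.

```python
import math

def gamma_from_momentum(pc, mc2):
    """Lorentz factor from momentum (expressed as pc, in MeV) and rest energy (MeV)."""
    return math.sqrt(pc**2 + mc2**2) / mc2

def gamma_from_beta(beta):
    """Lorentz factor from the speed expressed as beta = u/c."""
    return 1.0 / math.sqrt(1.0 - beta**2)

# Muons circulating in the ring with p = 3094 MeV/c.
# The rest energy 105.66 MeV is an assumption (not given in the passage).
tau_mu = 2.198 * gamma_from_momentum(3094.0, 105.66)   # microseconds

# Pions observed in flight at u/c = 0.913.
tau_pi = 26.0 * gamma_from_beta(0.913)                  # nanoseconds

print(f"muon: {tau_mu:.2f} microseconds, pion: {tau_pi:.1f} ns")
```

With these inputs the muon lifetime comes out near 64.4 μs and the pion lifetime near 63.7 ns. The last digit of the muon value depends on the rest energy and proper lifetime assumed.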


The Doppler Effect 


Confirmation of the relativistic Doppler effect first came from experiments done 
in 1938 by Ives and Stilwell. They sent a beam of hydrogen atoms, generated 
in a gas discharge, down a tube at a speed $u$, as shown in Figure 2.25. They 
could simultaneously observe light emitted by the atoms in a direction parallel 
to $u$ (atom 1) and opposite to $u$ (atom 2, reflected from the mirror). Using a 
spectrograph, the experimenters were able to photograph the characteristic spectral 
lines from these atoms and also, on the same photographic plate, from atoms at 
rest. If the classical Doppler formula were valid, the wavelengths of the lines 
from atoms 1 and 2 would be placed at symmetric intervals $\Delta\lambda_1 = \pm\lambda_0 (u/c)$ on 
either side of the line from the atoms at rest (wavelength $\lambda_0$), as in Figure 2.25b. 
The relativistic Doppler formula, on the other hand, gives a small additional 
asymmetric shift $\Delta\lambda_2 = +\frac{1}{2}\lambda_0 (u/c)^2$, as in Figure 2.25c (computed for $u \ll c$, so 
that higher-order terms in $u/c$ can be neglected). 

FIGURE 2.25 (a) Apparatus used in the Ives-Stilwell experiment. (b) Line 
spectrum expected from classical Doppler effect. (c) Line spectrum expected 
from relativistic Doppler effect. 

FIGURE 2.26 Results of the Ives-Stilwell experiment. According to 
classical theory, $\Delta\lambda_2 = 0$, while according to special relativity, $\Delta\lambda_2$ 
depends on $(u/c)^2$. The solid line, which represents the relativistic 
formula, gives excellent agreement with the data points. 

Figure 2.26 shows the results of 
Ives and Stilwell for one of the hydrogen lines (the blue line of the Balmer series 
at $\lambda_0 = 486$ nm). The agreement between the observed values and those predicted 
by the relativistic formula is impressive. 
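The sizes of the symmetric and asymmetric shifts can be compared numerically. The Python sketch below evaluates the relativistic Doppler formula for light emitted along and against the beam direction, then separates the result into the half-difference (the symmetric shift) and the displacement of the midpoint (the asymmetric shift). The beam speed $u/c = 0.005$ is a hypothetical value chosen only for illustration, not the speed used by Ives and Stilwell.

```python
import math

lambda_0 = 486.0   # nm, rest wavelength of the blue Balmer line
beta = 0.005       # hypothetical beam speed u/c, chosen for illustration

gamma = 1.0 / math.sqrt(1.0 - beta**2)

# Relativistic longitudinal Doppler shift:
# lambda = lambda_0 * sqrt((1 + beta) / (1 - beta)) = lambda_0 * gamma * (1 + beta) when receding,
# and lambda_0 * gamma * (1 - beta) when approaching.
lam_receding = lambda_0 * gamma * (1.0 + beta)
lam_approaching = lambda_0 * gamma * (1.0 - beta)

# Symmetric part: half the separation of the two shifted lines.
symmetric_shift = 0.5 * (lam_receding - lam_approaching)

# Asymmetric part: displacement of the midpoint of the two lines from lambda_0.
asymmetric_shift = 0.5 * (lam_receding + lam_approaching) - lambda_0

print(f"symmetric shift  = {symmetric_shift:.4f} nm (lambda_0 * beta = {lambda_0 * beta:.4f} nm)")
print(f"asymmetric shift = {asymmetric_shift:.6f} nm (lambda_0 * beta^2 / 2 = {0.5 * lambda_0 * beta**2:.6f} nm)")
```

The midpoint displacement equals $\lambda_0(\gamma - 1)$ exactly, which reduces to $\frac{1}{2}\lambda_0 (u/c)^2$ for $u \ll c$. At this beam speed it is several hundred times smaller than the symmetric shift, which is why the measurement is demanding.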

Recent experiments with lasers have verified the relativistic formula at greater 
accuracy. These experiments are based on the absorption of laser light by an 
atom; when the radiation is absorbed, the atom changes from its lowest-energy 
state (the ground state) to one of its excited states. The experiment consists 
essentially of comparing the laser wavelength needed to excite atoms at rest 
with that needed for atoms in motion. One experiment used a beam of hydrogen 
atoms with kinetic energy 800 MeV (corresponding to u/c = 0.84) produced in a 
high-energy proton accelerator. An ultraviolet laser was used to excite the atoms. 
This experiment verified the relativistic Doppler effect to an accuracy of about 
$3 \times 10^{-4}$. [See D. W. MacArthur et al., Physical Review Letters 56, 282 (1986).] 
In another experiment, a beam of neon atoms moving with a speed of $u = 0.0036c$ 
was irradiated with light from a tunable dye laser. This experiment verified the 
relativistic Doppler shift to a precision of $2 \times 10^{-6}$. [See R. W. McGowan et al., 
Physical Review Letters 70, 251 (1993).] A more recent study used two tunable 
dye lasers parallel and antiparallel to a beam of lithium atoms moving at 0.064c. 
The results of this experiment agreed with the relativistic Doppler formula to 
within a precision of $2 \times 10^{-7}$, improving on the best previous results by an order 
of magnitude. [See G. Saathoff et al., Physical Review Letters 91, 190403 (2003).] 


Relativistic Momentum and Energy 


The earliest direct confirmation of the relativistic relationship for energy and 
momentum came just a few years after Einstein’s 1905 paper. Simultaneous 
measurements were made of the momentum and velocity of high-energy electrons 
emitted in certain radioactive decay processes (nuclear beta decay, which is 
discussed in Chapter 12). Figure 2.27 shows the results of several different 
investigations plotted as p/mv, which should have the value 1 according to 
classical physics. The results agree with the relativistic formula and disagree with 
the classical one. Note that the relativistic and classical formulas give the same 
results at low speeds, and in fact the two cannot be distinguished for speeds below 
0.1c, which accounts for our failure to observe these effects in experiments with 
ordinary laboratory objects. 

FIGURE 2.27 The ratio p/mv is plotted for electrons of various speeds. The data agree 
with the relativistic result and not at all with the nonrelativistic result (p/mv = 1). 

Other more recent experiments, in which the kinetic energies of fast electrons 
were measured, are shown in Figure 2.28. Once again, the data at high speeds 
agree with special relativity and disagree with the classical equations. In a 
more extreme example, experimenters at the Stanford Linear Accelerator Center 
measured the speed of 20 GeV electrons, whose speed is within $5 \times 10^{-10}$ of the 
speed of light (or about 0.15 m/s less than c). The measurement was not capable 
of this level of precision, but it did determine that the speed of the electrons was 
within $2 \times 10^{-7}$ of the speed of light (60 m/s). [See Z. G. T. Guiragossian et al., 
Physical Review Letters 34, 335 (1975).] 

Nearly every time the nuclear or particle physicist enters the laboratory, a direct 
or indirect test of the momentum and energy relationships of special relativity 
is made. Principles of special relativity must be incorporated in the design of 
the high-energy accelerators used by nuclear and particle physicists, so even the 
construction of these projects gives testimony to the validity of the formulas of 
special relativity. 

FIGURE 2.28 Confirmation of relativistic kinetic energy relationships. In (a) and (b) the momentum and energy of 
radioactive decay electrons were measured simultaneously. In these two independent experiments, the data were plotted in 
different ways, but the results are clearly in good agreement with the relativistic relationships and in poor agreement with 
the classical, nonrelativistic relationships. In (c) electrons were accelerated to a fixed energy through a large electric field 
(up to 4.5 million volts, as shown) and the velocities of the electrons were determined by measuring the flight time over 
8.4 m. Notice that at small kinetic energies ($K \ll mc^2$), the relativistic and nonrelativistic relationships become identical. 
[Sources: (a) K. N. Geller and R. Kollarits, Am. J. Phys. 40, 1125 (1972); (b) S. Parker, Am. J. Phys. 40, 241 (1972); 
(c) W. Bertozzi, Am. J. Phys. 32, 551 (1964).] 

For example, consider the capture of a neutron by an atom of hydrogen to form 
an atom of deuterium or “heavy hydrogen.” Energy is released in this process, 
mostly in the form of electromagnetic radiation (gamma rays). The energy of the 
gamma rays is measured to be 2.224 MeV. Where does this energy come from? 
It comes from the difference in mass when the hydrogen and neutron combine to 
form deuterium. The difference between the initial and final masses is: 


$$\begin{aligned}
\Delta m &= m(\text{hydrogen}) + m(\text{neutron}) - m(\text{deuterium}) \\
&= 1.007825\ \text{u} + 1.008665\ \text{u} - 2.014102\ \text{u} = 0.002388\ \text{u}
\end{aligned}$$

The initial mass of hydrogen plus neutron is greater than the final mass of 
deuterium by 0.002388 u. The energy equivalent of this change in mass is 

$$\Delta E = (\Delta m)c^2 = 2.224\ \text{MeV}$$


which is equal to the energy released as gamma rays. 
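The conversion from the mass difference to an energy can be reproduced with a short Python sketch. It assumes the standard energy equivalent of the atomic mass unit, about 931.494 MeV per u, which is not quoted in this passage; the variable names are illustrative.

```python
# Atomic masses in unified atomic mass units (u), as quoted in the text.
m_hydrogen = 1.007825
m_neutron = 1.008665
m_deuterium = 2.014102

# Energy equivalent of 1 u in MeV (assumed standard value, not from the passage).
U_TO_MEV = 931.494

delta_m = m_hydrogen + m_neutron - m_deuterium   # mass difference in u
delta_E = delta_m * U_TO_MEV                     # released energy in MeV

print(f"delta_m = {delta_m:.6f} u, delta_E = {delta_E:.3f} MeV")
```

The same three lines of arithmetic apply to any capture reaction once the initial and final atomic masses are known.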

Similar experiments have been done to test the $E = mc^2$ relationship by 
measuring the energy released as gamma rays following the capture of neutrons 
by atoms of silicon and sulfur, and comparing the gamma-ray energies with the 
difference between the initial and final masses. These experiments are consistent 
with $E = mc^2$ to a precision of about $4 \times 10^{-7}$. [See S. Rainville et al., Nature 
438, 1096 (2005).] 


Twin Paradox 


Although we cannot perform the experiment to test the twin paradox as we have 
described it, we can do an equivalent experiment. We take two clocks in our 
laboratory and synchronize them carefully. We then place one of the clocks in an 
airplane and fly it around the Earth. When we return the clock to the laboratory 
and compare the two clocks, we expect to find, if special relativity is correct, 
that the clock that has left the laboratory is the “younger” one—that is, it will 
have ticked away fewer seconds and appear to run behind its stationary twin. In 
this experiment, we must use very precise clocks based on the atomic vibrations 
of cesium in order to measure the time differences between the clock readings, 
which amount to only about $10^{-7}$ s. This experiment is complicated by several 
factors, all of which can be computed rather precisely: the rotating Earth is not 
an inertial frame (there is a centripetal acceleration), clocks on the surface of the 
Earth are already moving because of the rotation of the Earth, and the general 
theory of relativity predicts that a change in the gravitational field strength, which 
our moving clock will experience as it changes altitude in its airplane flight, will 
also change the rate at which the clock runs. In this experiment, as in the others 
we have discussed, the results are entirely in agreement with the predictions of 
special relativity. [See J. C. Hafele and R. E. Keating, Science 177, 166 (1972).] 

In a similar experiment, a cesium atomic clock carried on the space shuttle was 
compared with an identical clock on the Earth. The comparison was made through 
a radio link between the shuttle and the ground station. At an orbital height of 
about 328 km, the shuttle moves at a speed of about 7712 m/s, or $2.5 \times 10^{-5}c$. A 
clock moving at this speed runs slower than an identical clock at rest by the time 
dilation factor. For every second the clock is in orbit, it loses 330 ps relative to the 
clock on Earth; equivalently, it loses about 1.8 μs per orbit. These time intervals 
can be measured with great precision, and the predicted asymmetric aging was 
verified to a precision of about 0.1%. [See E. Sappl, Naturwissenschaften 77, 
325 (1990).] 
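The size of the shuttle clock effect can be estimated directly from the time dilation factor. The Python sketch below includes only the special-relativistic effect described above and ignores gravitational corrections. It assumes a circular orbit and a mean Earth radius of 6371 km, a value that is not given in the text.

```python
import math

c = 299_792_458.0    # speed of light, m/s
v = 7712.0           # orbital speed quoted in the text, m/s
h = 328e3            # orbital height quoted in the text, m
R_EARTH = 6.371e6    # mean Earth radius, m (assumed value, not from the text)

beta = v / c

# A moving clock records dt / gamma during a ground interval dt,
# so it falls behind by 1 - sqrt(1 - beta^2) per ground second.
loss_per_second = 1.0 - math.sqrt(1.0 - beta**2)

# Period of one circular orbit at this height and speed.
period = 2.0 * math.pi * (R_EARTH + h) / v

loss_per_orbit = loss_per_second * period

print(f"loss per second = {loss_per_second * 1e12:.0f} ps")
print(f"orbital period  = {period:.0f} s")
print(f"loss per orbit  = {loss_per_orbit * 1e6:.2f} microseconds")
```

For $u \ll c$ the loss per second is very nearly $\frac{1}{2}(u/c)^2$, which is why such a small effect requires atomic clocks to detect.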
1. Explain in your own words what is meant by the term 
“relativity.” Are there different theories of relativity? 

2. Suppose the two observers and the rock described in the 
first paragraph of Section 2.1 were isolated in interstellar 
space. Discuss the two observers’ differing perceptions of 
the motion of the rock. Is there any experiment they can 
do to determine whether the rock is moving in any absolute 
sense? 

3. Describe the situation of Figure 2.4 as it would appear from 
the reference frame of O’. 

4. Does the Michelson-Morley experiment show that the ether 
does not exist or that it is merely unnecessary? 

5. Suppose we made a pair of shears in which the cutting blades 
were many orders of magnitude longer than the handle. Let 
us in fact make them so long that, when we move the handles 
at angular velocity $\omega$, a point on the tip of the blade has a 
tangential velocity $v = \omega r$ that is greater than c. Does this 
contradict special relativity? Justify your answer. 

6. Light travels through water at a speed of about $2.25 \times 10^8$ 
m/s. Is it possible for a particle to travel through water at a 
speed v greater than $2.25 \times 10^8$ m/s? 


7. Is it possible to have particles that travel at the speed of 
light? What does Eq. 2.36 require of such particles? 

8. How does relativity combine space and time coordinates 
into spacetime? 

9. Einstein developed the relativity theory after trying 
unsuccessfully to imagine how a light beam would look to an 
observer traveling with the beam at speed c. Why is this so 
difficult to imagine? 

10. Explain in your own words the terms time dilation and 
length contraction. 

11. Does the Moon’s disk appear to be a different size to a 
space traveler approaching it at v = 0.99c, compared with 
the view of a person at rest at the same location? 

12. According to the time dilation effect, would the life 
expectancy of someone who lives at the equator be longer 
or shorter than someone who lives at the North Pole? By 
how much? 

13. Criticize the following argument. “Here is a way to travel 
faster than light. Suppose a star is 10 light-years away. A 
radio signal sent from Earth would need 20 years to make 
the round trip to the star. If I were to travel to the star in my 
rocket at v = 0.8c, to me the distance to the star is contracted 
by $\sqrt{1 - (0.8)^2}$ to 6 light-years, and at that speed it would 
take me 6 light-years/0.8c = 7.5 years to travel there. The 
round trip takes me only 15 years, and therefore I travel 
faster than light, which takes 20 years.” 

14. Is it possible to synchronize clocks that are in motion 
relative to each other? Try to design a method to do so. Which 
observers will believe the clocks to be synchronized? 

15. Suppose event A causes event B. To one observer, event A 
comes before event B. Is it possible that in another frame of 
reference event B could come before event A? Discuss. 

16. Is mass a conserved quantity in classical physics? In special 
relativity? 

17. “In special relativity, mass and energy are equivalent.” 
Discuss this statement and give examples. 

18. Which is more massive, an object at low temperature or 
the same object at high temperature? A spring at its natural 
length or the same spring under compression? A container of 
gas at low pressure or at high pressure? A charged capacitor 
or an uncharged one? 

19. Could a collision be elastic in one frame of reference and 
inelastic in another? 

20. (a) What properties of nature would be different if there 
were a relativistic transformation law for electric charge? 
(b) What experiments could be done to prove that electric 
charge does not change with velocity? 


2.1 Classical Relativity 


1. You are piloting a small airplane in which you want to reach 
a destination that is 750 km due north of your starting 
location. Once you are airborne, you find that (due to a strong 
but steady wind) to maintain a northerly course you must 
point the nose of the plane at an angle that is 22° west of 
true north. From previous flights on this route in the absence 
of wind, you know that it takes you 3.14 h to make the 
journey. With the wind blowing, you find that it takes 4.32 
h. A fellow pilot calls you to ask about the wind velocity 
(magnitude and direction). What is your report? 

2. A moving sidewalk 95 m in length carries passengers at 
a speed of 0.53 m/s. One passenger has a normal walking 
speed of 1.24 m/s. (a) If the passenger stands on the 
sidewalk without walking, how long does it take her to travel 
the length of the sidewalk? (b) If she walks at her normal 
walking speed on the sidewalk, how long does it take to 
travel the full length? (c) When she reaches the end of the 
sidewalk, she suddenly realizes that she left a package at the 
opposite end. She walks rapidly back along the sidewalk at 
double her normal walking speed to retrieve the package. 
How long does it take her to reach the package? 


2.2 The Michelson-Morley Experiment 


3. A shift of one fringe in the Michelson-Morley experiment 
corresponds to a change in the round-trip travel time along 
one arm of the interferometer by one period of vibration 
of light (about $2 \times 10^{-15}$ s) when the apparatus is rotated 
by 90°. Based on the results of Example 2.3, what velocity 
through the ether would be deduced from a shift of 
one fringe? (Take the length of the interferometer arm to 
be 11 m.) 


2.4 Consequences of Einstein’s Postulates 


4. The distance from New York to Los Angeles is about 
5000 km and should take about 50 h in a car driving at 
100 km/h. (a) How much shorter than 5000 km is the 
distance according to the car travelers? (b) How much less than 
50 h do they age during the trip? 

5. How fast must an object move before its length appears to 
be contracted to one-half its proper length? 

6. An astronaut must journey to a distant planet, which is 
200 light-years from Earth. What speed will be necessary 
if the astronaut wishes to age only 10 years during the 
round trip? 


7. The proper lifetime of a certain particle is 100.0 ns. (a) How 
long does it live in the laboratory if it moves at v = 0.960c? 
(b) How far does it travel in the laboratory during that time? 
(c) What is the distance traveled in the laboratory according 
to an observer moving with the particle? 

8. High-energy particles are observed in laboratories by 
photographing the tracks they leave in certain detectors; the 
length of the track depends on the speed of the particle 
and its lifetime. A particle moving at 0.995c leaves 
a track 1.25 mm long. What is the proper lifetime of the 
particle? 

9. Carry out the missing steps in the derivation of Eq. 2.17. 

10. Two spaceships approach the Earth from opposite directions. 
According to an observer on the Earth, ship A is moving at 
a speed of 0.753c and ship B at a speed of 0.851c. What is 
the velocity of ship A as observed from ship B? Of ship B as 
observed from ship A? 

11. Rocket A leaves a space station with a speed of 0.826c. 
Later, rocket B leaves in the same direction with a speed of 
0.635c. What is the velocity of rocket A as observed from 
rocket B? 

12. One of the strongest emission lines observed from distant 
galaxies comes from hydrogen and has a wavelength of 
122 nm (in the ultraviolet region). (a) How fast must a 
galaxy be moving away from us in order for that line to be 
observed in the visible region at 366 nm? (b) What would 
be the wavelength of the line if that galaxy were moving 
toward us at the same speed? 

13. A physics professor claims in court that the reason he 
went through the red light ($\lambda = 650$ nm) was that, due to 
his motion, the red color was Doppler shifted to green 
($\lambda = 550$ nm). How fast was he going? 


2.5 The Lorentz Transformation 
14. Derive the Lorentz velocity transformations for $v'_x$ and $v'_z$. 
Observer O fires a light beam in the y direction (v, = c). 
Use the Lorentz velocity transformation to find vi, and vi, 
and show that O’ also measures the value c for the speed of 
light. Assume that O’ moves relative to O with velocity u in 
the x direction. 

A light bulb at point x in the frame of reference of O
blinks on and off at intervals $\Delta t = t_2 - t_1$. Observer O’,
moving relative to O at speed u, measures the interval to be
$\Delta t' = t'_2 - t'_1$. Use the Lorentz transformation expressions
to derive the time dilation expression relating $\Delta t$ and $\Delta t'$.

A neutral K meson at rest decays into two π mesons, which
travel in opposite directions along the x axis with speeds of
0.828c. If instead the K meson were moving in the positive
x direction with a velocity of 0.486c, what would be the
velocities of the two π mesons?

A rod in the reference frame of observer O makes an angle 
of 31° with the x axis. According to observer O’, who is in 
motion in the x direction with velocity u, the rod makes an 
angle of 46° with the x axis. Find the velocity u.

According to observer O, two events occur separated by a
time interval Δt = +0.465 μs and at locations separated by
Δx = +53.4 m. (a) According to observer O’, who is in
motion relative to O at a speed of 0.762c in the positive x 
direction, what is the time interval between the two events? 
(b) What is the spatial separation between the two events, 
according to O'? 

According to observer O, a blue flash occurs at $x_1$ = 10.4 m
when $t_1$ = 0.124 μs, and a red flash occurs at $x_2$ = 23.6 m
when $t_2$ = 0.138 μs. According to observer O’, who is in
motion relative to O at velocity u, the two flashes appear to 
be simultaneous. Find the velocity u. 


2.6 The Twin Paradox
Suppose the speed of light were 1000 mi/h. You are traveling
on a flight from Los Angeles to Boston, a distance of 3000 
mi. The plane’s speed is a constant 600 mi/h. You leave Los 
Angeles at 10:00 A.M., as indicated by your wristwatch and 
by a clock in the airport. (a) According to your watch, what 
time is it when you land in Boston? (b) In the Boston airport 
is a clock that is synchronized to read exactly the same time 
as the clock in the Los Angeles airport. What time does that 
clock read when you land in Boston? (c) The following day 
when the Boston clock that records Los Angeles time reads 
10:00 A.M., you leave Boston to return to Los Angeles on 
the same airplane. When you land in Los Angeles, what are 
the times read on your watch and on the airport clock?

Suppose rocket traveler Amelia has a clock made on Earth.
Every year on her birthday she sends a light signal to brother
Casper on Earth. (a) At what rate does Casper receive
the signals during Amelia’s outward journey? (b) At what
rate does he receive the signals during her return journey?
(c) How many of Amelia’s birthday signals does Casper
receive during the journey that he measures to last 20 years?

Suppose Amelia traveled at a speed of 0.80c to a star that
(according to Casper on Earth) is 8.0 light-years away.
Casper ages 20 years during Amelia’s round trip. How much
younger than Casper is Amelia when she returns to Earth?

Make a drawing similar to Figure 2.20 showing the worldlines
of Casper and Amelia from Casper’s frame of reference.
Divide the world line for Amelia’s outward journey into 8 
equal segments (for the 8 birthdays that Amelia celebrates). 
For each birthday, draw a line that represents a light signal 
that Amelia sends to Casper on her birthday. Do the same 
for Amelia’s return journey. (a) According to Casper’s time, 
when does he receive the signal showing Amelia celebrating 
her 8th birthday after leaving Earth? (b) How long does 
it take for Casper to receive the signals showing Amelia 
celebrating birthdays 9 through 16? 


2.7 Relativistic Dynamics 


(a) Using the relativistically correct final velocities for
the collision shown in Figure 2.21a ($v'_{1f} = -0.585c$,
$v'_{2f} = +0.294c$), show that relativistic kinetic energy is
conserved according to observer O’. (b) Using the relativistically
correct final velocities for the collision shown in Figure 2.21b
($v_{1f} = -0.051c$, $v_{2f} = +0.727c$), show that relativistic
kinetic energy is conserved according to observer O.

Find the momentum, kinetic energy, and total energy of a 
proton moving at a speed of 0.756c. 

An electron is moving with a kinetic energy of 1.264 MeV. 
What is its speed? 

The work-energy theorem relates the change in kinetic 
energy of a particle to the work done on it by an external 
force: $\Delta K = W = \int F\,dx$. Writing Newton’s second law
as F = dp/dt, show that $W = \int v\,dp$ and integrate by parts
using the relativistic momentum to obtain Eq. 2.34. 

For what range of velocities of a particle of mass m can 
we use the classical expression for kinetic energy $\frac{1}{2}mv^2$ to
within an accuracy of 1%? 

For what range of velocities of a particle of mass m can we 
use the extreme relativistic approximation E = pc to within 
an accuracy of 1%? 

Use Eqs. 2.32 and 2.36 to derive Eq. 2.39. 

Use the binomial expansion $(1 + x)^n = 1 + nx + [n(n-1)/2!]x^2 + \cdots$
to show that Eq. 2.34 for the relativistic kinetic energy reduces
to the classical expression $\frac{1}{2}mv^2$ when $v \ll c$. This
important result shows that our familiar expressions are correct
at low speeds. By evaluating the first term in the expansion
beyond $\frac{1}{2}mv^2$, find the speed necessary before the
classical expression is off by 0.01%.

(a) According to observer O, a certain particle has a 
momentum of 817 MeV/c and a total relativistic energy 
of 1125 MeV. What is the rest energy of this particle? 
(b) An observer O’ in a different frame of reference mea- 
sures the momentum of this particle to be 953 MeV/c. 
What does O’ measure for the total relativistic energy of the 
particle? 

An electron is moving at a speed of 0.81c. By how much
must its kinetic energy increase to raise its speed to 0.91c?

What is the change in mass when 1 g of copper is heated
from 0 to 100°C? The specific heat capacity of copper is
0.40 J/g·K.

Find the kinetic energy of an electron moving at a speed of 
(a) $v = 1.00 \times 10^{-4}c$; (b) $v = 1.00 \times 10^{-2}c$;
(c) v = 0.300c; (d) v = 0.999c.

An electron and a proton are each accelerated starting from 
rest through a potential difference of 10.0 million volts. Find 
the momentum (in MeV/c) and the kinetic energy (in MeV) 
of each, and compare with the results of using the classical 
formulas. 

In a nuclear reactor, each atom of uranium (of atomic mass 
235 u) releases about 200 MeV when it fissions. What is the 
change in mass when 1.00 kg of uranium-235 is fissioned? 


2.8 Conservation Laws in Relativistic Decays and Collisions
A π meson of rest energy 139.6 MeV moving at a speed of
0.906c collides with and sticks to a proton of rest energy 
938.3 MeV that is at rest. (a) Find the total relativistic 
energy of the resulting composite particle. (b) Find the total 
linear momentum of the composite particle. (c) Using the 
results of (a) and (b), find the rest energy of the composite 
particle. 

An electron and a positron (an antielectron) make a head-on 
collision, each moving at v = 0.99999c. In the collision 
the electrons disappear and are replaced by two muons 
($mc^2$ = 105.7 MeV), which move off in opposite directions.
What is the kinetic energy of each of the muons? 

It is desired to create a particle of mass 9700 MeV/$c^2$ in a
head-on collision between a proton and an antiproton (each
having a mass of 938.3 MeV/$c^2$) traveling at the same speed.
What speed is necessary for this to occur? 

A particle of rest energy $mc^2$ is moving with speed v in the
positive x direction. The particle decays into two particles, 
each of rest energy 140 MeV. One particle, with kinetic 
energy 282 MeV, moves in the positive x direction, and the 
other particle, with kinetic energy 25 MeV, moves in the 
negative x direction. Find the rest energy of the original 
particle and its speed. 


2.9 Experimental Tests of Special Relativity 


In the muon decay experiment discussed in Section 2.9 as a
verification of time dilation, the muons move in the lab with
a momentum of 3094 MeV/c. Find the dilated lifetime in
the laboratory frame. (The proper lifetime is 2.198 μs.)

Derive the relativistic expression $p^2/2K = m + K/2c^2$,
which is plotted in Figure 2.28a.


General Problems
Suppose we want to send an astronaut on a round trip to
visit a star that is 200 light-years distant and at rest with 
respect to Earth. The life support systems on the spacecraft 
enable the astronaut to survive at most 20 years. (a) At what 
speed must the astronaut travel to make the round trip in 
20 years of spacecraft time? (b) How much time passes on 
Earth during the round trip? 

A “cause” occurs at point 1 ($x_1, t_1$) and its “effect” occurs
at point 2 ($x_2, t_2$). Use the Lorentz transformation to find
$t'_2 - t'_1$, and show that $t'_2 - t'_1 > 0$; that is, O’ can never see
the “effect” coming before its “cause.”

Observer O sees a red flash of light at the origin at t = 0 and
a blue flash of light at x = 3.26 km at a time t = 7.63 μs.
What are the distance and the time interval between the
flashes according to observer O’, who moves relative to O
in the direction of increasing x with a speed of 0.625c?
Assume that the origins of the two coordinate systems line
up at $t = t' = 0$.

Several spacecraft (A,B,C, and D) leave a space station 
at the same time. Relative to an observer on the station, 
A travels at 0.60c in the x direction, B at 0.50c in the y 
direction, C at 0.50c in the negative x direction, and D at 
0.50c at 45° between the y and negative x directions. Find 
the velocity components, directions, and speeds of B, C, and 
D as observed from A. 

Observer O sees a light turn on at x = 524 m when 
t = 1.52 μs. Observer O’ is in motion at a speed of 0.563c
in the positive x direction. The two frames of reference are 
synchronized so that their origins match up (x = x’ = 0) at 
$t = t' = 0$. (a) At what time does the light turn on according
to O'? (b) At what location does the light turn on in the 
reference frame of O'? 

Suppose an observer O measures a particle of mass m 
moving in the x direction to have speed v, energy E, 
and momentum p. Observer O’, moving at speed u in
the x direction, measures v’, E’, and p’ for the same object.
(a) Use the Lorentz velocity transformation to find E’ and
p’ in terms of m, u, and v. (b) Reduce $E'^2 - (p'c)^2$ to its
simplest form and interpret the result.

Repeat Problem 50 for the mass moving in the y direction 
according to O. The velocity u of O’ is still along the x 
direction. 

Consider again the situation described in Section 2.6. 
Amelia’s friend Bernice leaves Earth at the same time 
as Amelia and travels in the same direction at the same 
speed, but Bernice continues in the original direction when 
Amelia reaches the planet and turns her ship around. 
(a) From Bernice’s frame of reference, Casper is moving 
at a velocity of —0.60c. Draw Casper’s worldline in Ber- 
nice’s frame of reference. (b) Casper celebrates 20 birthdays 
during Amelia’s journey. In Bernice’s frame of reference, 
how long does it take for Casper to celebrate 20 birthdays? 
(c) In Bernice’s frame of reference, draw a worldline repre- 
senting Amelia’s outbound journey to the planet. (d) Calcu- 
late Amelia’s velocity during her return journey as observed 
from Bernice’s frame of reference, and draw a worldline 
showing Amelia’s return journey. Amelia’s and Casper’s 
worldlines should intersect when Amelia returns to Earth.
(e) Divide Casper’s worldline into 20 segments, representing
his birthdays. He sends a light signal to Amelia on each
birthday. Amelia receives a light signal from Casper just as 
she arrives at the planet. On which birthday did Casper send 
this signal? (f) Amelia sends Casper a light signal on her 
8th birthday. Draw a line on your diagram representing this 
light signal. When does Casper receive this signal?

Electrons are accelerated to high speeds by a two-stage
machine. The first stage accelerates the electrons from rest
to v = 0.99c. The second stage accelerates the electrons
from 0.99c to 0.999c. (a) How much energy does the first
stage add to the electrons? (b) How much energy does the
second stage add in increasing the velocity by only 0.9%?

A beam of $1.35 \times 10^{11}$ electrons/s moving at a speed of
0.732c strikes a block of copper that is used as a beam
stop. The copper block is a cube measuring 2.54 cm on edge.
What is the temperature increase of the block after one hour?

An electron moving at a speed of $v_i = 0.960c$ in the positive
x direction collides with another electron at rest. After the
collision, one electron is observed to move with a speed of
$v_{1f} = 0.956c$ at an angle of $\theta_1 = 9.7°$ with the x axis. (a) Use
conservation of momentum to find the velocity (magnitude
and direction) of the second electron. (b) Based only on
the original data given in the problem, use conservation of
energy to find the speed of the second electron.

A pion has a rest energy of 135 MeV. It decays into two 
gamma ray photons, bursts of electromagnetic radiation that 
travel at the speed of light. A pion moving through the 
laboratory at v = 0.98c decays into two gamma ray photons 
of equal energies, making equal angles θ with the original
direction of motion. Find the angle θ and the energies of the
two gamma ray photons. 

Consider again the decay described in Example 2.16 and 
determine the energies of the two pi mesons emitted in the 
decay of the K meson by first making a Lorentz transfor- 
mation to a reference frame in which the initial K meson is 
at rest. When a K meson at rest decays into two pi mesons, 
they move in opposite directions with equal and opposite 
velocities, so they share the decay energy equally. Find 
the energies and velocities of the two pi mesons in the K 
meson’s rest frame. Then transform back to the lab frame to 
find their kinetic energies.

We now turn to a discussion of wave mechanics, the second theory on which
modern physics is based. One consequence of wave mechanics is the breakdown 
of the classical distinction between particles and waves. In this chapter we consider 
the three early experiments that provided evidence that light, which we usually 
regard as a wave phenomenon, has properties that we normally associate with 
particles. Instead of spreading its energy smoothly over a wave front, the energy 
is delivered in concentrated bundles like particles; a discrete bundle (quantum) of 
electromagnetic energy is known as a photon. 

Before we begin to discuss the experimental evidence that supports the 
existence of the photon and the particlelike properties of light, we first review 
some of the properties of electromagnetic waves. 


3.1 REVIEW OF ELECTROMAGNETIC WAVES


An electromagnetic field is characterized by its electric field $\vec{E}$ and
magnetic field $\vec{B}$. For example, the electric field at a distance r from a
point charge q at the origin is

$$\vec{E} = \frac{1}{4\pi\varepsilon_0}\,\frac{q}{r^2}\,\hat{r} \tag{3.1}$$

where $\hat{r}$ is a unit vector in the radial direction. The magnetic field at a
distance r from a long, straight wire carrying a current i along the z axis is

$$\vec{B} = \frac{\mu_0 i}{2\pi r}\,\hat{\phi} \tag{3.2}$$

where $\hat{\phi}$ is the unit vector in the azimuthal direction (in the xy plane)
in cylindrical coordinates.

If the charges are accelerated, or if the current varies with time, an
electromagnetic wave is produced, in which $\vec{E}$ and $\vec{B}$ vary not only
with r but also with t. The mathematical expression that describes such a wave
may have many different forms, depending on the properties of the source of the
wave and of the medium through which the wave travels. One special form is the
plane wave, in which the wave fronts are planes. (A point source, on the other
hand, produces spherical waves, in which the wave fronts are spheres.) A plane
electromagnetic wave traveling in the positive z direction is described by the
expressions

$$\vec{E} = \vec{E}_0 \sin(kz - \omega t), \qquad \vec{B} = \vec{B}_0 \sin(kz - \omega t) \tag{3.3}$$

where the wave number k is found from the wavelength λ ($k = 2\pi/\lambda$) and
the angular frequency ω is found from the frequency f ($\omega = 2\pi f$).
Because λ and f are related by $c = \lambda f$, k and ω are also related by
$c = \omega/k$.
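
As a quick numerical check of these relations, the following minimal sketch computes k, ω, and f for a single wavelength and confirms that ω/k reproduces c. The function name `wave_parameters`, the rounded value of c, and the 550 nm test wavelength are illustrative choices, not values taken from the text.

```python
import math

C = 2.998e8  # speed of light in m/s (rounded; an assumed illustrative value)

def wave_parameters(wavelength_m):
    """Return (k, omega, f) for a plane wave of the given vacuum wavelength."""
    k = 2 * math.pi / wavelength_m   # wave number in rad/m, k = 2*pi/lambda
    f = C / wavelength_m             # frequency in Hz, from c = lambda * f
    omega = 2 * math.pi * f          # angular frequency in rad/s
    return k, omega, f

# 550 nm (green light) is chosen only as an example.
k, omega, f = wave_parameters(550e-9)
print(f"k = {k:.3e} rad/m, omega = {omega:.3e} rad/s, f = {f:.3e} Hz")
print(f"omega / k = {omega / k:.3e} m/s")  # should agree with C
```
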

The polarization of the wave is represented by the vector $\vec{E}_0$; the plane
of polarization is determined by the direction of $\vec{E}_0$ and the direction
of propagation, the z axis in this case. Once we specify the direction of travel
and the polarization $\vec{E}_0$, the direction of $\vec{B}_0$ is fixed by the
requirements that $\vec{B}$ must be perpendicular to both $\vec{E}$ and the
direction of travel, and that the vector product $\vec{E} \times \vec{B}$ point
in the direction of travel. For example, if $\vec{E}_0$ is in the x direction
($\vec{E}_0 = E_0\hat{i}$, where $\hat{i}$ is a unit vector in the x direction),
then $\vec{B}_0$ must be in the y direction ($\vec{B}_0 = B_0\hat{j}$).
Moreover, the magnitude of $\vec{B}_0$ is determined by

$$B_0 = \frac{E_0}{c} \tag{3.4}$$

where c is the speed of light.
An electromagnetic wave transmits energy from one place to another; the 
energy flux is specified by the Poynting vector S: 


$$\vec{S} = \frac{1}{\mu_0}\,\vec{E} \times \vec{B} \tag{3.5}$$

For the plane wave, this reduces to

$$\vec{S} = \frac{1}{\mu_0} E_0 B_0 \sin^2(kz - \omega t)\,\hat{k} \tag{3.6}$$

where $\hat{k}$ is a unit vector in the z direction. The Poynting vector has
dimensions of power (energy per unit time) per unit area (for example, J/s/m²
or W/m²). Figure 3.1 shows the orientation of the vectors $\vec{E}$, $\vec{B}$,
and $\vec{S}$ for this special case.

Let us imagine the following experiment. We place a detector of electromag- 
netic radiation (a radio receiver or a human eye) at some point on the z axis, 
and we determine the electromagnetic power that this plane wave delivers to the 
receiver. The receiver is oriented with its sensitive area A perpendicular to the z 
axis, so that the maximum signal is received; we can therefore drop the vector 
representation of S and work only with its magnitude S. The power P entering the 
receiver is then 


$$P = SA = \frac{1}{\mu_0} E_0 B_0 A \sin^2(kz - \omega t) \tag{3.7}$$

which we can rewrite using Eq. 3.4 as

$$P = \frac{1}{\mu_0 c} E_0^2 A \sin^2(kz - \omega t) \tag{3.8}$$


There are two important features of this expression that you should recognize: 


1. The intensity (the average power per unit area) is proportional to $E_0^2$.
This is a general property of waves: the intensity is proportional to the square
of the amplitude. We will see later that this same property also characterizes
the waves that describe the behavior of material particles.

2. The intensity fluctuates with time, with the frequency $2f = 2(\omega/2\pi)$.
We don’t usually observe this rapid fluctuation: visible light, for example, has
a frequency of about $10^{15}$ oscillations per second, and because our eye
doesn’t respond that quickly, we observe the time average of many (perhaps
$10^{13}$) cycles. If T is the observation time (perhaps $10^{-2}$ s in the case
of the eye) then the average power is

$$P_{\text{av}} = \frac{1}{T}\int_0^T P\,dt \tag{3.9}$$

and using Eq. 3.8 we obtain the intensity I:

$$I = \frac{P_{\text{av}}}{A} = \frac{1}{2\mu_0 c} E_0^2 \tag{3.10}$$

because the average value of $\sin^2\theta$ is 1/2.
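
The time average can be checked numerically. The sketch below estimates the average of sin² over many cycles with a simple midpoint sum and evaluates the intensity formula for one amplitude. It is an illustration under stated assumptions: the frequency is scaled down to about 12 kHz so that the cycles can be sampled directly (optical frequencies would need far too many samples), and the function names, the 100 V/m amplitude, and the rounded constants are hypothetical choices rather than values from the text.

```python
import math

MU0 = 4 * math.pi * 1e-7   # permeability of free space in T*m/A (conventional value)
C = 2.998e8                # speed of light in m/s (rounded)

def average_sin2(omega, T, steps=200_000):
    """Midpoint-rule estimate of (1/T) * integral from 0 to T of sin^2(omega*t) dt."""
    dt = T / steps
    total = sum(math.sin(omega * (i + 0.5) * dt) ** 2 for i in range(steps))
    return total / steps

def intensity(E0):
    """I = E0^2 / (2 * mu0 * c), in W/m^2 for E0 in V/m."""
    return E0**2 / (2 * MU0 * C)

# Illustrative numbers only: a 12.3457 kHz wave observed for 10 ms (about 123 cycles).
omega = 2 * math.pi * 12345.7
avg = average_sin2(omega, T=1.0e-2)
print(f"time average of sin^2: {avg:.5f}")
print(f"intensity for E0 = 100 V/m: {intensity(100.0):.2f} W/m^2")
```

The average approaches 1/2 more closely as the observation time covers more cycles, which is why the eye, averaging over something like $10^{13}$ cycles, sees a steady intensity.
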











FIGURE 3.1 An electromagnetic wave traveling in the z direction. The electric
field E lies in the xz plane and the magnetic field B lies in the yz plane.

FIGURE 3.2 (a) Young’s double-slit experiment. A plane wave front passes
through both slits; the wave is diffracted at the slits, and interference
occurs where the diffracted waves overlap on the screen. (b) The
interference fringes observed on the screen.


Interference and Diffraction 


The property that makes waves a unique physical phenomenon is the principle 
of superposition, which, for example, allows two waves to meet at a point, to 
cause a combined disturbance at the point that might be greater or less than the 
disturbance produced by either wave alone, and finally to emerge from the point 
of “collision” with all of the properties of each wave totally unchanged by the 
collision. To appreciate this important distinction between material objects and 
waves, imagine trying that trick with two automobiles! 

This special property of waves leads to the phenomena of interference and 
diffraction. The simplest and best-known example of interference is Young’s 
double-slit experiment, in which a monochromatic plane wave is incident on 
a barrier in which two narrow slits have been cut. (This experiment was first 
done with light waves, but in fact any wave will do as well, not only other 
electromagnetic waves, such as microwaves, but also mechanical waves, such as 
water waves or sound waves. We assume that the experiment is being done with 
light waves.) 

Figure 3.2 illustrates this experimental arrangement. The plane wave is 
diffracted by each of the slits, so that the light passing through each slit covers a 
much larger area on the screen than the geometric shadow of the slit. This causes 
the light from the two slits to overlap on the screen, producing the interference. 
If we move away from the center of the screen just the right distance, we reach 
a point at which a wave crest passing through one slit arrives at exactly the 
same time as the previous wave crest that passed through the other slit. When this 
occurs, the intensity is a maximum, and a bright region appears on the screen. This 
is constructive interference, and it occurs continually at the point on the screen 
that is exactly one wavelength further from one slit than from the other. That is, 
if $X_1$ and $X_2$ are the distances from the point on the screen to the two
slits, then a condition for maximum constructive interference is
$|X_1 - X_2| = \lambda$. Constructive interference occurs when any wave crest
from one slit arrives simultaneously with another from the other slit, whether
it is the next, or the fourth, or the forty-seventh. The general condition for
complete constructive interference is that the difference between $X_1$ and
$X_2$ be an integral number of wavelengths:

$$|X_1 - X_2| = n\lambda \qquad n = 0, 1, 2, \ldots \tag{3.11}$$

It is also possible for the crest of the wave from one slit to arrive at a point on
the screen simultaneously with the trough (valley) of the wave from the other slit.
When this happens, the two waves cancel, giving a dark region on the screen. This
is known as destructive interference. (The existence of destructive interference at
intensity minima immediately shows that we must add the electric field vectors E
of the waves from the two slits, and not their powers P, because P can never be
negative.) Destructive interference occurs whenever the distances $X_1$ and $X_2$ are
such that the phase of one wave differs from the other by one-half cycle, or by
one and one-half cycles, two and one-half cycles, and so forth:

$$|X_1 - X_2| = \tfrac{1}{2}\lambda, \tfrac{3}{2}\lambda, \tfrac{5}{2}\lambda, \ldots = \left(n + \tfrac{1}{2}\right)\lambda \qquad n = 0, 1, 2, \ldots \tag{3.12}$$
We can find the locations on the screen where the interference maxima occur in
the following way. Let d be the separation of the slits, and let D be the distance
from the slits to the screen. If $y_n$ is the distance from the center of the
screen to the nth maximum, then from the geometry of Figure 3.3 we find
(assuming $X_1 > X_2$)

$$X_1^2 = D^2 + \left(\frac{d}{2} + y_n\right)^2 \quad\text{and}\quad X_2^2 = D^2 + \left(\frac{d}{2} - y_n\right)^2 \tag{3.13}$$

Subtracting these equations and solving for $y_n$, we obtain

$$y_n = \frac{X_1^2 - X_2^2}{2d} = \frac{(X_1 + X_2)(X_1 - X_2)}{2d} \tag{3.14}$$

In experiments with light, D is of order 1 m, and $y_n$ and d are typically at
most 1 mm; thus $X_1 \approx D$ and $X_2 \approx D$, so $X_1 + X_2 \approx 2D$,
and to a good approximation

$$y_n = (X_1 - X_2)\frac{D}{d} \tag{3.15}$$

Using Eq. 3.11 for the values of $(X_1 - X_2)$ at the maxima, we find

$$y_n = n\frac{\lambda D}{d} \tag{3.16}$$

Crystal Diffraction of X Rays 


Another device for observing the interference of light waves is the diffraction 
grating, in which the wave fronts pass through a barrier that has many slits 
(often thousands or tens of thousands) and then recombine. The operation of this 
device is illustrated in Figure 3.4; interference maxima corresponding to different
wavelengths appear at different angles θ, according to

$$d \sin\theta = n\lambda \tag{3.17}$$

where d is the slit spacing and n is the order number of the maximum
(n = 1, 2, 3, ...).

The advantage of the diffraction grating is its superior resolution: it enables us
to get very good separation of wavelengths that are close to one another, and thus it
is a very useful device for measuring wavelengths. Notice, however, that in order
to get reasonable values of the angle θ (for example, sin θ in the range of 0.3
to 0.5) we must have d of the order of a few times the wavelength. For visible
light this is not particularly difficult, but for radiations of very short wavelength, 
mechanical construction of a grating is not possible. For example, for X rays with 
a wavelength of the order of 0.1 nm, we would need to construct a grating in 
which the slits were less than 1 nm apart, which is roughly the same as the spacing 
between the atoms of most materials. 
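
The scale argument can be made concrete with a short calculation. The sketch below lists the grating angles allowed by d sin θ = nλ for a given slit spacing, first for visible light and then for X rays on the same grating. The slit spacing of 1.6 μm, the 500 nm test wavelength, and the function name are hypothetical values chosen for illustration.

```python
import math

def grating_angles(wavelength, d, max_order=10):
    """Return {n: theta in degrees} for every order n with n*lambda/d <= 1."""
    angles = {}
    for n in range(1, max_order + 1):
        s = n * wavelength / d      # sin(theta) required for order n
        if s > 1:                   # no real angle exists beyond this order
            break
        angles[n] = math.degrees(math.asin(s))
    return angles

d = 1.6e-6   # slit spacing in m (assumed; a few visible wavelengths)

visible = grating_angles(500e-9, d)            # visible light
xray = grating_angles(0.1e-9, d, max_order=1)  # X rays on the same grating

for n, theta in visible.items():
    print(f"visible, order {n}: theta = {theta:.2f} degrees")
print(f"X ray, order 1: theta = {xray[1]:.5f} degrees")
```

With this spacing the visible maxima are spread over tens of degrees, while the first-order X-ray maximum lies only a few thousandths of a degree from the straight-through beam, far too close to measure.
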

The solution to this problem has been known since the pioneering experiments 
of Laue and Bragg:* use the atoms themselves as a diffraction grating! A beam 
of X rays sees the regular spacings of the atoms in a crystal as a sort of 
three-dimensional diffraction grating. 


*Max von Laue (1879-1960, Germany) developed the method of X-ray diffraction for the study of 
crystal structures, for which he received the 1914 Nobel Prize. Lawrence Bragg (1890—1971, England) 
developed the Bragg law for X-ray diffraction while he was a student at Cambridge University. He 
shared the 1915 Nobel Prize with his father, William Bragg, for their research on the use of X rays to 
determine crystal structures. 








FIGURE 3.3 The geometry of the double-slit experiment.

FIGURE 3.4 The use of a diffraction grating to analyze light into its
constituent wavelengths.


74 Chapter 3 | The Particlelike Properties of Electromagnetic Radiation 


FIGURE 3.5 A beam of X rays reflected from a set of crystal planes
of spacing d. The beam reflected from the second plane travels a
distance 2d sin θ greater than the beam reflected from the first plane.


Consider the set of atoms shown in Figure 3.5, which represents a small portion
of a two-dimensional slice of the crystal. The X rays are reflected from individual
atoms in all directions, but in only one direction will the scattered “wavelets”
constructively interfere to produce a reflected beam, and in this case we can
regard the reflection as occurring from a plane drawn through the row of atoms.
(This situation is identical with the reflection of light from a mirror: only in
one direction will there be a beam of reflected light, and in that direction we can
regard the reflection as occurring on a plane with the angle of incidence equal to
the angle of reflection.)

Suppose the rows of atoms are a distance d apart in the crystal. Then a portion
of the beam is reflected from the front plane, and a portion is reflected from the
second plane, and so forth. The wave fronts of the beam reflected from the second
plane lag behind those reflected from the front plane, because the wave reflected
from the second plane must travel an additional distance of 2d sin θ, where θ
is the angle of incidence as measured from the face of the crystal. (Note that
this is different from the usual procedure in optics, in which angles are defined
with respect to the normal to the surface.) If this path difference is a whole
number of wavelengths, the reflected beams interfere constructively and give an
intensity maximum; thus the basic expression for the interference maxima in
X-ray diffraction from a crystal is

$$2d \sin\theta = n\lambda \qquad n = 1, 2, 3, \ldots \tag{3.18}$$

This result is known as Bragg’s law for X-ray diffraction. Notice the factor of 2
that appears in Eq. 3.18 but does not appear in the otherwise similar expression
of Eq. 3.17 for the ordinary diffraction grating.

Example 3.1

A single crystal of table salt (NaCl) is irradiated with a beam of X rays of
wavelength 0.250 nm, and the first Bragg reflection is observed at an angle of
26.3°. What is the atomic spacing of NaCl?

Solution

Solving Bragg’s law for the spacing d with n = 1, we have

$$d = \frac{\lambda}{2\sin\theta} = \frac{0.250\ \text{nm}}{2\sin 26.3°} = 0.282\ \text{nm}$$
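
The arithmetic of Example 3.1 can be reproduced with a few lines of code. This is a minimal sketch of Bragg’s law solved for d; the function name `bragg_spacing` is a hypothetical helper, and the inputs are the wavelength and angle given in the example.

```python
import math

def bragg_spacing(wavelength_nm, theta_deg, n=1):
    """Solve 2*d*sin(theta) = n*lambda for d; theta is measured from the crystal face."""
    return n * wavelength_nm / (2 * math.sin(math.radians(theta_deg)))

d_nacl = bragg_spacing(0.250, 26.3)   # first-order reflection, n = 1
print(f"d = {d_nacl:.3f} nm")
```
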













FIGURE 3.6 An incident beam of X 
rays can be reflected from many dif- 
ferent crystal planes. 


Our drawing of Figure 3.5 was very arbitrary—we had no basis for choosing 
which set of atoms to draw the reflecting planes through. Figure 3.6 shows a larger 
section of the crystal. As you can see, there are many possible reflecting planes, 
each with a different value of θ and d. (Of course, $d_i$ and $\theta_i$ are related and cannot
be varied independently.) If we used a beam of X rays of a single wavelength,
it might be difficult to find the proper angle and set of planes to observe the
interference. However, if we use a beam of X rays of a continuous range of
wavelengths, for each $d_i$ and $\theta_i$ interference will occur for a certain wavelength
$\lambda_i$, and so there will be a pattern of interference maxima appearing at different
angles of reflection as shown in Figure 3.6. The pattern of interference maxima 
depends on the spacing and the type of arrangement of the atoms in the crystal. 
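
The way a continuous spectrum "selects" its own wavelengths can be sketched numerically. For one set of planes with spacing $d_i$ at angle $\theta_i$, the function below lists the wavelengths within an incident range that satisfy 2d sin θ = nλ. The function name, the spacing and angle (borrowed from the NaCl example), and the 0.10 to 0.30 nm range are assumptions made for illustration only.

```python
import math

def selected_wavelengths(d_nm, theta_deg, lam_min_nm, lam_max_nm):
    """Return (n, lambda) pairs in [lam_min, lam_max] with 2*d*sin(theta) = n*lambda."""
    if lam_min_nm <= 0:
        raise ValueError("lam_min_nm must be positive")
    path = 2 * d_nm * math.sin(math.radians(theta_deg))  # extra path length in nm
    matches = []
    n = 1
    while path / n >= lam_min_nm:     # higher orders give shorter wavelengths
        lam = path / n
        if lam <= lam_max_nm:
            matches.append((n, lam))
        n += 1
    return matches

# One set of planes (assumed values) in a beam spanning 0.10 nm to 0.30 nm.
lines = selected_wavelengths(0.282, 26.3, 0.10, 0.30)
for n, lam in lines:
    print(f"order {n}: wavelength = {lam:.4f} nm")
```

Each set of planes picks out its own wavelengths in this way, so a beam with a continuous spectrum always finds some wavelength that produces a maximum for every plane orientation.
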

Figure 3.7 shows sample patterns (called Laue patterns) that are obtained
from X-ray scattering from two different crystals. The bright dots correspond to
interference maxima for wavelengths from the range of incident wavelengths that
happen to satisfy Eq. 3.18. The three-dimensional pattern is more complicated
than our two-dimensional drawings, but the individual dots have the same
interpretation. Figure 3.8 shows the pattern obtained from a sample that consists
of many tiny crystals, rather than one single crystal. (It looks like Figure 3.7b
or 3.7c rotated rapidly about its center.) From such pictures it is also possible to
deduce crystal structures and lattice spacing.

FIGURE 3.7 (a) Apparatus for observing X-ray scattering by a crystal. An interference maximum (dot) appears on the
film whenever a set of crystal planes happens to satisfy the Bragg condition for a particular wavelength. (b) Laue pattern
of TiO$_2$ crystal. (c) Laue pattern of a polyethylene crystal. The differences between the two Laue patterns are due to the
differences in the geometric structure of the two crystals.

All of the examples we have discussed in this section depend on the wave
properties of electromagnetic radiation. However, as we now begin to discuss,
there are other experiments that cannot be explained if we regard electromagnetic
radiation as waves.

3.2 THE PHOTOELECTRIC EFFECT


We’ll now turn to our discussion of the first of three experiments that cannot be
explained by the wave theory of light. When a metal surface is illuminated with
light, electrons can be emitted from the surface. This phenomenon, known as the
photoelectric effect, was discovered by Heinrich Hertz in 1887 in the process
of his research into electromagnetic radiation. The emitted electrons are called
photoelectrons.

A sample experimental arrangement for observing the photoelectric effect
is illustrated in Figure 3.9. Light falling on a metal surface (the emitter) can
release electrons, which travel to the collector. The experiment must be done
in an evacuated tube, so that the electrons do not lose energy in collisions with
molecules of the air. Among the properties that can be measured are the rate of
electron emission and the maximum kinetic energy of the photoelectrons.*


*The electrons can be emitted with many different kinetic energies, depending on how tightly bound
they are to the metal. Here we are concerned only with the maximum kinetic energy, which depends
on the energy needed to remove the least tightly bound electron from the surface of the metal.


FIGURE 3.8 (a) Apparatus for observing X-ray scattering from a powdered
or polycrystalline sample. Because the individual crystals have
many different orientations, each scattered ray of Figure 3.7 becomes a
cone which forms a circle on the film. (b) Diffraction pattern (known as
Debye-Scherrer pattern) of polycrystalline gold.

FIGURE 3.9 Apparatus for observing the photoelectric effect. The flow of
electrons from the emitter to the collector is measured by the ammeter A
as a current i in the external circuit. A variable voltage source $V_{\text{ext}}$
establishes a potential difference between the emitter and collector, which is
measured by the voltmeter V.

TABLE 3.1 Some Photoelectric Work Functions

Material    $\phi$ (eV)
Na          2.28
Al          4.08
Co          3.90
Cu          4.70
Zn          4.31
Ag          4.73
Pt          6.35
Pb          4.14


The rate of electron emission can be measured as an electric current i by an
ammeter in the external circuit. The maximum kinetic energy of the electrons
can be measured by applying a negative potential to the collector that is just 
enough to repel the most energetic electrons, which then do not have enough 
energy to “climb” the potential energy hill. That is, if the potential difference 
between the emitter and the collector is AV (a negative quantity), then electrons 
traveling from the emitter to the collector would gain a potential energy of 
AU =q AV = —e AV (a positive quantity) and would lose the same amount of 
kinetic energy. Electrons leaving the emitter with a kinetic energy smaller than 
this AU cannot reach the collector and are pushed back toward the emitter. 

As the magnitude of the potential difference is increased, at some point even the
most energetic electrons do not have enough kinetic energy to reach the collector.
This potential, called the stopping potential $V_s$, is determined by increasing the
magnitude of the voltage until the ammeter current drops to zero. At this point
the maximum kinetic energy $K_{\max}$ of the electrons as they leave the emitter is just
equal to the kinetic energy $eV_s$ lost by the electrons in “climbing” the hill:

$$K_{\max} = eV_s \tag{3.19}$$

where $e$ is the magnitude of the electric charge of the electron. Typical values of
$V_s$ are a few volts.*

In the classical picture, the surface of the metal is illuminated by an
electromagnetic wave of intensity $I$. The surface absorbs energy from the wave until
the energy exceeds the binding energy of the electron to the metal, at which
point the electron is released. The minimum quantity of energy needed to remove
an electron is called the work function $\phi$ of the material. Table 3.1 lists some
values of the work function of different materials. You can see that the values are
typically a few electron-volts.


The Classical Theory of the Photoelectric Effect


What does the classical wave theory predict about the properties of the emitted 
photoelectrons? 


1. The maximum kinetic energy of the electrons should be proportional to the 
intensity of the radiation. As the brightness of the light source is increased, 
more energy is delivered to the surface (the electric field is greater) and 
the electrons should be released with greater kinetic energies. Equivalently, 
increasing the intensity of the light source increases the electric field $E$ of the 
wave, which also increases the force $F = -eE$ on the electron and its kinetic 
energy when it eventually leaves the surface. 

2. The photoelectric effect should occur for light of any frequency or wavelength. 
According to the wave theory, as long as the light is intense enough to release 
electrons, the photoelectric effect should occur no matter what the frequency 
or wavelength. 

3. The first electrons should be emitted in a time interval of the order of seconds
after the radiation begins to strike the surface. In the wave theory, the energy
of the wave is uniformly distributed over the wave front. If the electron
absorbs energy directly from the wave, the amount of energy delivered to any
electron is determined by how much radiant energy is incident on the surface
area in which the electron is confined. Assuming this area is about the size of
an atom, a rough calculation leads to an estimate that the time lag between
turning on the light and observing the first photoelectrons should be of the
order of seconds (see Example 3.2).


*The potential difference $\Delta V$ read by the voltmeter is not equal to the stopping potential when the
emitter and collector are made of different materials. In that case a correction must be applied to
account for the contact potential difference between the emitter and collector.


Example 3.2

A laser beam with an intensity of 120 W/m$^2$ (roughly that
of a small helium-neon laser) is incident on a surface of
sodium. It takes a minimum energy of 2.3 eV to release
an electron from sodium (the work function $\phi$ of sodium).
Assuming the electron to be confined to an area of radius
equal to that of a sodium atom (0.10 nm), how long will it
take for the surface to absorb enough energy to release an
electron?

Solution

The average power $P_{\rm av}$ delivered by the wave of intensity
$I$ to an area $A$ is $IA$. An atom on the surface displays
a “target area” of $A = \pi r^2 = \pi(0.10 \times 10^{-9}\ \mathrm{m})^2 =
3.1 \times 10^{-20}\ \mathrm{m}^2$. If the entire electromagnetic power is
delivered to the electron, energy is absorbed at the rate
$\Delta E/\Delta t = P_{\rm av}$. The time interval $\Delta t$ necessary to absorb an
energy $\Delta E = \phi$ can be expressed as

$$\Delta t = \frac{\Delta E}{P_{\rm av}} = \frac{\phi}{IA} = \frac{(2.3\ \mathrm{eV})(1.6 \times 10^{-19}\ \mathrm{J/eV})}{(120\ \mathrm{W/m^2})(3.1 \times 10^{-20}\ \mathrm{m^2})} = 0.10\ \mathrm{s}$$

In reality, electrons in metals are not always bound to individual
atoms but instead can be free to roam throughout the
metal. However, no matter what reasonable estimate we
make for the area over which the energy is absorbed, the
characteristic time for photoelectron emission is estimated
to have a magnitude of the order of seconds, in a range
easily accessible to measurement.
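
The estimate in Example 3.2 can be reproduced with a few lines of Python. This is only a sketch of the arithmetic: the function name `classical_time_lag` and the constant `EV_TO_J` are illustrative choices, not part of the text, and the conversion factor is rounded the same way as in the example.

```python
import math

EV_TO_J = 1.6e-19  # joules per electron-volt, rounded as in Example 3.2


def classical_time_lag(intensity_w_m2, work_function_ev, radius_m):
    """Classical estimate of the time needed to absorb the work function.

    Assumes all the power falling on a disk of the given radius is
    delivered to a single electron (the assumption made in Example 3.2).
    """
    area = math.pi * radius_m ** 2           # "target area" A = pi r^2
    power = intensity_w_m2 * area            # P_av = I A
    energy = work_function_ev * EV_TO_J      # Delta E = phi, in joules
    return energy / power                    # Delta t = Delta E / P_av


# Values from Example 3.2: 120 W/m^2, phi = 2.3 eV, r = 0.10 nm
lag = classical_time_lag(120.0, 2.3, 0.10e-9)
print(f"classical time lag: {lag:.2f} s")
```

Changing `radius_m` shows how the estimate scales with the assumed absorbing area: the lag varies as the inverse square of the radius.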





The experimental characteristics of the photoelectric effect were well known
by the year 1902. How do the predictions of the classical theory compare with the
experimental results?


1. For a fixed value of the wavelength or frequency of the light source, the
maximum kinetic energy of the emitted photoelectrons (determined from the
stopping potential) is totally independent of the intensity of the light source.
Figure 3.10 shows a representation of the experimental results. Doubling the
intensity of the source leaves the stopping potential unchanged, indicating no
change in the maximum kinetic energy of the electrons. This experimental
result disagrees with the wave theory, which predicts that the maximum
kinetic energy should depend on the intensity of the light.

2. The photoelectric effect does not occur at all if the frequency of the light
source is below a certain value. This value, which is characteristic of the
kind of metal surface used in the experiment, is called the cutoff frequency $f_c$.
Above $f_c$, any light source, no matter how weak, will cause the emission of
photoelectrons; below $f_c$, no light source, no matter how strong, will cause the
emission of photoelectrons. This experimental result also disagrees with the
predictions of the wave theory.

3. The first photoelectrons are emitted virtually instantaneously (within $10^{-9}$ s)
after the light source is turned on. The wave theory predicts a measurable
time delay, so this result also disagrees with the wave theory.


These three experimental results all suggest the complete failure of the wave 
theory to account for the photoelectric effect. 


FIGURE 3.10 The photoelectric current $i$ as a function of the potential
difference $\Delta V$ for two different values of the intensity of the light
(curves labeled $I_1$ and $I_2 = 2I_1$). When the intensity $I$ is doubled, the current
is doubled (twice as many photoelectrons are emitted), but the stopping
potential $V_s$ remains the same.
The Quantum Theory of the Photoelectric Effect


A successful theory of the photoelectric effect was developed in 1905 by Albert 
Einstein. Five years earlier, in 1900, the German physicist Max Planck had 
developed a theory to explain the wavelength distribution of light emitted by 
hot, glowing objects (called thermal radiation, which is discussed in the next 
section of this chapter). Based partly on Planck’s ideas, Einstein proposed that the 
energy of electromagnetic radiation is not continuously distributed over the wave 
front, but instead is concentrated in localized bundles or quanta (also known as 
photons). The energy of a photon associated with an electromagnetic wave of 
frequency f is 


$$E = hf \tag{3.20}$$

where $h$ is a proportionality constant known as Planck’s constant. The photon
energy can also be related to the wavelength of the electromagnetic wave by
substituting $f = c/\lambda$, which gives

$$E = \frac{hc}{\lambda} \tag{3.21}$$

We often speak about photons as if they were particles, and as concentrated
bundles of energy they have particlelike properties. Like the electromagnetic
waves, photons travel at the speed of light, and so they must obey the relativistic
relationship $p = E/c$. Combining this with Eq. 3.21, we obtain

$$p = \frac{h}{\lambda} \tag{3.22}$$

Photons carry linear momentum as well as energy, and thus they share this
characteristic property of particles.

Because a photon travels at the speed of light, it must have zero mass. Otherwise
its energy and momentum would be infinite. Similarly, a photon’s rest energy
$E_0 = mc^2$ must also be zero.

In Einstein’s interpretation, a photoelectron is released as a result of an
encounter with a single photon. The entire energy of the photon is delivered
instantaneously to a single photoelectron. If the photon energy $hf$ is greater than
the work function $\phi$ of the material, the photoelectron will be released. If the
photon energy is smaller than the work function, the photoelectric effect will not
occur. This explanation thus accounts for two of the failures of the wave theory:
the existence of the cutoff frequency and the lack of any measurable time delay.

If the photon energy $hf$ exceeds the work function, the excess energy appears
as the kinetic energy of the electron:

$$K_{\max} = hf - \phi \tag{3.23}$$

The intensity of the light source does not appear in this expression! For a fixed
frequency, doubling the intensity of the light means that twice as many photons
strike the surface and twice as many photoelectrons are released, but they all have
precisely the same maximum kinetic energy.


You can think of Eq. 3.23 as giving a relationship between energy quantities in
analogy to making a purchase at a store. The quantity $hf$ represents the payment
you hand to the cashier, the quantity $\phi$ represents the cost of the object, and $K_{\max}$
represents the change you receive. In the photoelectric effect, $hf$ is the amount
of energy that is available to “purchase” an electron from the surface, the work
function $\phi$ is the “cost” of removing the least tightly bound electron from the
surface, and the difference between the available energy and the removal cost is
the leftover energy that appears as the kinetic energy of the emitted electron. (The
more tightly bound electrons have a greater “cost” and so emerge with smaller
kinetic energies.)

A photon that supplies an energy equal to $\phi$, exactly the minimum amount
needed to remove an electron, corresponds to light of frequency equal to the cutoff
frequency $f_c$. At this frequency, there is no excess energy for kinetic energy, so
Eq. 3.23 becomes $hf_c = \phi$, or

$$f_c = \frac{\phi}{h} \tag{3.24}$$

The corresponding cutoff wavelength $\lambda_c = c/f_c$ is

$$\lambda_c = \frac{hc}{\phi} \tag{3.25}$$

The cutoff wavelength represents the largest wavelength for which the
photoelectric effect can be observed for a surface with the work function $\phi$.
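
Equations 3.23 and 3.25 translate directly into a short calculation. The sketch below is illustrative only: the function names are invented for this example, and it assumes the rounded value $hc \approx 1240$ eV·nm so that wavelengths in nanometers give energies in electron-volts. The work function used is the sodium value from Table 3.1.

```python
HC_EV_NM = 1240.0  # hc in eV*nm (rounded; an assumption of this sketch)


def photon_energy_ev(wavelength_nm):
    """Photon energy E = hc / lambda, in eV."""
    return HC_EV_NM / wavelength_nm


def cutoff_wavelength_nm(work_function_ev):
    """Largest wavelength that can release photoelectrons (Eq. 3.25)."""
    return HC_EV_NM / work_function_ev


def max_kinetic_energy_ev(wavelength_nm, work_function_ev):
    """K_max = hf - phi (Eq. 3.23); None if the photon is below cutoff."""
    excess = photon_energy_ev(wavelength_nm) - work_function_ev
    return excess if excess > 0 else None


PHI_NA = 2.28  # work function of sodium in eV, from Table 3.1

print(cutoff_wavelength_nm(PHI_NA))          # cutoff wavelength in nm
print(max_kinetic_energy_ev(400.0, PHI_NA))  # violet light: emission occurs
print(max_kinetic_energy_ev(650.0, PHI_NA))  # red light: None, no emission
```

Because $K_{\max} = eV_s$ (Eq. 3.19), a value of $K_{\max}$ in electron-volts is numerically equal to the stopping potential in volts. Note that the light intensity never enters any of these functions.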

The photon theory appears to explain all of the observed features of the 
photoelectric effect. The most detailed test of the theory was done by Robert 
Millikan in 1915. Millikan measured the maximum kinetic energy (stopping 
potential) for different frequencies of the light and obtained a plot of Eq. 3.23. A 
sample of his results is shown in Figure 3.11. From the slope of the line, Millikan 
obtained a value for Planck’s constant of 


$$h = 6.57 \times 10^{-34}\ \mathrm{J \cdot s}$$


In part for his detailed experiments on the photoelectric effect, Millikan was 
awarded the 1923 Nobel Prize in physics. Einstein was awarded the 1921 Nobel 
Prize for his photon theory as applied to the photoelectric effect. 

As we discuss in the next section, the wavelength distribution of thermal 
radiation also yields a value for Planck’s constant, which is in good agreement 
with Millikan’s value derived from the photoelectric effect. Planck’s constant is 
one of the fundamental constants of nature; just as c is the characteristic constant 
of relativity, $h$ is the characteristic constant of quantum mechanics. The value of
Planck’s constant has been measured to great precision in a variety of experiments.
The presently accepted value is

$$h = 6.6260696 \times 10^{-34}\ \mathrm{J \cdot s}$$

This is an experimentally determined value, with a relative uncertainty of about
$5 \times 10^{-8}$ ($\pm 3$ units in the last digit).
Robert A. Millikan (1868-1953, United States). Perhaps the best
experimentalist of his era, his work included the precise determination of
Planck’s constant using the photoelectric effect (for which he received the
1923 Nobel Prize) and the measurement of the charge of the electron (using
his famous “oil-drop” apparatus).
FIGURE 3.11 Millikan’s results for the photoelectric effect in sodium. The
slope of the line is $h/e$; the experimental determination of the slope gives a
way of determining Planck’s constant. The intercept should give the cutoff
frequency; however, in Millikan’s time the contact potentials of the electrodes
were not known precisely and so the vertical scale is displaced by a few tenths
of a volt. The slope is not affected by this correction.
Example 3.3

(a) What are the energy and momentum of a photon of red
light of wavelength 650 nm? (b) What is the wavelength of
a photon of energy 2.40 eV?

Solution

(a) Using Eq. 3.21 we obtain

$$E = \frac{hc}{\lambda} = \frac{(6.63 \times 10^{-34}\ \mathrm{J \cdot s})(3.00 \times 10^{8}\ \mathrm{m/s})}{650 \times 10^{-9}\ \mathrm{m}} = 3.06 \times 10^{-19}\ \mathrm{J}$$

Converting to electron-volts, we have

$$E = \frac{3.06 \times 10^{-19}\ \mathrm{J}}{1.60 \times 10^{-19}\ \mathrm{J/eV}} = 1.91\ \mathrm{eV}$$

This type of problem can be simplified if we express the
combination $hc$ in units of eV·nm:

$$E = \frac{hc}{\lambda} = \frac{1240\ \mathrm{eV \cdot nm}}{650\ \mathrm{nm}} = 1.91\ \mathrm{eV}$$

The momentum is found in a similar way, using Eq. 3.22:

$$p = \frac{h}{\lambda} = \frac{1}{c}\,\frac{hc}{\lambda} = \frac{1}{c}\,\frac{1240\ \mathrm{eV \cdot nm}}{650\ \mathrm{nm}} = 1.91\ \mathrm{eV}/c$$

The momentum could also be found directly from the energy:

$$p = \frac{E}{c} = \frac{1.91\ \mathrm{eV}}{c} = 1.91\ \mathrm{eV}/c$$

(It may be helpful to review the discussion in Example 2.11
about these units of momentum.)

(b) Solving Eq. 3.21 for $\lambda$, we find

$$\lambda = \frac{hc}{E} = \frac{1240\ \mathrm{eV \cdot nm}}{2.40\ \mathrm{eV}} = 517\ \mathrm{nm}$$
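
As a cross-check on Example 3.3, the same quantities can be computed in SI units. This is a minimal sketch using the three-significant-figure constants quoted in the example; the variable names are illustrative. With these rounded constants the product $hc$ comes out near 1243 eV·nm, which rounds to the 1240 eV·nm used in the text.

```python
import math

H = 6.63e-34   # Planck's constant in J*s (rounded as in Example 3.3)
C = 3.00e8     # speed of light in m/s (rounded)
EV = 1.60e-19  # joules per electron-volt (rounded)

wavelength_m = 650e-9

E_j = H * C / wavelength_m   # photon energy in joules, Eq. 3.21
E_ev = E_j / EV              # the same energy in electron-volts
p = H / wavelength_m         # photon momentum in kg*m/s, Eq. 3.22

print(f"E = {E_j:.3g} J = {E_ev:.3g} eV")
print(f"p = {p:.3g} kg*m/s")

# For a photon, p*c equals E, so p expressed in eV/c has the same
# numerical value as E expressed in eV.
print(math.isclose(p * C, E_j, rel_tol=1e-12))

# hc expressed in eV*nm, for comparison with the rounded value 1240.
hc_ev_nm = H * C / EV * 1e9
print(f"hc = {hc_ev_nm:.0f} eV*nm")
```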








Example 3.4

The work function for tungsten metal is 4.52 eV. (a) What
is the cutoff wavelength $\lambda_c$ for tungsten? (b) What is the
maximum kinetic energy of the electrons when radiation
of wavelength 198 nm is used? (c) What is the stopping
potential in this case?

Solution

(a) Equation 3.25 gives

$$\lambda_c = \frac{hc}{\phi} = \frac{1240\ \mathrm{eV \cdot nm}}{4.52\ \mathrm{eV}} = 274\ \mathrm{nm}$$

in the ultraviolet region.

(b) At the shorter wavelength,

$$K_{\max} = hf - \phi = \frac{hc}{\lambda} - \phi = \frac{1240\ \mathrm{eV \cdot nm}}{198\ \mathrm{nm}} - 4.52\ \mathrm{eV} = 1.74\ \mathrm{eV}$$

(c) The stopping potential is the voltage corresponding
to $K_{\max}$:

$$V_s = \frac{K_{\max}}{e} = \frac{1.74\ \mathrm{eV}}{e} = 1.74\ \mathrm{V}$$





3.3 THERMAL RADIATION 


The second type of experiment we discuss that cannot be explained by the classical 
wave theory is thermal radiation, which is the electromagnetic radiation emitted 
by all objects because of their temperature. At room temperature the thermal 
radiation is mostly in the infrared region of the spectrum, where our eyes are not 
sensitive. As we heat objects to higher temperatures, they may emit visible light. 


A typical experimental arrangement is shown in Figure 3.12. An object is
maintained at a temperature $T_1$. The radiation emitted by the object is detected
by an apparatus that is sensitive to the wavelength of the radiation. For example,
a dispersive medium such as a prism can be used so that different wavelengths
appear at different angles $\theta$. By moving the radiation detector to different angles
$\theta$ we can measure the intensity* of the radiation at a specific wavelength. The
detector is not a geometrical point (hardly an efficient detector!) but instead
subtends a small range of angles $\Delta\theta$, so what we really measure is the amount of
radiation in some range $\Delta\theta$ at $\theta$, or, equivalently, in some range $\Delta\lambda$ at $\lambda$.

Many experiments were done in the late 19th century to study the wavelength 
spectrum of thermal radiation. These experiments, as we shall see, gave results that 
totally disagreed with the predictions of the classical theories of thermodynamics 
and electromagnetism; instead, the successful analysis of the experiments provided 
the first evidence of the quantization of energy, which would eventually be seen 
as the basis for the new quantum theory. 

Let’s first review the experimental results. The goal of these experiments was
to measure the intensity of the radiation emitted by the object as a function of
wavelength. Figure 3.13 shows a typical set of experimental results when the
object is at a temperature $T_1 = 1000$ K. If we now change the temperature of the
object to a different value $T_2$, we obtain a different curve, as shown in Figure 3.13
for $T_2 = 1250$ K. If we repeat the measurement for many different temperatures,
we obtain systematic results for the radiation intensity that reveal two important
characteristics:


1. The total intensity radiated over all wavelengths (that is, the area under each
curve) increases as the temperature is increased. This is not a surprising result:
we commonly observe that a glowing object glows brighter and thus radiates
more energy as we increase its temperature. From careful measurement, we
find that the total intensity increases as the fourth power of the absolute or
kelvin temperature:

$$I = \sigma T^4 \tag{3.26}$$

where we have introduced the proportionality constant $\sigma$. Equation 3.26 is
called Stefan’s law and the constant $\sigma$ is called the Stefan-Boltzmann constant.
Its value can be determined from experimental results such as those illustrated
in Figure 3.13:

$$\sigma = 5.67037 \times 10^{-8}\ \mathrm{W/m^2 \cdot K^4}$$

2. The wavelength $\lambda_{\max}$ at which the emitted intensity reaches its maximum
value decreases as the temperature is increased, in inverse proportion to the
temperature: $\lambda_{\max} \propto 1/T$. From results such as those of Figure 3.13, we can
determine the proportionality constant, so that

$$\lambda_{\max} T = 2.8978 \times 10^{-3}\ \mathrm{m \cdot K} \tag{3.27}$$

This result is known as Wien’s displacement law; the term “displacement”
refers to the way the peak is moved or displaced as the temperature is
varied. Wien’s law is qualitatively consistent with our common observation
that heated objects first begin to glow with a red color, and at higher
temperatures the color becomes more yellow. As the temperature is increased, the
wavelength at which most of the radiation is emitted moves from the
longer-wavelength (red) part of the visible region toward medium wavelengths. The
term “white hot” refers to an object that is hot enough to produce the mixture
of all wavelengths in the visible region to make white light.


*As always, intensity means energy per unit time per unit area (or power per unit area), as in Eq. 3.10.
Previously, “unit area” referred to the wave front, such as would be measured if we recorded the waves
with an antenna of a certain area. Here, “unit area” indicates the electromagnetic radiation emitted
from each unit area of the surface of the object whose thermal emissions are being observed.

FIGURE 3.12 Measurement of the spectrum of thermal radiation. A
device such as a prism is used to separate the wavelengths emitted by the
object.

FIGURE 3.13 A possible result of the measurement of the radiation intensity
over many different wavelengths. Each different temperature of the emitting
body gives a different peak $\lambda_{\max}$.


Example 3.5

(a) At what wavelength does a room-temperature ($T =
20^\circ$C) object emit the maximum thermal radiation?
(b) To what temperature must we heat it until its peak
thermal radiation is in the red region of the spectrum
($\lambda = 650$ nm)? (c) How many times as much thermal
radiation does it emit at the higher temperature?

Solution

(a) Using the absolute temperature, $T_1 = 273 + 20 =
293$ K, Wien’s displacement law gives

$$\lambda_{\max} = \frac{2.8978 \times 10^{-3}\ \mathrm{m \cdot K}}{T_1} = \frac{2.8978 \times 10^{-3}\ \mathrm{m \cdot K}}{293\ \mathrm{K}} = 9.89\ \mu\mathrm{m}$$

This is in the infrared region of the electromagnetic
spectrum.

(b) For $\lambda_{\max} = 650$ nm, we again use Wien’s displacement
law to find the new temperature $T_2$:

$$T_2 = \frac{2.8978 \times 10^{-3}\ \mathrm{m \cdot K}}{\lambda_{\max}} = \frac{2.8978 \times 10^{-3}\ \mathrm{m \cdot K}}{650 \times 10^{-9}\ \mathrm{m}} = 4460\ \mathrm{K}$$

(c) The total intensity of radiation is proportional to $T^4$, so
the ratio of the total thermal emissions will be

$$\frac{I_2}{I_1} = \frac{\sigma T_2^4}{\sigma T_1^4} = \frac{(4460\ \mathrm{K})^4}{(293\ \mathrm{K})^4} = 5.37 \times 10^{4}$$

Be sure to notice the use of absolute (kelvin) temperatures
in this example.
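
The steps of Example 3.5 can be sketched in Python using the constants of Eqs. 3.26 and 3.27. The function and variable names here are illustrative, not from the text. Because the code keeps the unrounded temperature for part (b) rather than 4460 K, the intensity ratio it prints is slightly smaller than the value quoted in the example.

```python
WIEN_CONSTANT = 2.8978e-3  # m*K, from Eq. 3.27
SIGMA = 5.67037e-8         # W/(m^2*K^4), Stefan-Boltzmann constant


def peak_wavelength_m(temperature_k):
    """Wavelength of maximum emission from Wien's displacement law."""
    return WIEN_CONSTANT / temperature_k


def temperature_for_peak_k(wavelength_m):
    """Temperature at which the emission peaks at the given wavelength."""
    return WIEN_CONSTANT / wavelength_m


def total_intensity(temperature_k):
    """Total radiated intensity from Stefan's law, in W/m^2."""
    return SIGMA * temperature_k ** 4


T1 = 273 + 20                          # room temperature in kelvins
lam1 = peak_wavelength_m(T1)           # part (a): infrared peak
T2 = temperature_for_peak_k(650e-9)    # part (b): peak in the red
ratio = total_intensity(T2) / total_intensity(T1)  # part (c)

print(f"peak at {lam1 * 1e6:.2f} micrometers")
print(f"T2 = {T2:.0f} K")
print(f"intensity ratio = {ratio:.3g}")
```

Passing a Celsius temperature to these functions by mistake would give badly wrong results, which is the point of the example's closing reminder about kelvin temperatures.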





























FIGURE 3.14 A cavity filled with electromagnetic radiation in thermal
equilibrium with its walls at temperature $T$. Some radiation escapes
through the hole, which represents an ideal blackbody.


The theoretical analysis of the emission of thermal radiation from an arbitrary 
object is extremely complicated. It depends on details of the surface properties 
of the object, and it also depends on how much radiation the object reflects from 
its surroundings. To simplify our analysis, we consider a special type of object 
called a blackbody, which absorbs all radiation incident on it and reflects none of 
the incident radiation. 

To simplify further, we consider a special type of blackbody: a hole in a 
hollow metal box whose walls are in thermal equilibrium at temperature $T$. The 
box is filled with electromagnetic radiation that is emitted and reflected by the 
walls. A small hole in one wall of the box allows some of the radiation to escape 
(Figure 3.14). It is the hole, and not the box itself, that is the blackbody. Radiation 
from outside that is incident on the hole gets lost inside the box and has a 
negligible chance of reemerging from the hole; thus no reflections occur from the 
blackbody (the hole). The radiation that emerges from the hole is just a sample 
of the radiation inside the box, so understanding the nature of the radiation inside 
the box allows us to understand the radiation that leaves through the hole. 


Let’s consider the radiation inside the box. It has an energy density (energy
per unit volume) per unit wavelength interval $u(\lambda)$. That is, if we could look into
the interior of the box and measure the energy density of the electromagnetic
radiation with wavelengths between $\lambda$ and $\lambda + d\lambda$ in a small volume element,
the result would be $u(\lambda)\,d\lambda$. For the radiation in this wavelength interval, what
is the corresponding intensity (power per unit area) emerging from the hole? At
any particular instant, half of the radiation in the box will be moving away from
the hole. The other half of the radiation is moving toward the hole at velocity
of magnitude $c$ but directed over a range of angles. Averaging over this range
of angles to evaluate the energy flowing perpendicular to the surface of the hole
introduces another factor of 1/2, so the contribution of the radiation in this small
wavelength interval to the intensity passing through the hole is

$$I(\lambda) = \frac{c}{4}\,u(\lambda) \tag{3.28}$$


The quantity $I(\lambda)\,d\lambda$ is the radiant intensity in the small interval $d\lambda$ at the
wavelength $\lambda$. This is the quantity whose measurement gives the results displayed
in Figure 3.13. Each data point represents a measurement of the intensity in a small
wavelength interval. The goal of the theoretical analysis is to find a mathematical
function $I(\lambda)$ that gives a smooth fit through the data points of Figure 3.13.

If we wish to find the total intensity emitted in the region between wavelengths
$\lambda_1$ and $\lambda_2$, we divide the region into narrow intervals $d\lambda$ and add the intensities in
each interval, which is equivalent to the integral between those limits:

$$I(\lambda_1 : \lambda_2) = \int_{\lambda_1}^{\lambda_2} I(\lambda)\,d\lambda \tag{3.29}$$

This is similar to Eq. 1.27 for determining the number of molecules with energies
between two limits. The total emitted intensity can be found by integrating over
all wavelengths:

$$I = \int_0^{\infty} I(\lambda)\,d\lambda \tag{3.30}$$

This total intensity should work out to be proportional to the 4th power of the
temperature, as required by Stefan’s law (Eq. 3.26).


Classical Theory of Thermal Radiation 


Before discussing the quantum theory of thermal radiation, let’s see what the
classical theories of electromagnetism and thermodynamics can tell us about the
dependence of $I$ on $\lambda$. The complete derivation is not given here, only a brief
outline of the theory.* The derivation involves first computing the amount of
radiation (number of waves) at each wavelength and then finding the contribution
of each wave to the total energy in the box.


*For a more complete derivation, see R. Eisberg and R. Resnick, Quantum Theory of Atoms, Molecules,
Solids, Nuclei, and Particles, 2nd edition (Wiley, 1985), pp. 9-13.
1. The box is filled with electromagnetic standing waves. If the walls of the box
are metal, radiation is reflected back and forth with a node of the electric field
at each wall (the electric field must vanish inside a conductor). This is the
same condition that applies to other standing waves, like those on a stretched
string or a column of air in an organ pipe.

2. The number of standing waves with wavelengths between $\lambda$ and $\lambda + d\lambda$ is

$$N(\lambda)\,d\lambda = \frac{8\pi V}{\lambda^4}\,d\lambda \tag{3.31}$$

where $V$ is the volume of the box. For one-dimensional standing waves, as
on a stretched string of length $L$, the allowed wavelengths are $\lambda = 2L/n$ ($n =
1, 2, 3, \ldots$). The number of possible standing waves with wavelengths between
$\lambda_1$ and $\lambda_2$ is $n_2 - n_1 = 2L(1/\lambda_2 - 1/\lambda_1)$. In the small interval from $\lambda$ to
$\lambda + d\lambda$, the number of standing waves is $N(\lambda)\,d\lambda = |dn/d\lambda|\,d\lambda = (2L/\lambda^2)\,d\lambda$.
Equation 3.31 can be obtained by extending this approach to three dimensions.

3. Each individual wave contributes an average energy of $kT$ to the radiation
in the box. This result follows from an analysis similar to that of Section 1.3
for the statistical mechanics of gas molecules. In this case we are interested
in the statistics of the oscillating atoms in the walls of the cavity, which are
responsible for setting up the standing electromagnetic waves in the cavity.
For a one-dimensional oscillator, the energies are distributed according to the
Maxwell-Boltzmann distribution:*

$$N(E) = \frac{N}{kT}\,e^{-E/kT} \tag{3.32}$$

Recall from Section 1.3 that $N(E)$ is defined so that the number of oscillators
with energies between $E$ and $E + dE$ is $dN = N(E)\,dE$, and thus the total
number of oscillators at all energies is $\int dN = \int_0^{\infty} N(E)\,dE$, which (as you
should show) works out to $N$. The average energy per oscillator is then found
in the same way as the average energy of a gas molecule (Eq. 1.25):

$$E_{\rm av} = \frac{1}{N}\int_0^{\infty} E\,N(E)\,dE = \frac{1}{kT}\int_0^{\infty} E\,e^{-E/kT}\,dE \tag{3.33}$$

which does indeed work out to $E_{\rm av} = kT$.
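
The claim that Eq. 3.33 gives $E_{\rm av} = kT$ can be checked numerically. The sketch below is only a rough numerical illustration: it uses a simple midpoint rule, and the function name, the number of steps, and the cutoff of the integral at $50\,kT$ are all choices made for this example (the integrand is negligibly small beyond that point).

```python
import math


def average_energy(kT, n_steps=200_000, e_max_factor=50.0):
    """Midpoint-rule estimate of (1/kT) * integral of E exp(-E/kT) dE.

    The integral in Eq. 3.33 runs from 0 to infinity; here it is cut off
    at e_max_factor * kT, where the exponential factor is negligible.
    """
    e_max = e_max_factor * kT
    dE = e_max / n_steps
    total = 0.0
    for i in range(n_steps):
        E = (i + 0.5) * dE                   # midpoint of the i-th slice
        total += E * math.exp(-E / kT) * dE
    return total / kT


# With kT = 1 (arbitrary energy units) the result should be close to 1.
print(average_energy(1.0))

# With kT = 0.025 eV (roughly room temperature) it should be close to 0.025.
print(average_energy(0.025))
```

The result scales with $kT$ and does not depend on the frequency of the oscillator. That frequency independence is what leads the classical theory into trouble at short wavelengths.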


Putting all these ingredients together, we can find the energy density of
radiation in the wavelength interval $d\lambda$ inside the cavity: energy density = (number
of standing waves per unit volume) × (average energy per standing wave) or

$$u(\lambda)\,d\lambda = \frac{N(\lambda)\,d\lambda}{V}\,kT = \frac{8\pi}{\lambda^4}\,kT\,d\lambda \tag{3.34}$$

The corresponding intensity per unit wavelength interval $d\lambda$ is

$$I(\lambda) = \frac{c}{4}\,u(\lambda) = \frac{c}{4}\,\frac{8\pi}{\lambda^4}\,kT = \frac{2\pi c}{\lambda^4}\,kT \tag{3.35}$$

This result is known as the Rayleigh-Jeans formula; based firmly on the classical
theories of electromagnetism and thermodynamics, it represents our best attempt
to apply classical physics to understanding the problem of blackbody radiation.
In Figure 3.15 the intensity calculated from the Rayleigh-Jeans formula is
compared with typical experimental results. The intensity calculated with
Eq. 3.35 approaches the data at long wavelengths, but at short wavelengths, the
classical theory (which predicts $u \to \infty$ as $\lambda \to 0$) fails miserably. The failure
of the Rayleigh-Jeans formula at short wavelengths is known as the ultraviolet
catastrophe and represents a serious problem for classical physics, because the
theories of thermodynamics and electromagnetism on which the Rayleigh-Jeans
formula is based have been carefully tested in many other circumstances and
found to give extremely good agreement with experiment. It is apparent in the
case of blackbody radiation that the classical theories do not work, and that a
new kind of physical theory is needed.


*The exponential part of this expression is the same as that of Eq. 1.22 for gas molecules, but the rest of
the equation is different, because the statistical behavior of one-dimensional oscillators is different from
that of gas molecules moving in three dimensions. We’ll consider these calculations in greater detail in
Chapter 10.
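
The runaway behavior of the Rayleigh-Jeans formula at short wavelengths is easy to see numerically. The following sketch evaluates Eq. 3.35 at a few wavelengths; the function name, the chosen temperature, and the numerical values of $k$ and $c$ (standard values, not quoted in this section) are assumptions of the example.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (standard value)
C = 2.998e8         # speed of light in m/s


def rayleigh_jeans_intensity(wavelength_m, temperature_k):
    """Classical intensity per unit wavelength, I = 2 pi c k T / lambda^4.

    Units are W/m^2 per meter of wavelength interval.
    """
    return 2 * math.pi * C * K_B * temperature_k / wavelength_m ** 4


T = 1000.0  # temperature in kelvins, as for one curve of Figure 3.13

for wavelength_nm in (10000, 1000, 100, 10):
    lam = wavelength_nm * 1e-9
    print(f"{wavelength_nm:>6} nm: {rayleigh_jeans_intensity(lam, T):.3e}")
```

Each tenfold decrease in wavelength raises the predicted intensity by a factor of $10^4$, with no maximum and no turnover. The integral of Eq. 3.30 therefore diverges, whereas the measured curves of Figure 3.13 peak and then fall toward zero.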


Quantum Theory of Thermal Radiation 


The new physics that gave the correct interpretation of thermal radiation was
proposed by the German physicist Max Planck in 1900. The ultraviolet catastrophe
occurs because the Rayleigh-Jeans formula predicts too much intensity at short
wavelengths (or equivalently at high frequencies). What is needed is a way to
make $u \to 0$ as $\lambda \to 0$, or as $f \to \infty$. Again considering the electromagnetic
standing waves to result from the oscillations of atoms in the walls of the cavity,
Planck tried to find a way to reduce the number of high-frequency standing
waves by reducing the number of high-frequency oscillators. He did this by a
bold assumption that formed the cornerstone of a new physical theory, quantum
physics. Associated with this theory is a new version of mechanics, known
as wave mechanics or quantum mechanics. We discuss the methods of wave
mechanics in Chapter 5; for now we show how Planck’s theory provided the
correct interpretation of the emission spectrum of thermal radiation.

Planck suggested that an oscillating atom can absorb or emit energy only in
discrete bundles. This bold suggestion was necessary to keep the average energy
of a low-frequency (long-wavelength) oscillator equal to $kT$ (in agreement with
the Rayleigh-Jeans law at long wavelength), but it also made the average energy
of a high-frequency (short-wavelength) oscillator approach zero. Let’s see how
Planck managed this remarkable feat.

In Planck’s theory, each oscillator can emit or absorb energy only in quantities
that are integer multiples of a certain basic quantity of energy $\varepsilon$,

$$E_n = n\varepsilon \qquad n = 1, 2, 3, \ldots \tag{3.36}$$

where $n$ is the number of quanta. Furthermore, the energy of each of the quanta is
determined by the frequency

$$\varepsilon = hf \tag{3.37}$$

where $h$ is the constant of proportionality, now known as Planck’s constant. From
the mathematical standpoint, the difference between Planck’s calculation and
the classical calculation using Maxwell-Boltzmann statistics is that the energy
of an oscillator at a certain wavelength or frequency is no longer a continuous
variable; it is a discrete variable that takes only the values given by Eq. 3.36.
The integrals in the classical calculation are then replaced by sums, and the
number of oscillators with energy $E_n$ is then

$$N_n = N\left(1 - e^{-\varepsilon/kT}\right)e^{-n\varepsilon/kT} \tag{3.38}$$

FIGURE 3.15 The failure of the classical Rayleigh-Jeans formula to fit
the observed intensity. At long wavelengths the theory approaches the data,
but at short wavelengths the classical formula fails miserably.

Max Planck (1858-1947, Germany). His work on the spectral distribution
of radiation, which led to the quantum theory, was honored with the
1918 Nobel Prize. In his later years, he wrote extensively on religious and
philosophical topics.

FIGURE 3.16 Planck’s function fits the observed data perfectly.

FIGURE 3.17 Data from the COBE satellite, launched in 1989 to determine
the temperature of the cosmic microwave background radiation from
the early universe. The data points exactly fit the Planck function
corresponding to a temperature of 2.725 K. To appreciate the remarkable precision
of this experiment, note that the sizes of the error bars have been increased
by a factor of 400 to make them visible! (Source: NASA Office of Space
Science)


(Compare this result with Eq. 3.32 for the continuous case.) Here $N_n$ represents
the number of oscillators with energy $E_n$, while $N$ is the total number. You should
be able to show that $\sum_{n=0}^{\infty} N_n = N$, again giving the total number of oscillators when
summed over all possible energies. Planck’s calculation then gives the average
energy:


$$E_{\rm av} = \frac{1}{N}\sum_{n=0}^{\infty} N_n E_n = \left(1 - e^{-\varepsilon/kT}\right)\sum_{n=0}^{\infty} (n\varepsilon)\, e^{-n\varepsilon/kT} \tag{3.39}$$

which gives (see Problem 14)

$$E_{\rm av} = \frac{\varepsilon}{e^{\varepsilon/kT} - 1} = \frac{hf}{e^{hf/kT} - 1} = \frac{hc/\lambda}{e^{hc/\lambda kT} - 1} \tag{3.40}$$


Note from this equation that $E_{\rm av} \cong kT$ at small $f$ (large $\lambda$) but that $E_{\rm av} \to 0$ at
large $f$ (small $\lambda$). Thus the small-wavelength oscillators carry a vanishingly small
energy, and the ultraviolet catastrophe is solved!
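
The two limits of Eq. 3.40 can be checked numerically. The following is a minimal sketch, not part of the original text: it works with the dimensionless ratio $x = hf/kT$ and compares the closed form of Eq. 3.40 with a direct (truncated) evaluation of the sum in Eq. 3.39. The function names and the number of terms kept in the sum are assumptions made for illustration.

```python
import math

def planck_average_energy_ratio(x):
    """Return E_av / kT from Eq. 3.40, where x = hf / kT."""
    # math.expm1(x) = e^x - 1, accurate even when x is very small
    return x / math.expm1(x)

def average_energy_ratio_by_sum(x, terms=5000):
    """Return E_av / kT from the sum in Eq. 3.39, truncated after `terms` terms.

    The truncation is an assumption of this sketch; it is adequate when
    x * terms is large, so that the neglected terms are negligible.
    """
    total = sum(n * x * math.exp(-n * x) for n in range(terms))
    return (1.0 - math.exp(-x)) * total

# Small x (low frequency): ratio approaches 1, so E_av approaches kT.
# Large x (high frequency): ratio approaches 0.
for x in (0.001, 0.1, 1.0, 10.0, 50.0):
    print(f"hf/kT = {x:7.3f}   E_av/kT = {planck_average_energy_ratio(x):.3e}")

# The sum and the closed form should agree.
for x in (0.1, 1.0, 10.0):
    print(x, average_energy_ratio_by_sum(x), planck_average_energy_ratio(x))
```

For small $x$ the ratio stays close to 1 (the classical value), while for large $x$ it falls off exponentially, which is what suppresses the short-wavelength oscillators.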

Based on Planck’s result, the intensity of the radiation then becomes (using 
Eqs. 3.28 and 3.31): 


$$I(\lambda) = \frac{c}{4}\left(\frac{8\pi}{\lambda^4}\right)\left[\frac{hc/\lambda}{e^{hc/\lambda kT} - 1}\right] = \frac{2\pi hc^2}{\lambda^5}\,\frac{1}{e^{hc/\lambda kT} - 1} \tag{3.41}$$


(An alternative approach to deriving this result is given in Section 10.6.) The
perfect agreement between experiment and Planck’s formula is illustrated in
Figure 3.16.

In Problems 15 and 16 at the end of this chapter you will demonstrate that
Planck’s formula can be used to deduce Stefan’s law and Wien’s displacement
law. In fact, deducing Stefan’s law from Planck’s formula results in a relationship
between the Stefan-Boltzmann constant and Planck’s constant:


$$\sigma = \frac{2\pi^5 k^4}{15 c^2 h^3} \tag{3.42}$$


By determining the value of the Stefan-Boltzmann constant from the intensity
data available in 1900, Planck was able to determine a value of the constant $h$:


$$h = 6.56 \times 10^{-34}\ \mathrm{J \cdot s}$$


which agrees very well with the value of $h$ that Millikan deduced 15 years later
based on the analysis of data from the photoelectric effect. The good agreement 
of these two values is remarkable, because they are derived from very different 
kinds of experiments—one involves the emission and the other the absorption of 
electromagnetic radiation. This suggests that the quantization property is not an 
accident arising from the analysis of one particular experiment, but is instead a 
property of the electromagnetic field itself. Along with many other scientists of 
his era, Planck was slow to accept this interpretation. However, later experimental 
evidence (including the Compton effect) proved to be so compelling that it left 
no doubt about Einstein’s photon theory and the particlelike structure of the 
electromagnetic field. 
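
The relationship in Eq. 3.42 between the Stefan-Boltzmann constant and Planck’s constant can be explored numerically. This is a rough sketch under stated assumptions: the values of $h$, $k$, and $c$ are modern reference values that do not appear in the text, and the function names are illustrative only. It evaluates $\sigma$ from $h$ and then inverts the relation to recover $h$ from $\sigma$, which is the direction of the argument described above.

```python
import math

# Modern reference values (assumed here; not quoted in the text)
H_PLANCK = 6.62607015e-34    # J*s
K_BOLTZMANN = 1.380649e-23   # J/K
C_LIGHT = 2.99792458e8       # m/s

def stefan_boltzmann_constant(h):
    """Eq. 3.42: sigma = 2 pi^5 k^4 / (15 c^2 h^3), in W/(m^2 K^4)."""
    return 2 * math.pi**5 * K_BOLTZMANN**4 / (15 * C_LIGHT**2 * h**3)

def planck_constant_from_sigma(sigma):
    """Invert Eq. 3.42 to obtain h from a measured sigma."""
    return (2 * math.pi**5 * K_BOLTZMANN**4 / (15 * C_LIGHT**2 * sigma)) ** (1.0 / 3.0)

sigma = stefan_boltzmann_constant(H_PLANCK)
print(f"sigma from modern h:    {sigma:.4e} W/(m^2 K^4)")
print(f"h recovered from sigma: {planck_constant_from_sigma(sigma):.4e} J*s")

# Because sigma varies as 1/h^3, a 1% change in h shifts sigma by about 3%.
print(f"sigma for h = 6.56e-34: {stefan_boltzmann_constant(6.56e-34):.4e} W/(m^2 K^4)")
```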

Planck’s formula still finds important applications today in the measurement
of temperature. By measuring the intensity of radiation emitted by an object 
at a particular wavelength (or, as in actual experiments, in a small interval of 
wavelengths), Eq. 3.41 can be used to deduce the temperature of the object. Note 
that only one measurement, at any wavelength, is all that is required to obtain 
the temperature. A radiometer is a device for measuring the intensity of thermal 
radiation at selected wavelengths, enabling a determination of temperature. 
Radiometers in orbiting satellites are used to measure the temperature of the land 
and sea areas of the Earth and of the upper surface of clouds. Other orbiting 
radiometers have been aimed toward “empty space” to measure the temperature 
of the radiation from the early history of the universe (Figure 3.17). 


Example 3.6

You are using a radiometer to observe the thermal radiation
from an object that is heated to maintain its temperature at
1278 K. The radiometer records radiation in a wavelength
interval of 12.6 nm. By changing the wavelength at which
you are measuring, you set the radiometer to record the
most intense radiation emission from the object. What is
the intensity of the emitted radiation in this interval?

Solution
The wavelength setting for the most intense radiation is
determined from Wien’s displacement law:

$$\lambda_{\max} = \frac{2.8978 \times 10^{-3}\ \mathrm{m \cdot K}}{T} = \frac{2.8978 \times 10^{-3}\ \mathrm{m \cdot K}}{1278\ \mathrm{K}} = 2.267 \times 10^{-6}\ \mathrm{m} = 2267\ \mathrm{nm}$$

The given temperature corresponds to
$kT = (8.6174 \times 10^{-5}\ \mathrm{eV/K})(1278\ \mathrm{K}) = 0.1101\ \mathrm{eV}$. The radiation intensity
in this small wavelength interval is

$$I(\lambda)\,d\lambda = \frac{2\pi hc^2}{\lambda^5}\,\frac{1}{e^{hc/\lambda kT} - 1}\,d\lambda$$

$$= 2\pi (6.626 \times 10^{-34}\ \mathrm{J \cdot s})(2.998 \times 10^{8}\ \mathrm{m/s})^2 (12.6 \times 10^{-9}\ \mathrm{m})(2.267 \times 10^{-6}\ \mathrm{m})^{-5} \times \left(e^{(1240\ \mathrm{eV \cdot nm})/(2267\ \mathrm{nm})(0.1101\ \mathrm{eV})} - 1\right)^{-1}$$

$$= 552\ \mathrm{W/m^2}$$
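
The arithmetic of Example 3.6 can be reproduced with a short script. This is only a sketch: it uses the constants quoted in the example, evaluates the exponent in eV and nm as the example does, and the function name and argument names are hypothetical.

```python
import math

H_PLANCK = 6.626e-34      # J*s, as quoted in the example
C_LIGHT = 2.998e8         # m/s, as quoted in the example
K_EV_PER_K = 8.6174e-5    # Boltzmann constant in eV/K, as quoted in the example
HC_EV_NM = 1240.0         # hc in eV*nm, as quoted in the example
WIEN_M_K = 2.8978e-3      # Wien displacement constant in m*K

def intensity_at_peak(temperature_k, interval_m):
    """Return (peak wavelength in m, intensity in W/m^2) for a narrow band at the peak."""
    wavelength_m = WIEN_M_K / temperature_k            # Wien's displacement law
    kt_ev = K_EV_PER_K * temperature_k                 # kT in eV
    exponent = HC_EV_NM / (wavelength_m * 1e9 * kt_ev) # hc / (lambda kT), dimensionless
    spectral = 2 * math.pi * H_PLANCK * C_LIGHT**2 / wavelength_m**5 / math.expm1(exponent)
    return wavelength_m, spectral * interval_m         # Eq. 3.41 times d(lambda)

wavelength_m, intensity = intensity_at_peak(1278.0, 12.6e-9)
print(f"lambda_max = {wavelength_m * 1e9:.0f} nm")
print(f"I(lambda) d(lambda) = {intensity:.0f} W/m^2")
```

The result should land close to the 552 W/m² found in the example; small differences come from rounding of the intermediate values.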











3.4 THE COMPTON EFFECT 


Another way for radiation to interact with matter is by means of the Compton 
effect, in which radiation scatters from loosely bound, nearly free electrons. Part 
of the energy of the radiation is given to the electron; the remainder of the energy 
is reradiated as electromagnetic radiation. According to the wave picture, the 
scattered radiation is less energetic than the incident radiation (the difference 
going into the kinetic energy of the electron) but has the same wavelength. As we 
will see, the photon concept leads to a very different prediction for the scattered 
radiation. 

Compton scattering.

Arthur H. Compton (1892-1962, United States). His work on X-ray scattering
verified Einstein’s photon theory and earned him the 1927 Nobel Prize.
He was a pioneer in research with X rays and cosmic rays. During World
War II he directed a portion of the U.S. atomic bomb research.

The scattering process is analyzed simply as an interaction (a “collision” in
the classical sense of particles) between a single photon and an electron, which
we assume to be at rest. Figure 3.18 shows the process. Initially, the photon has
energy $E$ and linear momentum $p$ given by

$$E = hf = \frac{hc}{\lambda} \quad \text{and} \quad p = \frac{E}{c} \tag{3.43}$$

The electron, initially at rest, has rest energy $m_e c^2$. After the scattering, the photon
has energy $E' = hc/\lambda'$ and momentum $p' = E'/c$, and it moves in a direction at an
angle $\theta$ with respect to the direction of the incident photon. The electron has total
final energy $E_e$ and momentum $p_e$ and moves in a direction at an angle $\phi$ with
respect to the initial photon. (To allow for the possibility of high-energy incident
photons giving energetic scattered electrons, we use relativistic kinematics for the
electron.) The conservation laws for total relativistic energy and momentum are
then applied:


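$$E_{\rm initial} = E_{\rm final}: \qquad E + m_e c^2 = E' + E_e \tag{3.44a}$$
$$p_{x,{\rm initial}} = p_{x,{\rm final}}: \qquad p = p_e \cos\phi + p' \cos\theta \tag{3.44b}$$
$$p_{y,{\rm initial}} = p_{y,{\rm final}}: \qquad 0 = p_e \sin\phi - p' \sin\theta \tag{3.44c}$$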


We have three equations with four unknowns ($\theta$, $\phi$, $E_e$, $E'$; $p_e$ and $p'$ are not
independent unknowns) that cannot be solved uniquely, but we can eliminate any
two of the four unknowns by solving the equations simultaneously. If we choose
to measure the energy and direction of the scattered photon, we eliminate $E_e$ and
$\phi$. The angle $\phi$ is eliminated by first rewriting the momentum equations:


$$p_e \cos\phi = p - p' \cos\theta \quad \text{and} \quad p_e \sin\phi = p' \sin\theta \tag{3.45}$$

Squaring these equations and adding the results, we obtain

$$p_e^2 = p^2 - 2pp' \cos\theta + p'^2 \tag{3.46}$$


The relativistic relationship between energy and momentum is, according to
Eq. 2.39, $E_e^2 = c^2 p_e^2 + m_e^2 c^4$. Substituting in this equation for $E_e$ from Eq. 3.44a
and for $p_e^2$ from Eq. 3.46, we obtain


$$(E + m_e c^2 - E')^2 = c^2(p^2 - 2pp' \cos\theta + p'^2) + m_e^2 c^4 \tag{3.47}$$

and after a bit of algebra, we find

$$\frac{1}{E'} - \frac{1}{E} = \frac{1}{m_e c^2}(1 - \cos\theta) \tag{3.48}$$

In terms of wavelength, this equation can also be written as

$$\lambda' - \lambda = \frac{h}{m_e c}(1 - \cos\theta) \tag{3.49}$$


where $\lambda$ is the wavelength of the incident photon and $\lambda'$ is the wavelength of the
scattered photon. The quantity $h/m_e c$ is known as the Compton wavelength of the
electron and has a value of 0.002426 nm; however, keep in mind that it is not a
true wavelength but rather is a change of wavelength.
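
The algebra between Eq. 3.47 and Eq. 3.48 is short enough to write out. The steps below are one possible route, offered as a sketch rather than as the derivation the text had in mind; they use only $cp = E$ and $cp' = E'$ for the photon.

```latex
\begin{align*}
(E - E')^2 + 2 m_e c^2 (E - E') + m_e^2 c^4
  &= E^2 - 2EE'\cos\theta + E'^2 + m_e^2 c^4
  && \text{(expand Eq. 3.47 with } cp = E,\ cp' = E'\text{)} \\
-2EE' + 2 m_e c^2 (E - E') &= -2EE'\cos\theta
  && \text{(cancel } E^2,\ E'^2,\ m_e^2 c^4\text{)} \\
m_e c^2 (E - E') &= EE'(1 - \cos\theta) \\
\frac{1}{E'} - \frac{1}{E} &= \frac{1}{m_e c^2}(1 - \cos\theta)
  && \text{(divide by } EE' m_e c^2\text{)} \\
\frac{\lambda'}{hc} - \frac{\lambda}{hc} &= \frac{1}{m_e c^2}(1 - \cos\theta)
  && \text{(use } E = hc/\lambda,\ E' = hc/\lambda'\text{)} \\
\lambda' - \lambda &= \frac{h}{m_e c}(1 - \cos\theta)
\end{align*}
```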

Equations 3.48 and 3.49 give the change in energy or wavelength of the photon,
as a function of the scattering angle $\theta$. Because the quantity on the right-hand side
is never negative, $E'$ is always less than $E$, so that the scattered photon has less
energy than the original incident photon; the difference $E - E'$ is just the kinetic
energy given to the electron, $E_e - m_e c^2$. Similarly, $\lambda'$ is greater than $\lambda$, meaning
the scattered photon always has a longer wavelength than the incident photon; the
change in wavelength ranges from 0 at $\theta = 0^\circ$ to twice the Compton wavelength
at $\theta = 180^\circ$. Of course the descriptions in terms of energy and wavelength are
equivalent, and the choice of which to use is merely a matter of convenience.

Using $E_e = K_e + m_e c^2$, where $K_e$ is the kinetic energy of the electron, conservation
of energy (Eq. 3.44a) can also be written as $E + m_e c^2 = E' + K_e + m_e c^2$.
Solving for $K_e$, we obtain


$$K_e = E - E' \tag{3.50}$$


That is, the kinetic energy acquired by the electron is equal to the difference 
between the initial and final photon energies. 

We can also find the direction of the electron’s motion by dividing the two
momentum relationships in Equation 3.45:


$$\tan\phi = \frac{p_e \sin\phi}{p_e \cos\phi} = \frac{p' \sin\theta}{p - p' \cos\theta} = \frac{E' \sin\theta}{E - E' \cos\theta} \tag{3.51}$$


where the last result comes from using $p = E/c$ and $p' = E'/c$.


Example 3.7

X rays of wavelength 0.2400 nm are Compton-scattered,
and the scattered beam is observed at an angle of 60.0°
relative to the incident beam. Find: (a) the wavelength
of the scattered X rays, (b) the energy of the scattered
X-ray photons, (c) the kinetic energy of the scattered
electrons, and (d) the direction of travel of the scattered
electrons.

Solution
(a) $\lambda'$ can be found immediately from Eq. 3.49:

$$\lambda' = \lambda + \frac{h}{m_e c}(1 - \cos\theta) = 0.2400\ \mathrm{nm} + (0.00243\ \mathrm{nm})(1 - \cos 60^\circ) = 0.2412\ \mathrm{nm}$$

(b) The energy $E'$ can be found directly from $\lambda'$:

$$E' = \frac{hc}{\lambda'} = \frac{1240\ \mathrm{eV \cdot nm}}{0.2412\ \mathrm{nm}} = 5141\ \mathrm{eV}$$

(c) The initial photon energy $E$ is $hc/\lambda = 5167$ eV, so

$$K_e = E - E' = 5167\ \mathrm{eV} - 5141\ \mathrm{eV} = 26\ \mathrm{eV}$$

(d) From Eq. 3.51,

$$\phi = \tan^{-1}\frac{E' \sin\theta}{E - E' \cos\theta} = \tan^{-1}\frac{(5141\ \mathrm{eV})(\sin 60^\circ)}{(5167\ \mathrm{eV}) - (5141\ \mathrm{eV})(\cos 60^\circ)} = 59.7^\circ$$
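
The four parts of Example 3.7 follow directly from Eqs. 3.49, 3.50, and 3.51, so they are easy to script. The sketch below is illustrative only; it uses $hc = 1240$ eV·nm and the Compton wavelength 0.002426 nm quoted in the text, and the function name is hypothetical.

```python
import math

HC_EV_NM = 1240.0                  # hc in eV*nm, as used in the example
COMPTON_WAVELENGTH_NM = 0.002426   # h / (m_e c) in nm, from the text

def compton_scatter(wavelength_nm, theta_deg):
    """Return (scattered wavelength in nm, E' in eV, K_e in eV, phi in degrees)."""
    theta = math.radians(theta_deg)
    # Eq. 3.49: shift in wavelength depends only on the scattering angle
    scattered_nm = wavelength_nm + COMPTON_WAVELENGTH_NM * (1.0 - math.cos(theta))
    energy_in = HC_EV_NM / wavelength_nm
    energy_out = HC_EV_NM / scattered_nm
    kinetic = energy_in - energy_out                       # Eq. 3.50
    phi = math.degrees(math.atan2(energy_out * math.sin(theta),
                                  energy_in - energy_out * math.cos(theta)))  # Eq. 3.51
    return scattered_nm, energy_out, kinetic, phi

lam, e_out, k_e, phi = compton_scatter(0.2400, 60.0)
print(f"lambda' = {lam:.4f} nm, E' = {e_out:.0f} eV, K_e = {k_e:.0f} eV, phi = {phi:.1f} deg")
```

Calling the same function with a much shorter incident wavelength (a gamma ray, for instance) gives the same wavelength shift at the same angle, since Eq. 3.49 does not involve the incident wavelength.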





The first experimental demonstration of this type of scattering was done by 
Arthur Compton in 1923. A diagram of his experimental arrangement is shown in 
Figure 3.19. A beam of X rays of a single wavelength $\lambda$ is incident on a scattering
target, for which Compton used carbon. (Although no scattering target contains 
actual “free” electrons, the outer or valence electrons in many materials are very 
weakly attached to the atom and behave like nearly free electrons. The binding 
energies of these electrons in the atom are so small compared with the energies of 
the incident X-ray photons that they can be regarded as nearly “free” electrons.) A 
movable detector measured the energy of the scattered X rays at various angles $\theta$.

FIGURE 3.19 Schematic diagram of Compton-scattering apparatus. The wavelength
$\lambda'$ of the scattered X rays is measured by the detector, which can be moved
to different positions $\theta$. The wavelength difference $\lambda' - \lambda$ varies with $\theta$.

FIGURE 3.20 Compton’s original results for X-ray scattering.


Compton’s original results are illustrated in Figure 3.20. At each angle, 
two peaks appear, corresponding to scattered X-ray photons with two different 
energies or wavelengths. The wavelength of one peak does not change as the 
angle is varied; this peak corresponds to scattering that involves “inner” electrons 
of the atom, which are more tightly bound to the atom so that the photon can 
scatter with no loss of energy. The wavelength of the other peak, however, varies 
strongly with angle; as can be seen from Figure 3.21, this variation is exactly as 
the Compton formula predicts. 

Similar results can be obtained for the scattering of gamma rays, which
are higher-energy (shorter wavelength) photons emitted in various radioactive
decays. Compton also measured the variation in wavelength of scattered
gamma rays, as illustrated in Figure 3.22. The change in wavelength in the
gamma-ray measurements is identical with the change in wavelength in the X-ray
measurements, as Eq. 3.49 predicts: the change in wavelength does not depend
on the incident wavelength.

FIGURE 3.21 The scattered X-ray wavelengths $\lambda'$, from Figure 3.20,
for different scattering angles. The expected slope is $2.43 \times 10^{-12}$ m, in
agreement with the measured slope of Compton’s data points.

FIGURE 3.22 Compton’s results for gamma-ray scattering. The wavelengths
are much smaller than for X rays, but the slope is the same as
in Figure 3.21, as the Compton formula, Eq. 3.49, predicts.


3.5 OTHER PHOTON PROCESSES 


Although thermal radiation, the photoelectric effect, and Compton scattering pro- 
vided the earliest experimental evidence in support of the quantization (particlelike 
behavior) of electromagnetic radiation, there are numerous other experiments that 
can also be interpreted correctly only if we assume the existence of photons as 
discrete quanta of electromagnetic radiation. In this section we discuss some of 
these processes, which cannot be understood if we consider only the wave nature 
of electromagnetic radiation. As you study the descriptions of these processes, 
note how photons interact with atoms or electrons by delivering energy in discrete 
bundles, in contrast to the wave interpretation in which the energy can be regarded 
as arriving continuously. 


Interactions of Photons with Atoms 


The emission of electromagnetic radiation from atoms takes place in discrete 
amounts characterized by one or more photons. When an atom emits a photon 
of energy $E$, the atom loses an equivalent amount of energy. Consider an atom
at rest that has an initial energy $E_i$. The atom emits a photon of energy $E$. After
the emission, the atom is left with a final energy $E_f$, which we will take as the
energy associated with the internal structure of the atom. Because of conservation
of momentum, the final atom must have a momentum that is equal and opposite
to the momentum of the emitted photon, so the atom must also have a “recoil”
kinetic energy $K$. (Normally this kinetic energy is very small.) Conservation of
energy then gives


$$E_i = E_f + K + E \quad \text{or} \quad E = (E_i - E_f) - K \tag{3.52}$$


The energy of the emitted photon is equal to the net energy lost by the atom, 
minus a negligibly small contribution to the recoil kinetic energy of the atom. 

In the reverse process, an atom can absorb a photon of energy E. If the atom 
is initially at rest, it must again acquire a small recoil kinetic energy in order to 
conserve momentum. Now conservation of energy gives

$$E_i + E = E_f + K \quad \text{or} \quad E_f - E_i = E - K \tag{3.53}$$

The energy available to add to the atom’s internal supply of energy is the photon 
energy, less a recoil kinetic energy that is usually negligible. 
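
To see why the recoil term is usually negligible, one can estimate it for a specific case. The sketch below makes several assumptions that are not in the text: it treats the recoiling atom nonrelativistically, so that $K = p^2/2M = E^2/(2Mc^2)$ with photon momentum $p = E/c$, and it uses a hydrogen atom (rest energy about 938.8 MeV) emitting a 10.2 eV photon purely as an illustrative example.

```python
def recoil_kinetic_energy_ev(photon_energy_ev, atom_rest_energy_ev):
    """Nonrelativistic recoil energy K = E^2 / (2 M c^2), all energies in eV.

    Follows from momentum conservation: the atom's momentum equals the
    photon momentum E/c in magnitude.
    """
    return photon_energy_ev**2 / (2.0 * atom_rest_energy_ev)

# Illustrative numbers (assumed, not from the text):
PHOTON_ENERGY_EV = 10.2          # an ultraviolet photon
HYDROGEN_REST_ENERGY_EV = 938.8e6

recoil = recoil_kinetic_energy_ev(PHOTON_ENERGY_EV, HYDROGEN_REST_ENERGY_EV)
print(f"recoil kinetic energy = {recoil:.2e} eV")
print(f"fraction of photon energy = {recoil / PHOTON_ENERGY_EV:.2e}")
```

With these numbers the recoil energy is many orders of magnitude smaller than the photon energy, consistent with treating $K$ as negligible in Eqs. 3.52 and 3.53.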

Photon emission and absorption experiments are among the most important 
techniques for acquiring information about the internal structure of atoms, as we 
discuss in Chapter 6. 

Bremsstrahlung and X-ray Production


When an electric charge, such as an electron, is accelerated or decelerated, it 
radiates electromagnetic energy; according to the quantum interpretation, we 
would say that it emits photons. Suppose we have a beam of electrons, which 
has been accelerated through a potential difference $\Delta V$, so that the electrons
experience a change in potential energy of $-e\,\Delta V$ and thus acquire a kinetic energy
of $K = e\,\Delta V$ (Figure 3.23). When the electrons strike a target they are slowed
down and eventually come to rest, because they make collisions with the atoms
of the target material. In such a collision, momentum is transferred to the atom,
the electron slows down, and photons are emitted. The recoil kinetic energy of
the atom is small (because the atom is so massive) and can safely be neglected. If
the electron has a kinetic energy $K$ before the encounter and if it leaves after the
collision with a smaller kinetic energy $K'$, then the photon energy $hf = hc/\lambda$ is


$$hf = \frac{hc}{\lambda} = K - K' \tag{3.54}$$


The amount of energy lost, and therefore the energy and wavelength of the 
emitted photon, are not uniquely determined, because K is the only known energy 
in Eq. 3.54. An electron usually will make many collisions, and therefore emit 
many different photons, before it is brought to rest; the photons then will range 
all the way from very small energies (large wavelengths) corresponding to small 
energy losses, up to a maximum photon energy $hf_{\max}$ equal to $K$, corresponding
to an electron that loses all of its kinetic energy $K$ in a single encounter (that is,
when $K' = 0$). The smallest emitted wavelength $\lambda_{\min}$ is therefore determined by
the maximum possible energy loss,

$$\lambda_{\min} = \frac{hc}{K} = \frac{hc}{e\,\Delta V} \tag{3.55}$$


FIGURE 3.23 (a) Apparatus for producing bremsstrahlung. Electrons from a
cathode C are accelerated to the anode A through the potential difference $\Delta V$.
When an electron encounters a target atom of the anode, it can lose energy, with the
accompanying emission of an X-ray photon. (b) A schematic representation of the
bremsstrahlung process.


For typical accelerating voltages in the range of 10,000 V, $\lambda_{\min}$ is in the range of
a few tenths of nm, which corresponds to the X-ray region of the spectrum. This 
continuous distribution of X rays (which is very different from the discrete X-ray 
energies that are emitted in atomic transitions; more about these in Chapter 8) is 
called bremsstrahlung, which is German for braking, or decelerating, radiation. 
Some sample bremsstrahlung spectra are illustrated in Figure 3.24. 
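
Equation 3.55 is simple to evaluate for any accelerating voltage. The following is a minimal sketch using $hc = 1240$ eV·nm, the value used in this chapter's examples; the function name and the list of sample voltages are assumptions for illustration.

```python
HC_EV_NM = 1240.0  # hc in eV*nm, the value used in this chapter's examples

def minimum_wavelength_nm(accelerating_voltage_v):
    """Eq. 3.55: shortest bremsstrahlung wavelength for a given voltage.

    An electron accelerated through delta-V volts gains K = e * delta-V,
    which is numerically delta-V when expressed in electron-volts.
    """
    kinetic_energy_ev = accelerating_voltage_v
    return HC_EV_NM / kinetic_energy_ev

for voltage in (5_000, 10_000, 20_000, 50_000):   # sample voltages (assumed)
    print(f"{voltage:6d} V  ->  lambda_min = {minimum_wavelength_nm(voltage):.4f} nm")
```

Doubling the voltage halves the cutoff wavelength, which is why each spectrum in a set of bremsstrahlung curves ends at a different short-wavelength limit.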

Symbolically we can write the bremsstrahlung process as 


electron → electron + photon

This is just the reverse process of the photoelectric effect, which is

electron + photon → electron


However, neither process occurs for free electrons. In both cases there must be a 
heavy atom in the neighborhood to take care of the recoil momentum. 


Pair Production and Annihilation 


Another process that can occur when photons encounter atoms is pair production, 
in which the photon loses all its energy and in the process two particles are 
created: an electron and a positron. (A positron is a particle that is identical in 
mass to the electron but has a positive electric charge; more about antiparticles 
in Chapter 14.) Here we have an example of the creation of rest energy. The 
electron did not exist before the encounter of the photon with the atom (it was not 
an electron that was part of the atom). The photon energy hf is converted into the 
relativistic total energies $E_+$ and $E_-$ of the positron and electron:


$$hf = E_+ + E_- = (m_e c^2 + K_+) + (m_e c^2 + K_-) \tag{3.56}$$

Because $K_+$ and $K_-$ are always positive, the photon must have an energy of
at least $2m_e c^2 = 1.02$ MeV in order for this process to occur; such high-energy
photons are in the region of nuclear gamma rays. Symbolically,


photon → electron + positron


This process, like bremsstrahlung, will not occur unless there is an atom nearby 
to supply the necessary recoil momentum. The reverse process, 


electron + positron → photon

also occurs; this process is known as electron-positron annihilation and can occur
for free electrons and positrons as long as at least two photons are created. In
this process the electron and positron disappear and are replaced by two photons.
Conservation of energy requires that


$$(m_e c^2 + K_+) + (m_e c^2 + K_-) = E_1 + E_2 \tag{3.57}$$

where $E_1$ and $E_2$ are the photon energies. Usually the kinetic energies $K_+$ and $K_-$
are negligibly small, so we can assume the positron and electron to be essentially
at rest. Momentum conservation then requires the two photons to have equal and
opposite momenta and thus equal energies. The two annihilation photons have
equal energies of 0.511 MeV ($= m_e c^2$) and move in exactly opposite directions.

FIGURE 3.24 Some typical bremsstrahlung spectra. Each spectrum is
labeled with the value of the accelerating voltage $\Delta V$.

FIGURE 3.25 Apparatus for delayed choice experiment. Photons from the
laser strike the beam splitter and can then travel paths A or B. The switch
in path A can deflect the beam into a detector. If the switch is off, the
beam on path A recombines with the beam on path B to form an interference
pattern. [Source: A. Shimony, “The Reality of the Quantum World,”
Scientific American 258, 46 (January 1988)].
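
The energy bookkeeping of Eqs. 3.56 and 3.57 can be sketched in a few lines. The example assumes the electron rest energy of 0.511 MeV and $hc = 1240$ eV·nm quoted in this chapter; the function names and the 2.0 MeV sample photon are hypothetical.

```python
ELECTRON_REST_ENERGY_MEV = 0.511   # m_e c^2, as quoted in the text
HC_EV_NM = 1240.0                  # hc in eV*nm, as used in this chapter

def pair_kinetic_energy_mev(photon_energy_mev):
    """Eq. 3.56: total kinetic energy K+ + K- shared by the created pair."""
    threshold = 2.0 * ELECTRON_REST_ENERGY_MEV
    if photon_energy_mev < threshold:
        raise ValueError("photon energy is below the pair-production threshold")
    return photon_energy_mev - threshold

def annihilation_photon_wavelength_nm():
    """Wavelength of each photon from annihilation at rest (Eq. 3.57 with K+ = K- = 0)."""
    photon_energy_ev = ELECTRON_REST_ENERGY_MEV * 1.0e6
    return HC_EV_NM / photon_energy_ev

print(f"threshold = {2.0 * ELECTRON_REST_ENERGY_MEV:.3f} MeV")
print(f"kinetic energy shared for a 2.0 MeV photon = {pair_kinetic_energy_mev(2.0):.3f} MeV")
print(f"annihilation photon wavelength = {annihilation_photon_wavelength_nm():.6f} nm")
```

The annihilation wavelength comes out essentially equal to the Compton wavelength of the electron, as expected since $hc/(m_e c^2) = h/(m_e c)$.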


3.6 WHAT IS A PHOTON? 


We can describe photons by giving a few of their basic properties: 


• like an electromagnetic wave, photons move with the speed of light;
• they have zero mass and rest energy;
• they carry energy and momentum, which are related to the frequency and
  wavelength of the electromagnetic wave by $E = hf$ and $p = h/\lambda$;
• they can be created or destroyed when radiation is emitted or absorbed;
• they can have particlelike collisions with other particles such as electrons.


In this chapter we have described some experiments that favor the photon 
interpretation of electromagnetic radiation, according to which the energy of the 
radiation is concentrated in small bundles. Other experiments, such as interference 
and diffraction, favor the wave interpretation, according to which the energy of 
the radiation is spread over its entire wavefront. For example, the explanation of 
the double-slit interference experiment requires that the wavefront be divided so 
that some of its intensity can pass through each slit. A particle must choose to go 
through one slit or the other; only a wave can go through both. 

If we regard the wave and particle pictures as valid but exclusive alternatives, 
we must assume that the light emitted by a source must travel either as waves or 
as particles. How does the source know what kind of light (particles or waves) to 
emit? Suppose we place a double-slit apparatus on one side of the source and a 
photoelectric cell on the other side. Light emitted toward the double slit behaves 
like a wave and light emitted toward the photocell behaves like particles. How 
did the source know in which direction to aim the waves and in which direction 
to aim the particles? 

Perhaps nature has a sort of “secret code” in which the kind of experiment we 
are doing is signaled back to the source so that it knows whether to emit particles 
or waves. Let us repeat our dual experiment with light from a distant galaxy, 
light that has been traveling toward us for a time roughly equal to the age of 
the universe ($13 \times 10^9$ years). Surely the kind of experiment we are doing could
not be signaled back to the limits of the known universe in the time it takes us 
to remove the double-slit apparatus from the laboratory table and replace it with 
the photoelectric apparatus. Yet we find that the starlight can produce both the 
double-slit interference and also the photoelectric effect. 

Figure 3.25 shows a recent experiment that was designed to test whether this 
dual nature is an intrinsic property of light or of our apparatus. A light beam 
from a laser goes through a beam splitter, which separates the beam into two 
components (A and B). The mirrors reflect the two component beams so that they 
can recombine to form an interference pattern. In path A there is a switch that can 
deflect the beam into a detector. If the switch is off, beam A is not deflected and 
will combine with beam B to produce the interference pattern. If the switch is on,
beam A is deflected and observed in the detector, indicating that the light traveled
a definite path, as would be characteristic of a particle. To put this another way, if 
the switch is off, the light beam is observed as a wave; if it is on, the light beam 
is observed as particles. 

If light behaves like particles, the beam splitter sends it along either path A 
or path B; either path can be randomly chosen for the particle, but each particle 
can travel only one path. If light behaves like a wave, on the other hand, the beam 
splitter sends it along both paths, dividing its intensity between the two. Perhaps 
the beam splitter can somehow sense whether the switch is open or closed, so 
that it knows whether we are doing a particle-type or a wave-type experiment. 
If this were true, then the beam splitter would “know” whether to send all of 
the intensity down one path (so that we would observe a particle) or to split 
the intensity between the paths (so that we would observe a wave). However, in 
this experiment the experimenters used a very fast optical switch whose response 
time was shorter than the time it takes for light to travel through the apparatus 
to the switch. That is, the state of the switch could be changed after the light 
had already passed through the beam splitter, and so it was impossible for the 
beam splitter to “know” how the switch was set and thus whether a particle-type 
or a wave-type experiment was being done. This kind of experiment is called a 
“delayed choice” experiment, because the experimenter makes the choice of what 
kind of experiment to do after the light is already traveling on its way to the 
observation apparatus. 

In this experiment, the investigators discovered that whenever they had the 
switch off, they observed the interference pattern characteristic of waves. When 
they had the switch on, they observed particles in the detector and no interference 
pattern. That is, whenever they did a wave-type experiment they observed waves, 
and whenever they did a particle-type experiment they observed particles. The 
wave and particle natures are both present simultaneously in the light, and this dual 
nature is clearly associated with the light and is not characteristic of the apparatus. 

Many other experiments of this type have been done, and they all produce 
similar results. We are therefore trapped into an uncomfortable conclusion: Light 
is not either particles or waves; it is somehow both particles and waves, and 
only shows one or the other aspect, depending on the kind of experiment we are 
doing. A particle-type experiment shows the particle nature, while a wave-type 
experiment shows the wave nature. Our failure to classify light as either particle 
or wave is not so much a failure to understand the nature of light as it is a failure of 
our limited vocabulary (based on experiences with ordinary particles and waves) 
to describe a phenomenon that is more elegant and mysterious than either simple 
particles or waves. 


Wave-Particle Duality 


The dilemma of the dual particle-wave nature of light, which is called wave-particle
duality, cannot be resolved with a simple explanation; physicists and
philosophers have struggled with this problem ever since the quantum theory was 
introduced. The best we can do is to say that neither the wave nor the particle 
picture is wholly correct all of the time, that both are needed for a complete 
description of physical phenomena, and that in fact the two are complementary to 
one another. 

Suppose we use a photographic film to observe the double-slit interference 
pattern. The film responds to individual photons. When a single photon is absorbed 
by the film, a single grain of the photographic emulsion is darkened; a complete
picture requires a large number of grains to be darkened. 

Let us imagine for the moment that we could see individual grains of the film 
as they absorbed photons and darkened, and let us do the double-slit experiment 
with a light source that is so weak that there is a relatively long time interval 
between photons. We would see first one grain darken, then another, and so forth, 
until after a large number of photons we would see the interference pattern begin 
to emerge. Some areas of the film (the interference maxima) show evidence for 
the arrival of a large number of photons, while in other areas (the interference 
minima) few photons arrive. 

Alternatively, the wave picture of the double-slit experiment suggests that we 
could find the net electric field of the wave that strikes the screen by superimposing 
the electric fields of the portions of the incident wave fronts that pass through the 
two slits; the intensity or power in that combined wave could then be found by a 
procedure similar to Eqs. 3.7 through 3.10, and we would expect that the resultant 
intensity should show maxima and minima just like the observed double-slit 
interference pattern. 

In summary, the correct explanation of the origin and appearance of the 
interference pattern comes from the wave picture, and the correct interpretation 
of the evolution of the pattern on the film comes from the photon picture; the 
two explanations, which according to our limited vocabulary and common-sense 
experience cannot simultaneously be correct, must somehow be taken together to 
give a complete description of the properties of electromagnetic radiation. 

Keep in mind that “photon” and “wave” represent descriptions of the behavior 
of electromagnetic radiation when it encounters material objects. It is not correct 
to think of light as being “composed” of photons, just as we don’t think of light as 
being “composed” of waves. The explanation in terms of photons applies to some 
interactions of radiation with matter, while the explanation in terms of waves 
applies to other interactions. For example, when we say that an atom “emits” a 
photon, we don’t mean that there is a supply of photons stored within the atom; 
instead, we mean that the atom has given up a quantity of its internal energy to 
create an equivalent amount of energy in the form of electromagnetic radiation. 

In the case of the double-slit experiment, we might reason as follows: the 
interaction between a “source” of radiation and the electromagnetic field is 
quantized, so that we can think of the emission of radiation by the atoms of the 
source in terms of individual photons. The interaction at the opposite end of the 
experiment, the photographic film, is also quantized, and we have the similarly 
useful view of atoms absorbing radiation as individual photons. In between, the 
electromagnetic radiation propagates smoothly and continuously as a wave and 
can show wave-type behavior (interference or diffraction) when it encounters the 
double slit. 

Where the wave has large intensity, the film reveals the presence of many 
photons; where the wave has small intensity, few photons are observed. Recalling 
that the intensity of the wave is proportional to the square of its amplitude, we 
then have 


$\text{probability to observe photons} \propto |\text{electric field amplitude}|^2$


It is this expression that provides the ultimate connection between the wave 
behavior and the particle behavior, and we will see in the next two chapters that 
a similar expression connects the wave and the particle aspects of those objects, 
such as electrons, which have been previously considered to behave as classical 
particles. 
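
The proportionality between photon probability and wave intensity can be illustrated numerically. The sketch below is only an illustration, not a description of any real experiment: it assumes an ideal double slit whose relative intensity is cos²(πdy/λD), ignores the single-slit envelope, and borrows the sodium-light values (λ = 589.0 nm, d = 1.05 mm, D = 2.357 m) purely as sample numbers. The function names are hypothetical. Photon arrival positions are drawn one at a time with probability proportional to the intensity, so the fringe pattern builds up gradually from individual hits.

```python
import math
import random


def double_slit_intensity(y, wavelength, slit_separation, screen_distance):
    """Relative intensity of an ideal double slit: cos^2(pi * d * y / (lambda * D))."""
    phase = math.pi * slit_separation * y / (wavelength * screen_distance)
    return math.cos(phase) ** 2


def sample_photon_hits(n_photons, wavelength, slit_separation, screen_distance,
                       y_max, seed=0):
    """Draw arrival positions with probability proportional to the intensity.

    Rejection sampling: propose y uniformly in [-y_max, y_max] and accept it
    with probability equal to the relative intensity (which never exceeds 1).
    """
    rng = random.Random(seed)
    hits = []
    while len(hits) < n_photons:
        y = rng.uniform(-y_max, y_max)
        if rng.random() < double_slit_intensity(
                y, wavelength, slit_separation, screen_distance):
            hits.append(y)
    return hits


if __name__ == "__main__":
    wavelength = 589.0e-9   # m (assumed sample value)
    d = 1.05e-3             # slit separation, m (assumed sample value)
    D = 2.357               # slit-to-screen distance, m (assumed sample value)
    spacing = wavelength * D / d          # distance between adjacent maxima
    hits = sample_photon_hits(2000, wavelength, d, D, y_max=2 * spacing)

    # Crude text histogram of the accumulated hits across the screen.
    n_bins = 16
    bins = [0] * n_bins
    for y in hits:
        index = min(int((y + 2 * spacing) / (4 * spacing) * n_bins), n_bins - 1)
        bins[index] += 1
    for count in bins:
        print("#" * (count // 10))
```

With only a few photons the hits look random; with many, the bins near the intensity maxima fill up while those near the minima stay nearly empty.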
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Suppose you wanted to study the diffraction of photons 
by nuclei. What energy of photons would you choose? 
Why? 

How is the wave nature of light unable to account for the 
observed properties of the photoelectric effect? 

In the photoelectric effect, why do some electrons have 
kinetic energies smaller than $K_{\max}$? 

Why doesn’t the photoelectric effect work for free electrons? 

What does the work function tell us about the properties of a 
metal? Of the metals listed in Table 3.1, which has the least 
tightly bound electrons? Which has the most tightly bound? 

Electric current is charge flowing per unit time. If we increase 
the kinetic energy of the photoelectrons (by increasing 
the energy of the incident photons), shouldn’t the current 
increase, because the charge flows more rapidly? Why 
doesn’t it? 

What might be the effects on a photoelectric effect exper- 
iment if we were to double the frequency of the incident 
light? If we were to double the wavelength? If we were to 
double the intensity? 




Quantity: formula (Section) 
Double-slit maxima: $y_n = n\lambda D/d$, $n = 0, 1, 2, 3, \ldots$ (3.1) 
Bragg’s law for X-ray diffraction: $2d\sin\theta = n\lambda$, $n = 1, 2, 3, \ldots$ (3.1) 
Energy of photon: $E = hf = hc/\lambda$ (3.2) 
Maximum kinetic energy of photoelectrons: $K_{\max} = eV_s = hf - \phi$ (3.2) 
Cutoff wavelength: $\lambda_c = hc/\phi$ (3.2) 
Wien’s displacement law: $\lambda_{\max} T = 2.8978 \times 10^{-3}$ m·K (3.3) 
Rayleigh-Jeans formula: $I(\lambda) = \frac{2\pi c}{\lambda^4} kT$ (3.3) 
Planck’s blackbody distribution: $I(\lambda) = \frac{2\pi h c^2}{\lambda^5} \frac{1}{e^{hc/\lambda kT} - 1}$ (3.3) 
Compton scattering: $\frac{1}{E'} - \frac{1}{E} = \frac{1}{m_e c^2}(1 - \cos\theta)$, $\lambda' - \lambda = \frac{h}{m_e c}(1 - \cos\theta)$ (3.4) 
Bremsstrahlung: $\lambda_{\min} = hc/K = hc/e\Delta V$ (3.5) 
Pair production: $hf = E_+ + E_- = (m_e c^2 + K_+) + (m_e c^2 + K_-)$ (3.5) 
Electron-positron annihilation: $(m_e c^2 + K_+) + (m_e c^2 + K_-) = E_1 + E_2$ (3.5) 

The diameter of an atomic nucleus is about $10 \times 10^{-15}$ m. 

In the photoelectric effect, how can a photon moving in one 
direction eject an electron moving in a different direction? 
What happens to conservation of momentum? 

In Figure 3.10, why does the photoelectric current rise 
slowly to its saturation value instead of rapidly, when the 
potential difference is greater than $V_s$? What does this figure 
indicate about the experimental difficulties that might arise 
from trying to determine $V_s$ in this way? 

Suppose that the frequency of a certain light source is just 
above the cutoff frequency of the emitter, so that the pho- 
toelectric effect occurs. To an observer in relative motion, 
the frequency might be Doppler shifted to a lower value 
that is below the cutoff frequency. Would this moving 
observer conclude that the photoelectric effect does not 
occur? Explain. 

Why do cavities that form in a wood fire seem to glow 
brighter than the burning wood itself? Is the temperature 
in such cavities hotter than the surface temperature of the 
exposed burning wood? 

What are the fields of classical physics on which the classical 
theory of blackbody radiation is based? Why don’t 
we believe that the “ultraviolet catastrophe” suggests that 
something is wrong with one of those classical theories? 

In what region of the electromagnetic spectrum do room- 
temperature objects radiate? What problems would we have 
if our eyes were sensitive in that region? 

How does the total intensity of thermal radiation vary when 
the temperature of an object is doubled? 

Compton-scattered photons of wavelength λ′ are observed 
at 90°. In terms of λ′, what is the scattered wavelength 
observed at 180°? 

The Compton-scattering formula suggests that objects 
viewed from different angles should show scattered light 
of different wavelengths. Why don’t we observe a change in 
color of objects as we change the viewing angle? 

You have a monoenergetic source of X rays of energy 84 
keV, but for an experiment you need 70 keV X rays. How 
would you convert the X-ray energy from 84 to 70 keV? 


3.1 Review of Electromagnetic Waves 




A double-slit experiment is performed with sodium light 
(λ = 589.0 nm). The slits are separated by 1.05 mm, and the 
screen is 2.357 m from the slits. Find the separation between 
adjacent maxima on the screen. 

In Example 3.1, what angle of incidence will produce the 
second-order Bragg peak? 

Monochromatic X rays are incident on a crystal in the geom- 
etry of Figure 3.5. The first-order Bragg peak is observed 
when the angle of incidence is 34.0°. The crystal spacing 
is known to be 0.347nm. (a) What is the wavelength of 
the X rays? (b) Now consider a set of crystal planes that 
makes an angle of 45° with the surface of the crystal (as 
in Figure 3.6). For X rays of the same wavelength, find the 
angle of incidence measured from the surface of the crystal 
that produces the first-order Bragg peak. At what angle from 
the surface does the emerging beam appear in this case? 

A certain device for analyzing electromagnetic radiation is 
based on the Bragg scattering of the radiation from a crystal. 
For radiation of wavelength 0.149 nm, the first-order Bragg 
peak appears centered at an angle of 15.15°. The aperture of 
the analyzer passes radiation in the angular range of 0.015°. 
What is the corresponding range of wavelengths passing 
through the analyzer? 


3.2 The Photoelectric Effect 




Find the momentum of (a) a 10.0-MeV gamma ray; (b) a 
25-keV X ray; (c) a 1.0-μm infrared photon; (d) a 150-MHz 
radio-wave photon. Express the momentum in kg·m/s and 
eV/c. 

Radio waves have a frequency of the order of 1 to 100 MHz. 
What is the range of energies of these photons? Our bodies 
are continuously bombarded by these photons. Why are they 
not dangerous to us? 

TV sets with picture tubes can be significant emitters of 
X rays. What is the origin of these X rays? Estimate their 
wavelengths. 

The X-ray peaks of Figure 3.20 are not sharp but are spread 
over a range of wavelengths. What reasons might account 
for that spreading? 

A beam of photons passes through a block of matter. What 
are the three ways discussed in this chapter that the photons 
can lose energy in interacting with the material? 

Of the photon processes discussed in this chapter (photoelectric 
effect, thermal radiation, Compton scattering, 
bremsstrahlung, pair production, electron-positron annihilation), 
which conserve momentum? Energy? Mass? Number 
of photons? Number of electrons? Number of electrons 
minus number of positrons? 

(a) What is the wavelength of an X-ray photon of energy 
10.0 keV? (b) What is the wavelength of a gamma-ray photon 
of energy 1.00 MeV? (c) What is the range of energies 
of photons of visible light with wavelengths 350 to 700 nm? 

What is the cutoff wavelength for the photoelectric effect 
using an aluminum surface? 

A metal surface has a photoelectric cutoff wavelength 
of 325.6nm. It is illuminated with light of wavelength 
259.8 nm. What is the stopping potential? 

When light of wavelength λ illuminates a copper surface, 
the stopping potential is V. In terms of V, what will be 
the stopping potential if the same wavelength is used to 
illuminate a sodium surface? 

The cutoff wavelength for the photoelectric effect in a certain 
metal is 254 nm. (a) What is the work function for 
that metal? (b) Will the photoelectric effect be observed for 
λ > 254 nm or for λ < 254 nm? 

A surface of zinc is illuminated and photoelectrons are 
observed. (a) What is the largest wavelength that will cause 
photoelectrons to be emitted? (b) What is the stopping 
potential when light of wavelength 220.0 nm is used? 


3.3 Blackbody Radiation 




(a) Show that in the classical result for the energy distribution 
of the cavity wall oscillators (Eq. 3.32), the total number 
of oscillators at all energies is N. (b) Show that $E_{\rm av} = kT$ 
for the classical oscillators. 

(a) Writing the discrete Maxwell-Boltzmann distribution 
for Planck’s cavity wall oscillators as $N_n = A e^{-E_n/kT}$ 
(where A is a constant to be determined), show that the 
condition $\sum_{n=0}^{\infty} N_n = N$ gives $A = N(1 - e^{-\varepsilon/kT})$ as in Eq. 
3.38. [Hint: Use $\sum_{n=0}^{\infty} e^{nx} = (1 - e^x)^{-1}$.] (b) By taking the 
derivative with respect to x of the equation given in the 
hint, show that $\sum_{n=0}^{\infty} n e^{nx} = e^x/(1 - e^x)^2$. (c) Use this result 
to derive Eq. 3.40 from Eq. 3.39. (d) Show that $E_{\rm av} \cong kT$ at 
large λ and $E_{\rm av} \to 0$ for small λ. 

By differentiating Eq. 3.41 show that $I(\lambda)$ has its maximum 
as expected according to Wien’s displacement law, Eq. 3.27. 

Integrate Eq. 3.41 to obtain Eq. 3.26. Use the definite integral 
$\int_0^{\infty} x^3\,dx/(e^x - 1) = \pi^4/15$ to obtain Eq. 3.42 relating 
the Stefan-Boltzmann constant to Planck’s constant. 

Use the numerical value of the Stefan-Boltzmann constant to 
find the numerical value of Planck’s constant from Eq. 3.42. 

The surface of the Sun has a temperature of about 6000 K. At 
what wavelength does the Sun emit its peak intensity? How 
does this compare with the peak sensitivity of the human eye? 

The universe is filled with thermal radiation, which has a 
blackbody spectrum at an effective temperature of 2.7 K (see 
Chapter 15). What is the peak wavelength of this radiation? 
What is the energy (in eV) of quanta at the peak wavelength? 
In what region of the electromagnetic spectrum is this peak 
wavelength? 

(a) Assuming the human body (skin temperature 34°C) 
to behave like an ideal thermal radiator, find the wave- 
length where the intensity from the body is a maximum. 
In what region of the electromagnetic spectrum is radiation 
with this wavelength? (b) Making whatever (reasonable) 
assumptions you may need, estimate the power radiated by 
a typical person isolated from the surroundings. (c) Estimate 
the radiation power absorbed by a person in a room in which 
the temperature is 20°C. 

A cavity is maintained at a temperature of 1650 K. At 
what rate does energy escape from the interior of the cavity 
through a hole in its wall of diameter 1.00 mm? 

An analyzer for thermal radiation is set to accept wave- 
lengths in an interval of 1.55 nm. What is the intensity of the 
radiation in that interval at a wavelength of 875 nm emitted 
from a glowing object whose temperature is 1675 K? 

(a) Assuming the Sun to radiate like an ideal thermal source 
at a temperature of 6000 K, what is the intensity of the 
solar radiation emitted in the range 550.0 nm to 552.0 nm? 
(b) What fraction of the total solar radiation does this 
represent? 


3.4 The Compton Effect 




Show how Eq. 3.48 follows from Eq. 3.47. 

Incident photons of energy 10.39 keV are Compton scattered, 
and the scattered beam is observed at 45.00° relative 
to the incident beam. (a) What is the energy of the scattered 
photons at that angle? (b) What is the kinetic energy of the 
scattered electrons? 

X-ray photons of wavelength 0.02480 nm are incident on a 
target and the Compton-scattered photons are observed at 
90.0°. (a) What is the wavelength of the scattered photons? 
(b) What is the momentum of the incident photons? Of the 
scattered photons? (c) What is the kinetic energy of the 
scattered electrons? (d) What is the momentum (magnitude 
and direction) of the scattered electrons? 

High-energy gamma rays can reach a radiation detector 
by Compton scattering from the surroundings, as shown 
in Figure 3.26. This effect is known as back-scattering. 
Show that, when $E \gg mc^2$, the back-scattered photon has 
an energy of approximately 0.25 MeV, independent of the 
energy of the original photon, when the scattering angle is 
nearly 180°. 


FIGURE 3.26 Problem 27. 


Gamma rays of energy 0.662 MeV are Compton scattered. 
(a) What is the energy of the scattered photon observed at a 
scattering angle of 60.0°? (b) What is the kinetic energy of 
the scattered electrons? 


3.5 Other Photon Processes 




Suppose an atom of iron at rest emits an X-ray photon 
of energy 6.4 keV. Calculate the “recoil” momentum and 
kinetic energy of the atom. (Hint: Do you expect to need 
classical or relativistic kinetic energy for the atom? Is the 
kinetic energy likely to be much smaller than the atom’s rest 
energy?) 

What is the minimum X-ray wavelength produced in 
bremsstrahlung by electrons that have been accelerated 
through $2.50 \times 10^4$ V? 

An atom absorbs a photon of wavelength 375 nm and imme- 
diately emits another photon of wavelength 580 nm. What 
is the net energy absorbed by the atom in this process? 


General Problems 




A certain green light bulb emits at a single wavelength of 
550 nm. It consumes 55 W of electrical power and is 75% 
efficient in converting electrical energy into light. (a) How 
many photons does the bulb emit in one hour? (b) Assuming 
the emitted photons to be distributed uniformly in space, 
how many photons per second strike a 10 cm by 10 cm paper 
held facing the bulb at a distance of 1.0 m? 

When sodium metal is illuminated with light of wavelength 
$4.20 \times 10^2$ nm, the stopping potential is found to be 0.65 V; 
when the wavelength is changed to $3.10 \times 10^2$ nm, the 
stopping potential is 1.69 V. Using only these data and the 
values of the speed of light and the electronic charge, find the 
work function of sodium and a value of Planck’s constant. 

A photon of wavelength 192 nm strikes an aluminum surface 
along a line perpendicular to the surface and releases 
a photoelectron traveling in the opposite direction. Assume 
the recoil momentum is taken up by a single aluminum 
atom on the surface. Calculate the recoil kinetic energy of 
the atom. Would this recoil energy significantly affect the 
kinetic energy of the photoelectron? 

A certain cavity has a temperature of 1150 K. (a) At what 
wavelength will the intensity of the radiation inside the cavity 
have its maximum value? (b) As a fraction of the maximum 
intensity, what is the intensity at twice the wavelength found 
in part (a)? 

In Compton scattering, calculate the maximum kinetic 
energy given to the scattered electron for a given photon 
energy. 

The COBE satellite was launched in 1989 to study the 
cosmic background radiation and measure its temperature. 
By measuring at many different wavelengths, researchers 
were able to show that the background radiation exactly 
followed the spectral distribution expected for a blackbody. 
At a wavelength of 0.133 cm, the radiant intensity is 
$1.440 \times 10^{-7}$ W/m² in a wavelength interval of 0.00833 cm. 
What is the temperature of the radiation that would be 
deduced from these data? 

The WMAP satellite launched in 2001 studied the cosmic 
microwave background radiation and was able to chart small 
fluctuations in the temperature of different regions of the 
background radiation. These fluctuations in temperature 
correspond to regions of large and small density in the early 
universe. The satellite was able to measure differences in 
temperature of $2 \times 10^{-5}$ K at a temperature of 2.7250 K. At 
the peak wavelength, what is the difference in the radiation 
intensity per unit wavelength interval between the “hot” and 
“cold” regions of the background radiation? 

You have been hired as an engineer on a NASA project 
to design a microwave spectrometer for an orbital mission 
to measure the cosmic background radiation, which 
has a blackbody spectrum with an effective temperature 
of 2.725 K. (a) The spectrometer is to scan the sky 
between wavelengths of 0.50 mm and 5.0 mm, and at each 
wavelength it accepts radiation in a wavelength range of 
$3.0 \times 10^{-4}$ mm. What maximum and minimum radiation 
intensity do you expect to find in this region? (b) The 
photon detector in the spectrometer is in the form of a 
disk of diameter 0.86 cm. How many photons per second 
will the spectrometer record at its maximum and minimum 
intensities? 

A photon of wavelength 7.52 pm scatters from a free elec- 
tron at rest. After the interaction, the electron is observed to 
be moving in the direction of the original photon. Find the 
momentum of the electron. 

A hydrogen atom is moving at a speed of 125.0 m/s. It 
absorbs a photon of wavelength 97 nm that is moving in the 
opposite direction. By how much does the speed of the atom 
change as a result of absorbing the photon? 

Before a positron and an electron annihilate, they form a sort 
of “atom” in which each orbits about their common center 
of mass with identical speeds. As a result of this motion, the 
photons emitted in the annihilation show a small Doppler 
shift. In one experiment, the Doppler shift in energy of the 
photons was observed to be 2.41 keV. (a) What would be 
the speed of the electron or positron before the annihilation 
to produce this Doppler shift? (b) The positrons form these 
atom-like structures with the nearly “free” electrons in a 
solid. Assuming the positron and electron must have about 
the same speed to form this structure, find the kinetic energy 
of the electron. This technique, called “Doppler broaden- 
ing,” is an important method for learning about the energies 
of electrons in materials. 

Prove that it is not possible to conserve both momentum 
and total relativistic energy in the following situation: A 
free electron moving at velocity V emits a photon and then 
moves at a slower velocity V’. 

A photon of energy E interacts with an electron at rest 
and undergoes pair production, producing a positive electron 
(positron) and an electron (in addition to the original 
electron): 


$\text{photon} + e^- \rightarrow e^+ + e^- + e^-$ 


The two electrons and the positron move off with iden- 
tical momenta in the direction of the initial photon. Find 
the kinetic energy of the three final particles and find the 
energy E of the photon. (Hint: Conserve momentum and 
total relativistic energy.) 





THE WAVELIKE PROPERTIES OF 
PARTICLES 

In classical physics, the laws describing the behavior of waves and particles 
are fundamentally different. Projectiles obey particle-type laws, such as 
Newtonian mechanics. Waves undergo interference and diffraction, which cannot 
be explained by the Newtonian mechanics associated with particles. The energy 
carried by a particle is confined to a small region of space; a wave, on the other 
hand, distributes its energy throughout space in its wavefronts. In describing the 
behavior of a particle we often want to specify its location, but this is not so easy 
to do for a wave. How would you describe the exact location of a sound wave or 
a water wave? 

In contrast to this clear distinction found in classical physics, quantum physics 
requires that particles sometimes obey the rules that we have previously established 
for waves, and we shall use some of the language associated with waves to describe 
particles. The system of mechanics associated with quantum systems is sometimes 
called “wave mechanics” because it deals with the wavelike behavior of particles. 
In this chapter we discuss the experimental evidence in support of this wavelike 
behavior for particles such as electrons. 

As you study this chapter, notice the frequent references to such terms as the 
probability of the outcome of a measurement, the average of many repetitions 
of a measurement, and the statistical behavior of a system. These terms are 
fundamental to quantum mechanics, and you cannot begin to understand quantum 
behavior until you feel comfortable with discarding such classical notions as fixed 
trajectories and certainty of outcome, while substituting the quantum mechanical 
notions of probability and statistically distributed outcomes. 


4.1 DE BROGLIE’S HYPOTHESIS 


Progress in physics often can be characterized by long periods of experimental 
and theoretical drudgery punctuated occasionally by flashes of insight that cause 
profound changes in the way we view the universe. Frequently the more profound 
the insight and the bolder the initial step, the simpler it seems in historical 
perspective, and the more likely we are to sit back and wonder, “Why didn’t 
I think of that?” Einstein’s special theory of relativity is one example of such 
insight; the hypothesis of the Frenchman Louis de Broglie is another.* 

In the previous chapter we discussed the double-slit experiment (which can be 
understood only if light behaves as a wave) and the photoelectric and Compton 
effects (which can be understood only if light behaves as a particle). Is this dual 
particle-wave nature a property only of light or of material objects as well? In 
a bold and daring hypothesis in his 1924 doctoral dissertation, de Broglie chose 
the latter alternative. Examining Eq. 3.20, E = hf, and Eq. 3.22, p = h/λ, we find 
some difficulty in applying the first equation in the case of particles, for we cannot 
be sure whether E should be the kinetic energy, total energy, or total relativistic 
energy (all, of course, are identical for light). No such difficulties arise from the 
second relationship. De Broglie suggested, lacking any experimental evidence in 
support of his hypothesis, that associated with any material particle moving with 
momentum p there is a wave of wavelength λ, related to p according to 


$\lambda = \frac{h}{p}$ (4.1) 


where h is Planck’s constant. The wavelength λ of a particle computed according 
to Eq. 4.1 is called its de Broglie wavelength. 

*De Broglie’s name should be pronounced “deh-BROY” or “deh-BROY-eh,” but it is often said as 
“deh-BROH-lee.” 

Louis de Broglie (1892-1987, France). A member of an aristocratic 
family, his work contributed substantially to the early development of the 
quantum theory. 


Example 4.1 

Compute the de Broglie wavelength of the following: (a) A 
1000-kg automobile traveling at 100 m/s (about 200 mi/h). 
(b) A 10-g bullet traveling at 500 m/s. (c) A smoke particle 
of mass $10^{-9}$ g moving at 1 cm/s. (d) An electron with 
a kinetic energy of 1 eV. (e) An electron with a kinetic 
energy of 100 MeV. 


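Solution 
(a) Using the classical relation between velocity and 
momentum, 

$\lambda = \frac{h}{p} = \frac{h}{mv} = \frac{6.6 \times 10^{-34}\ \text{J}\cdot\text{s}}{(10^{3}\ \text{kg})(100\ \text{m/s})} = 6.6 \times 10^{-39}\ \text{m}$ 

(b) As in part (a), 

$\lambda = \frac{h}{mv} = \frac{6.6 \times 10^{-34}\ \text{J}\cdot\text{s}}{(10^{-2}\ \text{kg})(500\ \text{m/s})} = 1.3 \times 10^{-34}\ \text{m}$ 

(c) 

$\lambda = \frac{h}{mv} = \frac{6.6 \times 10^{-34}\ \text{J}\cdot\text{s}}{(10^{-12}\ \text{kg})(10^{-2}\ \text{m/s})} = 6.6 \times 10^{-20}\ \text{m}$ 

(d) The rest energy ($mc^2$) of an electron is $5.1 \times 10^5$ eV. 
Because the kinetic energy (1 eV) is much less than the rest 
energy, we can use nonrelativistic kinematics. 

$p = \sqrt{2mK} = \sqrt{2(9.1 \times 10^{-31}\ \text{kg})(1\ \text{eV})(1.6 \times 10^{-19}\ \text{J/eV})} = 5.4 \times 10^{-25}\ \text{kg}\cdot\text{m/s}$ 

Then, 

$\lambda = \frac{h}{p} = \frac{6.6 \times 10^{-34}\ \text{J}\cdot\text{s}}{5.4 \times 10^{-25}\ \text{kg}\cdot\text{m/s}} = 1.2 \times 10^{-9}\ \text{m} = 1.2\ \text{nm}$ 

We can also find this solution in the following way, using 
$p = \sqrt{2mK}$ and $hc = 1240$ eV·nm. 

$cp = c\sqrt{2mK} = \sqrt{2(mc^2)K} = \sqrt{2(5.1 \times 10^{5}\ \text{eV})(1\ \text{eV})} = 1.0 \times 10^{3}\ \text{eV}$ 

$\lambda = \frac{h}{p} = \frac{hc}{pc} = \frac{1240\ \text{eV}\cdot\text{nm}}{1.0 \times 10^{3}\ \text{eV}} = 1.2\ \text{nm}$ 

This method may seem artificial at first, but with practice 
it becomes quite useful, especially because energies 
are usually given in electron-volts in atomic and nuclear 
physics. 

(e) In this case, the kinetic energy is much greater than the 
rest energy, and so we are in the extreme relativistic realm, 
where $K \cong E \cong pc$, as in Eq. 2.40. The wavelength is 

$\lambda = \frac{hc}{pc} = \frac{1240\ \text{MeV}\cdot\text{fm}}{100\ \text{MeV}} = 12\ \text{fm}$ 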








Note that the wavelengths computed in parts (a), (b), and (c) are far too small to be 
observed in the laboratory. Only in the last two cases, in which the wavelength is 
of the same order as atomic or nuclear sizes, do we have any chance of observing 
the wavelength. Because of the smallness of h, only for particles of atomic or 
nuclear size will the wave behavior be observable. 
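
The five cases of Example 4.1 can be checked numerically. The following is a minimal sketch, not part of the original example: the function names are hypothetical, and for the electrons it uses the general relativistic relation (pc)² = K² + 2K(mc²), which reduces to the nonrelativistic and extreme relativistic limits used in parts (d) and (e). The constants are rounded in the same way as in the example.

```python
import math

H = 6.626e-34                     # Planck's constant in J*s
HC_EV_NM = 1240.0                 # h*c in eV*nm (rounded)
ELECTRON_REST_ENERGY_EV = 5.11e5  # electron rest energy mc^2 in eV


def wavelength_from_mass_speed(mass_kg, speed_m_s):
    """Nonrelativistic de Broglie wavelength lambda = h / (m v), in meters."""
    return H / (mass_kg * speed_m_s)


def wavelength_from_kinetic_energy(kinetic_ev, rest_energy_ev):
    """De Broglie wavelength in nm from kinetic energy in eV.

    Uses (pc)^2 = K^2 + 2 K mc^2, valid at any speed, then lambda = hc / pc.
    """
    pc = math.sqrt(kinetic_ev ** 2 + 2.0 * kinetic_ev * rest_energy_ev)
    return HC_EV_NM / pc


if __name__ == "__main__":
    print("(a) automobile:", wavelength_from_mass_speed(1000.0, 100.0), "m")
    print("(b) bullet:", wavelength_from_mass_speed(10e-3, 500.0), "m")
    print("(c) smoke particle:", wavelength_from_mass_speed(1e-12, 1e-2), "m")
    print("(d) 1-eV electron:",
          wavelength_from_kinetic_energy(1.0, ELECTRON_REST_ENERGY_EV), "nm")
    # 1 nm = 1e6 fm
    print("(e) 100-MeV electron:",
          wavelength_from_kinetic_energy(100e6, ELECTRON_REST_ENERGY_EV) * 1e6,
          "fm")
```

The printed values should agree with the worked answers to within the rounding of the constants.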

Two questions immediately follow. First, just what sort of wave is it that has 
this de Broglie wavelength? That is, what does the amplitude of the de Broglie 
wave measure? We’ll discuss the answer to this question later in this chapter. 
For now, we assume that, associated with the particle as it moves, there is a de 
Broglie wave of wavelength λ, which shows itself when a wave-type experiment 
(such as diffraction) is performed on it. The outcome of the wave-type experiment 
depends on this wavelength. The de Broglie wavelength, which characterizes the 
wave-type behavior of particles, is central to the quantum theory. 

FIGURE 4.1 Light waves (represented as plane wave fronts) are incident 
on a narrow slit of width a. Diffraction causes the waves to spread 
after passing through the slit, and the intensity varies along the screen. The 
photograph shows the resulting intensity pattern. 

The second question then occurs: Why was this wavelength not directly 
observed before de Broglie’s time? As parts (a), (b), and (c) of Example 4.1 
showed, for ordinary objects the de Broglie wavelength is very small. Suppose we 
tried to demonstrate the wave nature of these objects through a double-slit type 
of experiment. Recall from Eq. 3.16 that the spacing between adjacent fringes 
in a double-slit experiment is Δy = λD/d. Putting in reasonable values for the 
slit separation d and slit-to-screen distance D, you will find that there is no 
achievable experimental configuration that can produce an observable separation 
of the fringes (see Problem 9). There is no experiment that can be done to 
reveal the wave nature of macroscopic (laboratory-sized) objects. Experimental 
verification of de Broglie’s hypothesis comes only from experiments with objects 
on the atomic scale, which are discussed in the next section. 
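
To make the scale of the difficulty concrete, the fringe spacing Δy = λD/d can be evaluated for a macroscopic object. This is a rough sketch under assumed values: the slit separation of 1 μm and the slit-to-screen distance of 1 m are illustrative choices, not values from the text, and the bullet wavelength is the rounded result of Example 4.1(b). The function name is hypothetical.

```python
def fringe_spacing(wavelength_m, slit_to_screen_m, slit_separation_m):
    """Spacing between adjacent double-slit maxima, delta_y = lambda * D / d."""
    return wavelength_m * slit_to_screen_m / slit_separation_m


if __name__ == "__main__":
    # Visible light for comparison (sodium light, d = 1.05 mm, D = 2.357 m).
    light = fringe_spacing(589.0e-9, 2.357, 1.05e-3)

    # Bullet of Example 4.1(b): lambda is about 1.3e-34 m.
    # Assumed geometry: d = 1 micrometer, D = 1 m (illustrative only).
    bullet = fringe_spacing(1.3e-34, 1.0, 1.0e-6)

    print("light fringe spacing (m): ", light)
    print("bullet fringe spacing (m):", bullet)
```

Even with this very fine assumed slit separation, the bullet's fringe spacing comes out many orders of magnitude smaller than a nuclear diameter (about 10⁻¹⁴ m), while the light fringes are of millimeter size.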


4.2 EXPERIMENTAL EVIDENCE FOR DE BROGLIE WAVES 





The indications of wave behavior come mostly from interference and diffraction 
experiments. Double-slit interference, which was reviewed in Section 3.1, is 
perhaps the most familiar type of interference experiment, but the experimental 
difficulties of constructing double slits to do interference experiments with beams 
of atomic or subatomic particles were not solved until long after the time of de 
Broglie’s hypothesis. We discuss these experiments later in this section. First 
we'll discuss diffraction experiments with electrons. 


Particle Diffraction Experiments 


Diffraction of light waves is discussed in most introductory physics texts and is 
illustrated in Figure 4.1 for light diffracted by a single slit. For light of wavelength 
λ incident on a slit of width a, the diffraction minima are located at angles given by 


$a \sin\theta = n\lambda \qquad n = 1, 2, 3, \ldots$ (4.2) 


on either side of the central maximum. Note that most of the light intensity falls 
in the central maximum. 
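
As a quick illustration of Eq. 4.2, the sketch below lists the angles of the diffraction minima for a given wavelength and slit width. The function name and the sample values (500-nm light, a 2.2-μm slit) are assumptions chosen only for the example; orders for which nλ/a exceeds 1 do not exist and are skipped.

```python
import math


def single_slit_minima_deg(wavelength, slit_width, max_order=10):
    """Angles (degrees) of the single-slit minima from a*sin(theta) = n*lambda."""
    angles = []
    for n in range(1, max_order + 1):
        sin_theta = n * wavelength / slit_width
        if sin_theta > 1.0:
            break  # no physical angle satisfies the condition for this order
        angles.append(math.degrees(math.asin(sin_theta)))
    return angles


if __name__ == "__main__":
    for order, angle in enumerate(single_slit_minima_deg(500e-9, 2.2e-6), start=1):
        print(f"n = {order}: theta = {angle:.2f} degrees")
```

The number of minima is limited by the ratio a/λ: a slit only a few wavelengths wide shows only a few minima on either side of the central maximum.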

The experiments that first verified de Broglie’s hypothesis involve electron 
diffraction, not through an artificially constructed single slit (as for the diffraction 
pattern in Figure 4.1) but instead through the atoms of a crystal. The outcomes 
of these experiments resemble those of the similar X-ray diffraction experiments 
illustrated in Section 3.1. 

In an electron diffraction experiment, a beam of electrons is accelerated from 
rest through a potential difference ΔV, acquiring a nonrelativistic kinetic energy 
K = eΔV and a momentum $p = \sqrt{2mK}$. Wave mechanics would describe 
the beam of electrons as a wave of wavelength λ = h/p. The beam strikes a 
crystal, and the scattered beam is photographed (Figure 4.2). The similarity 
between electron diffraction patterns (Figure 4.2) and X-ray diffraction patterns 
(Figure 3.7) strongly suggests that the electrons are behaving as waves. 

The “rings” produced in X-ray diffraction of polycrystalline materials 
(Figure 3.8b) are also produced in electron diffraction, as shown in Figure 4.3, 
again providing strong evidence for the similarity in the wave behavior of 
electrons and X rays. Experiments of the type illustrated in Figure 4.3 were 
first done in 1927 by G. P. Thomson, who shared the 1937 Nobel Prize for this 
work. (Thomson’s father, J. J. Thomson, received the 1906 Nobel Prize for his 
discovery of the electron and measurement of its charge-to-mass ratio. Thus 
it can be said that Thomson, the father, discovered the particle nature of the 
electron, while Thomson, the son, discovered its wave nature.) 

An electron diffraction experiment gave the first experimental confirmation 
of the wave nature of electrons (and the quantitative confirmation of the de 
Broglie relationship λ = h/p) soon after de Broglie’s original hypothesis. In 
1926, at the Bell Telephone Laboratories, Clinton Davisson and Lester Germer 
were investigating the reflection of electron beams from the surface of nickel 
crystals. A schematic view of their apparatus is shown in Figure 4.4. A beam of 
electrons from a heated filament is accelerated through a potential difference ΔV. 
After passing through a small aperture, the beam strikes a single crystal of nickel. 
Electrons are scattered in all directions by the atoms of the crystal, some of them 
striking a detector, which can be moved to any angle φ relative to the incident 
beam and which measures the intensity of the electron beam scattered at that angle. 

Figure 4.5 shows the results of one of the experiments of Davisson and Germer. 
When the accelerating voltage is set at 54 V, there is an intense reflection of the 
beam at the angle φ = 50°. Let’s see how these results give confirmation of the 
de Broglie wavelength. 


+V 


Electron [~® Detector 


beam 





Crystal 





FIGURE 4.4 Apparatus used by 
Davisson and Germer to study 
electron diffraction. Electrons 
leave the filament F and are accel- 
erated by the voltage V. The beam 
strikes a crystal and the scattered 
beam is detected at an angle @ 
relative to the incident beam. The 
detector can be moved in the range 
0 to 90°. 


FIGURE 4.3 Electron diffraction of 
polycrystalline beryllium. Note the sim- 
ilarity between this pattern and the 
pattern for X-ray diffraction of a poly- 
crystalline material (Figure 3.85). 
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Electron beam 





FIGURE 4.2 (Top) Electron diffrac- 
tion apparatus. (Bottom) Electron 
diffraction pattern. Each bright dot is 
a region of constructive interference, 
as in the X-ray diffraction patterns of 
Figure 3.7. The target is a crystal of 
Ti, Nbj 9049. 

















FIGURE 4.5 Results of Davisson 
and Germer. Each point on the plot 
represents the relative intensity 
when the detector in Figure 4.4 is 
located at the corresponding angle 
¢ measured from the vertical axis. 
Constructive interference causes 
the intensity of the reflected beam 
to reach amaximum at @ = 50° for 
V=S4V. 
FIGURE 4.6 The crystal surface acts like a diffraction grating with spacing d. 





FIGURE 4.7 Diffraction of neutrons 
by a sodium chloride crystal. 
FIGURE 4.8 Diffraction of 1-GeV protons by oxygen nuclei. The pattern of maxima 
and minima is similar to that of single-slit diffraction of light waves. [Source: 
H. Palevsky et al., Physical Review Letters 18, 1200 (1967).] 
Each of the atoms of the crystal can act as a scatterer, so the scattered electron 
waves can interfere, and we have a crystal diffraction grating for the electrons. 
Figure 4.6 shows a simplified representation of the nickel crystal used in the 
Davisson-Germer experiment. Because the electrons were of low energy, they did 
not penetrate very far into the crystal, and it is sufficient to consider the diffraction 
to take place in the plane of atoms on the surface. The situation is entirely similar 
to using a reflection-type diffraction grating for light; the spacing d between the 
rows of atoms on the crystal is analogous to the spacing between the slits in the 
optical grating. The maxima for a diffraction grating occur at angles φ such that 
the path difference between adjacent rays, d sin φ, is equal to a whole number of 
wavelengths: 

$$d\sin\phi = n\lambda \qquad n = 1, 2, 3, \ldots \tag{4.3}$$

where n is the order number of the maximum. 

From independent data, it is known that the spacing between the rows of atoms 
in a nickel crystal is d = 0.215 nm. The peak at φ = 50° must be a first-order 
peak (n = 1), because no peaks were observed at smaller angles. If this is indeed 
an interference maximum, the corresponding wavelength is, from Eq. 4.3, 

$$\lambda = d\sin\phi = (0.215\ \text{nm})(\sin 50^\circ) = 0.165\ \text{nm}$$

We can compare this value with that expected on the basis of the de Broglie 
theory. An electron accelerated through a potential difference of 54 V has a kinetic 
energy of 54 eV and therefore a momentum of 

$$p = \sqrt{2mK} = \frac{1}{c}\sqrt{2mc^2K} = \frac{1}{c}\sqrt{2(511{,}000\ \text{eV})(54\ \text{eV})} = \frac{1}{c}(7430\ \text{eV})$$

The de Broglie wavelength is λ = h/p = hc/pc. Using hc = 1240 eV·nm, 

$$\lambda = \frac{hc}{pc} = \frac{1240\ \text{eV}\cdot\text{nm}}{7430\ \text{eV}} = 0.167\ \text{nm}$$





This is in excellent agreement with the value found from the diffraction maxi- 
mum, and provides strong evidence in favor of the de Broglie theory. For this 
experimental work, Davisson shared the 1937 Nobel Prize with G. P. Thomson. 
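
The comparison between the measured and predicted wavelengths can be checked numerically. The following Python sketch is an illustration added here, not part of the original experiment's analysis; the function names are hypothetical, and the constants hc = 1240 eV·nm and mc² = 511,000 eV are the rounded values used in the calculation above.

```python
import math

HC_EV_NM = 1240.0             # hc in eV·nm (rounded value used in the text)
ELECTRON_MC2_EV = 511_000.0   # electron rest energy in eV (rounded)

def wavelength_from_grating(d_nm, phi_deg, n=1):
    """Wavelength implied by a grating maximum: d sin(phi) = n * lambda (Eq. 4.3)."""
    return d_nm * math.sin(math.radians(phi_deg)) / n

def de_broglie_wavelength_nm(kinetic_ev, mc2_ev=ELECTRON_MC2_EV):
    """Nonrelativistic de Broglie wavelength: lambda = hc / sqrt(2 mc^2 K)."""
    pc_ev = math.sqrt(2.0 * mc2_ev * kinetic_ev)
    return HC_EV_NM / pc_ev

lam_measured = wavelength_from_grating(d_nm=0.215, phi_deg=50.0)
lam_predicted = de_broglie_wavelength_nm(kinetic_ev=54.0)
print(f"from diffraction peak: {lam_measured:.3f} nm")
print(f"de Broglie prediction: {lam_predicted:.3f} nm")
```

The two values differ by about 1%, which is the level of agreement quoted above.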

The wave nature of particles is not exclusive to electrons; any particle with 
momentum p has de Broglie wavelength h/p. Neutrons are produced in nuclear 
reactors with kinetic energies corresponding to wavelengths of roughly 0.1 nm; 
these also should be suitable for diffraction by crystals. Figure 4.7 shows that 
diffraction of neutrons by a salt crystal produces the same characteristic patterns 
as the diffraction of electrons or X rays. Clifford Shull shared the 1994 Nobel 
Prize for the development of the neutron diffraction technique. 

To study the nuclei of atoms, much smaller wavelengths are needed, of the order 
of 10⁻¹⁵ m. Figure 4.8 shows the diffraction pattern produced by the scattering 
of 1-GeV kinetic energy protons by oxygen nuclei. Maxima and minima of the 
diffracted intensity appear in a pattern similar to the single-slit diffraction shown 
in Figure 4.1. (The intensity at the minima does not fall to zero because nuclei 
do not have a sharp boundary. The determination of nuclear sizes from such 
diffraction patterns is discussed in Chapter 12.) 


Example 4.2 

Protons of kinetic energy 1.00 GeV were diffracted by 
oxygen nuclei, which have a radius of 3.0 fm, to produce 
the data shown in Figure 4.8. Calculate the expected angles 
where the first three diffraction minima should appear. 


Solution 

The total relativistic energy of the protons is E = K + mc² = 1.00 GeV + 0.94 GeV 
= 1.94 GeV, so their momentum is 

$$p = \frac{1}{c}\sqrt{E^2 - (mc^2)^2} = \frac{1}{c}\sqrt{(1.94\ \text{GeV})^2 - (0.94\ \text{GeV})^2} = 1.70\ \text{GeV}/c$$

The corresponding de Broglie wavelength is 

$$\lambda = \frac{h}{p} = \frac{hc}{pc} = \frac{1240\ \text{MeV}\cdot\text{fm}}{1700\ \text{MeV}} = 0.73\ \text{fm}$$

We can represent the oxygen nuclei as circular disks, for which the diffraction 
formula is a bit different from Eq. 4.2: a sin θ = 1.22nλ, where a is the diameter 
of the diffracting object. Based on this formula, the first diffraction minimum 
(n = 1) should appear at the angle 

$$\sin\theta = \frac{1.22\,n\lambda}{a} = \frac{(1.22)(1)(0.73\ \text{fm})}{6.0\ \text{fm}} = 0.148$$

or θ = 8.5°. Because the sine of the diffraction angle is proportional to the 
index n, the n = 2 minimum should appear at the angle where 
sin θ = 2 × 0.148 = 0.296 (θ = 17.2°), and the n = 3 minimum 
where sin θ = 3 × 0.148 = 0.444 (θ = 26.4°). 

From the data in Figure 4.8, we see the first diffraction 
minimum at an angle of about 10°, the second at about 
18°, and the third at about 27°, all in very good agreement 
with the expected values. The data don’t exactly follow 
the formula for diffraction by a disk, because nuclei don’t 
behave quite like disks. In particular, they have diffuse 
rather than sharp edges, which prevents the intensity at 
the diffraction minima from falling to zero and also alters 
slightly the locations of the minima. 
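
The steps of this example can be reproduced with a short Python sketch. It is an illustration under the example's own assumptions (circular-disk formula a sin θ = 1.22nλ, proton rest energy rounded to 0.94 GeV, hc = 1240 MeV·fm); the function name is hypothetical. Because the code does not round the momentum to 1.70 GeV/c, its angles differ from the text's values in the last digit.

```python
import math

HC_MEV_FM = 1240.0        # hc in MeV·fm (rounded value used in the text)
PROTON_MC2_MEV = 940.0    # proton rest energy, rounded as in the example

def disk_minima_deg(kinetic_mev, diameter_fm, orders=(1, 2, 3)):
    """Angles (degrees) of diffraction minima for a disk: a sin(theta) = 1.22 n lambda."""
    total_energy = kinetic_mev + PROTON_MC2_MEV
    pc = math.sqrt(total_energy**2 - PROTON_MC2_MEV**2)   # relativistic momentum times c
    wavelength_fm = HC_MEV_FM / pc
    return [math.degrees(math.asin(1.22 * n * wavelength_fm / diameter_fm))
            for n in orders]

# Oxygen nucleus of radius 3.0 fm, so diameter 6.0 fm.
angles = disk_minima_deg(kinetic_mev=1000.0, diameter_fm=6.0)
for n, angle in enumerate(angles, start=1):
    print(f"n = {n}: theta = {angle:.1f} degrees")
```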





Double-Slit Experiments with Particles 


The definitive evidence for the wave nature of light was deduced from the 
double-slit experiment performed by Thomas Young in 1801 (discussed in 
Section 3.1). In principle, it should be possible to do double-slit experiments 
with particles and thereby directly observe their wavelike behavior. However, the 
technological difficulties of producing double slits for particles are formidable, 
and such experiments did not become possible until long after the time of de 
Broglie. The first double-slit experiment with electrons was done in 1961. A 
diagram of the apparatus is shown in Figure 4.9. The electrons from a hot filament 
were accelerated through 50 kV (corresponding to λ = 5.4 pm) and then passed 
through a double slit of separation 2.0 μm and width 0.5 μm. A photograph of 
the resulting intensity pattern is shown in Figure 4.10. The similarity with the 
double-slit pattern for light (Figure 3.2) is striking. 
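
The quoted wavelength for 50-keV electrons can be checked with a few lines of Python. This sketch is an added illustration; it assumes the relativistic relation (pc)² = K² + 2Kmc², which is appropriate because 50 keV is not negligible compared with the electron rest energy of 511 keV. The function name is hypothetical.

```python
import math

HC_EV_NM = 1240.0             # hc in eV·nm (rounded)
ELECTRON_MC2_EV = 511_000.0   # electron rest energy in eV (rounded)

def relativistic_wavelength_pm(kinetic_ev, mc2_ev=ELECTRON_MC2_EV):
    """de Broglie wavelength in pm, using (pc)^2 = K^2 + 2 K mc^2."""
    pc_ev = math.sqrt(kinetic_ev**2 + 2.0 * kinetic_ev * mc2_ev)
    return HC_EV_NM / pc_ev * 1000.0   # convert nm to pm

lam_pm = relativistic_wavelength_pm(50_000.0)
slit_separation_pm = 2.0e6             # 2.0 micrometers expressed in pm
print(f"wavelength: {lam_pm:.2f} pm")
print(f"slit separation / wavelength: {slit_separation_pm / lam_pm:.3g}")
```

The slit separation is several hundred thousand wavelengths, which suggests why the fringes are so closely spaced and why the experiment was technically demanding.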

A similar experiment can be done for neutrons. A beam of neutrons from 
a nuclear reactor can be slowed to a room-temperature “thermal” energy 
distribution (average K ≈ kT ≈ 0.025 eV), and a specific wavelength can be 
selected by a scattering process similar to Bragg diffraction (see Eq. 3.18 
and Problem 32 at the end of the present chapter). In one experiment, neutrons 
of kinetic energy 0.00024 eV and de Broglie wavelength 1.85 nm passed 
through a gap of diameter 148 μm in a material that absorbs virtually all 
of the neutrons incident on it (Figure 4.11). In the center of the gap was a 
boron wire (also highly absorptive for neutrons) of diameter 104 μm. The neutrons 
could pass on either side of the wire through slits of width 22 μm. The 
intensity of neutrons that pass through this double slit was observed by sliding 



FIGURE 4.9 Double-slit apparatus for electrons. Electrons from the filament F are 
accelerated through 50 kV and pass through the double slit. They produce a visible 
pattern when they strike a fluorescent screen (like a TV screen), and the resulting 
pattern is photographed. A photograph is shown in Figure 4.10. [See C. Jonsson, 
American Journal of Physics 42, 4 (1974).] 
FIGURE 4.10 Double-slit interference pattern for electrons. 
FIGURE 4.12 Intensity pattern observed for double-slit interference with neutrons. 
The spacing between the maxima is about 75 μm. [Source: R. Gahler and 
A. Zeilinger, American Journal of Physics 59, 316 (1991).] 
FIGURE 4.13 Intensity pattern observed for double-slit interference with helium 
atoms. [Source: O. Carnal and J. Mlynek, Physical Review Letters 66, 2689 (1991).] 
FIGURE 4.11 Double-slit apparatus for neutrons. Thermal neutrons from a reactor 
are incident on a crystal; scattering through a particular angle selects the energy of 
the neutrons. After passing through the double slit, the neutrons are counted by the 
scanning slit assembly, which moves laterally. 


another slit across the beam and measuring the intensity of neutrons passing 
through this “scanning slit.” Figure 4.12 shows the resulting pattern of intensity 
maxima and minima, which leaves no doubt that interference is occurring and that 
the neutrons have a corresponding wave nature. The wavelength can be deduced 
from the slit separation using Eq. 3.16 to obtain the spacing between adjacent 
maxima, Δy = y_{n+1} − y_n. Estimating the spacing Δy from Figure 4.12 to be about 
75 μm, we obtain 

$$\lambda = \frac{d\,\Delta y}{D} = \frac{(126\ \mu\text{m})(75\ \mu\text{m})}{5\ \text{m}} = 1.89\ \text{nm}$$

This result agrees very well with the de Broglie wavelength of 1.85 nm selected 
for the neutron beam. 
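
Both neutron wavelengths quoted above can be reproduced numerically. The Python sketch below is an added illustration; the function names are hypothetical, the neutron rest energy of 939.6 MeV is a standard value not given in the text, and the slit separation of 126 μm is taken as the wire diameter plus one slit width (104 μm + 22 μm).

```python
import math

HC_EV_NM = 1240.0           # hc in eV·nm (rounded)
NEUTRON_MC2_EV = 939.6e6    # neutron rest energy in eV (standard value, assumed here)

def neutron_wavelength_nm(kinetic_ev):
    """Nonrelativistic de Broglie wavelength: lambda = hc / sqrt(2 mc^2 K)."""
    return HC_EV_NM / math.sqrt(2.0 * NEUTRON_MC2_EV * kinetic_ev)

def wavelength_from_fringes_nm(slit_separation_m, fringe_spacing_m, distance_m):
    """Double-slit relation: lambda = d * (delta y) / D, returned in nm."""
    return slit_separation_m * fringe_spacing_m / distance_m * 1e9

lam_de_broglie = neutron_wavelength_nm(0.00024)
lam_fringes = wavelength_from_fringes_nm(126e-6, 75e-6, 5.0)
print(f"de Broglie wavelength: {lam_de_broglie:.2f} nm")
print(f"from fringe spacing:   {lam_fringes:.2f} nm")
```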

It is also possible to do a similar experiment with atoms. In this case, a 
source of helium atoms formed a beam (of velocity corresponding to a kinetic 
energy of 0.020 eV) that passed through a double slit of separation 8 μm and 
width 1 μm. Again a scanning slit was used to measure the intensity of the beam 
passing through the double slit. Figure 4.13 shows the resulting intensity pattern. 
Although the results are not as dramatic as those for electrons and neutrons, there 
is clear evidence of interference maxima and minima, and the separation of the 
maxima gives a wavelength that is consistent with the de Broglie wavelength (see 
Problem 8). 

Diffraction can be observed with even larger objects. Figure 4.14 shows the 
pattern produced by fullerene molecules (C₆₀) in passing through a diffraction 
grating with a spacing of d = 100 nm. The diffraction pattern was observed at 
a distance of 1.2 m from the grating. Estimating the separation of the maxima 
in Figure 4.14 as 50 μm, we get the angular separation of the maxima to be 
θ ≈ tan θ = (50 μm)/(1.2 m) = 4.2 × 10⁻⁵ rad, and thus λ = d sin θ = 4.2 pm. 
For C₆₀ molecules with a speed of 117 m/s used in this experiment, the expected 
de Broglie wavelength is 4.7 pm, in good agreement with our estimate from the 
diffraction pattern. 
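
The fullerene estimate can be checked with the following Python sketch, added here as an illustration. It assumes a C₆₀ mass of 720 u (60 carbon-12 atoms), a value not stated in the text, and uses standard rounded values of h and the atomic mass unit; the variable names are hypothetical.

```python
import math

H_JS = 6.626e-34      # Planck's constant in J·s
U_KG = 1.6605e-27     # atomic mass unit in kg
C60_MASS_U = 720.0    # assumed: 60 atoms of carbon-12

# Expected de Broglie wavelength, lambda = h / (m v), at the quoted speed.
speed_ms = 117.0
lam_de_broglie_pm = H_JS / (C60_MASS_U * U_KG * speed_ms) * 1e12

# Wavelength inferred from the grating: lambda = d sin(theta),
# with theta ~ (separation of maxima) / (distance to detector).
theta_rad = 50e-6 / 1.2
lam_grating_pm = 100e-9 * math.sin(theta_rad) * 1e12

print(f"de Broglie wavelength: {lam_de_broglie_pm:.1f} pm")
print(f"from grating pattern:  {lam_grating_pm:.1f} pm")
```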

In this chapter we have discussed several interference and diffraction 
experiments using different particles—electrons, protons, neutrons, atoms, 
and molecules. These experiments are not restricted to any particular type of 
particle or to any particular type of observation. They are examples of a general 
phenomenon, the wave nature of particles, that was unobserved before 1920 
because the necessary experiments had not yet been done. Today this wave 
nature is used as a basic tool by scientists. For example, neutron diffraction 
FIGURE 4.14 Diffraction grating pattern produced by C₆₀ molecules. 
[Source: O. Nairz, M. Arndt, and A. Zeilinger, American Journal of 
Physics 71, 319 (2003).] 
FIGURE 4.15 The atomic structure of solid benzene as deduced from neutron 
diffraction. The circles indicate contours of constant density. The black circles 
show the locations of the six carbon atoms that form the familiar benzene ring. 
The blue circles show the locations of the hydrogen atoms. 


gives detailed information on the structure of solid crystals and of complex 
molecules (Figure 4.15). The electron microscope uses electron waves to illumi- 
nate and form an image of objects; because the wavelength can be made thousands 
of times smaller than that of visible light, it is possible to resolve and observe 
small details that are not observable with visible light (Figure 4.16). 


Through Which Slit Does the Particle Pass? 


When we do a double-slit experiment with particles such as electrons, it is 
tempting to try to determine through which slit the particle passes. For example, 
we could surround each slit with an electromagnetic loop that causes a meter to 
deflect whenever a charged particle or perhaps a particle with a magnetic moment 
passes through the loop (Figure 4.17). If we fired the particles through the slits at 
a slow enough rate, we could track each particle as it passed through one slit or 
the other and then appeared on the screen. 

If we performed this imaginary experiment, the result would no longer be an 
interference pattern on the screen. Instead, we would observe a pattern similar to 
that shown in Figure 4.17, with “hits” in front of each slit, but no interference 
fringes. No matter what sort of device we use to determine through which slit the 
particle passes, the interference pattern will be destroyed. The classical particle 
must pass through one slit or the other; only a wave can reveal interference, 
which depends on parts of the wavefront passing through both slits and then 
recombining. 

When we ask through which slit the particle passed, we are investigating only 
the particle aspects of its behavior, and we cannot observe its wave nature (the 
interference pattern). Conversely, when we study the wave nature, we cannot 
simultaneously observe the particle nature. The electron will behave as a particle 
or a wave, but we cannot observe both aspects of its behavior simultaneously. 
This curious aspect of quantum mechanics was also discussed for photons in 
Section 3.6, where we discovered that experiments can reveal either the particle 
nature of the photon or its wave nature, but not both aspects simultaneously. 





FIGURE 4.16 Electron microscope 
image of bacteria on the surface of 
a human tongue. The magnification 
here is about a factor of 5000. 
FIGURE 4.18 (a) A pure sine wave, which extends from −∞ to +∞. 
(b) A narrow wave pulse. 
FIGURE 4.17 Apparatus to record passage of electrons through slits. 
Each slit is surrounded by a loop with a meter that signals the passage 
of an electron through the slit. No interference fringes are seen on the 
screen. 


This is the basis for the principle of complementarity, which asserts that the 
complete description of a photon or a particle such as an electron cannot be made 
in terms of only particle properties or only wave properties, but that both aspects of 
its behavior must be considered. Moreover, the particle and wave natures cannot 
be observed simultaneously, and the type of behavior that we observe depends 
on the kind of experiment we are doing: a particle-type experiment shows only 
particlelike behavior, and a wave-type experiment shows only wavelike behavior. 


4.3 UNCERTAINTY RELATIONSHIPS FOR CLASSICAL WAVES 





In quantum mechanics, we want to use de Broglie waves to describe particles. In 
particular, the amplitude of the wave will tell us something about the location of 
the particle. Clearly a pure sinusoidal wave, as in Figure 4.18a, is not much use 
in locating a particle: the wave extends from −∞ to +∞, so the particle might 
be found anywhere in that region. On the other hand, a narrow wave pulse like 
Figure 4.18b does a pretty good job of locating the particle in a small region of 
space, but this wave does not have an easily identifiable wavelength. In the first 
case, we know the wavelength exactly but have no knowledge of the location of 
the particle, while in the second case we have a good idea of the location of the 
particle but a poor knowledge of its wavelength. Because wavelength is associated 
with momentum by the de Broglie relationship (Eq. 4.1), a poor knowledge of the 
wavelength is associated with a poor knowledge of the particle’s momentum. For 
a classical particle, we would like to know both its location and its momentum as 
precisely as possible. For a quantum particle, we are going to have to make some 
compromises—the better we know its momentum (or wavelength), the less we 
know about its location. We can improve our knowledge of its location only at 
the expense of our knowledge of its momentum. 
This competition between knowledge of location and knowledge of wavelength 
is not restricted to de Broglie waves—classical waves show the same effect. All 
real waves can be represented as wave packets—disturbances that are localized to 
a finite region of space. We will discuss more about constructing wave packets in 
Section 4.5. In this section we will examine this competition between specifying 
the location and the wavelength of classical waves more closely. 

Figure 4.19a shows a very small wave packet. The disturbance is well localized 
to a small region of space of length Δx. (Imagine listening to a very short burst 
of sound, of such brief duration that it is hard for you to recognize the pitch or 
frequency of the wave.) Let’s try to measure the wavelength of this wave packet. 
Placing a measuring stick along the wave, we have some difficulty defining exactly 
where the wave starts and where it ends. Our measurement of the wavelength is 
therefore subject to a small uncertainty Δλ. Let’s represent this uncertainty as a 
fraction ε of the wavelength λ, so that Δλ ~ ελ. The fraction ε is certainly less 
than 1, but it is probably greater than 0.01, so we estimate that ε ~ 0.1 to within 
an order of magnitude. (In our discussion of uncertainty, we use the ~ symbol 
to indicate a rough order-of-magnitude estimate.) That is, the uncertainty in our 
measurement of the wavelength might be roughly 10% of the wavelength. 

The size of this wave disturbance is roughly one wavelength, so Δx ≈ λ. For 
this discussion we want to examine the product of the size of the wave packet and 
the uncertainty in the wavelength, Δx times Δλ with Δx ≈ λ and Δλ ~ ελ: 

$$\Delta x\,\Delta\lambda \sim \varepsilon\lambda^2 \tag{4.4}$$

This expression shows the inverse relationship between the size of the wave 
packet and the uncertainty in the wavelength: for a given wavelength, the smaller 
the size of the wave packet, the greater the uncertainty in our knowledge of the 
wavelength. That is, as Δx gets smaller, Δλ must become larger. 

Making a larger wave packet doesn’t help us at all. Figure 4.19b shows a larger 
wave packet with the same wavelength. Suppose this larger wave packet contains 



































FIGURE 4.19 (a) Measuring the wavelength of a wave represented by a 
small wave packet of length roughly one wavelength. (b) Measuring the 
wavelength of a wave represented by a large wave packet consisting of N 
waves. 
N cycles of the wave, so that Δx ≈ Nλ. Again using our measuring stick, we try 
to measure the size of N wavelengths, and dividing this distance by N we can then 
determine the wavelength. We still have the same uncertainty of ελ in locating the 
start and end of this wave packet, but when we divide by N to find the wavelength, 
the uncertainty in one wavelength becomes Δλ ~ ελ/N. For this larger wave 
packet, the product of Δx and Δλ is ΔxΔλ ~ (Nλ)(ελ/N) = ελ², exactly the 
same as in the case of the smaller wave packet. Equation 4.4 is a fundamental 
property of classical waves, independent of the type of wave or the method used 
to measure its wavelength. This is the first of the uncertainty relationships for 
classical waves. 


Example 4.3 

In a measurement of the wavelength of water waves, 10 
wave cycles are counted in a distance of 196 cm. Estimate 
the minimum uncertainty in the wavelength that might be 
obtained from this experiment. 


Solution 

With 10 wave crests in a distance of 196 cm, the wavelength 
is about (196 cm)/10 = 19.6 cm. We can take ε ~ 0.1 as a 
good order-of-magnitude estimate of the typical precision 
that might be obtained. From Eq. 4.4, we can find the 
uncertainty in wavelength: 

$$\Delta\lambda \sim \frac{\varepsilon\lambda^2}{\Delta x} = \frac{(0.1)(19.6\ \text{cm})^2}{196\ \text{cm}} = 0.2\ \text{cm}$$

With an uncertainty of 0.2 cm, the “true” wavelength might 
range from 19.5 cm to 19.7 cm, so we might express this 
result as 19.6 ± 0.1 cm. 
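
The estimate in this example, and the claim that the product ΔxΔλ does not depend on the number of cycles in the packet, can be illustrated with a short Python sketch. It is an added illustration that takes ε = 0.1 as in the text's order-of-magnitude estimate; the function name is hypothetical.

```python
def wavelength_uncertainty(wavelength, packet_length, eps=0.1):
    """Estimate from Eq. 4.4: delta_lambda ~ eps * lambda**2 / delta_x."""
    return eps * wavelength**2 / packet_length

# Example 4.3: 10 cycles counted in 196 cm.
packet_length_cm = 196.0
wavelength_cm = packet_length_cm / 10
d_lambda_cm = wavelength_uncertainty(wavelength_cm, packet_length_cm)
print(f"wavelength = {wavelength_cm:.1f} cm, uncertainty ~ {d_lambda_cm:.1f} cm")

# The product delta_x * delta_lambda is the same for any number of cycles N.
products = []
for n_cycles in (1, 10, 100):
    delta_x = n_cycles * wavelength_cm
    delta_lambda = wavelength_uncertainty(wavelength_cm, delta_x)
    products.append(delta_x * delta_lambda)
print("delta_x * delta_lambda for N = 1, 10, 100:", products)
```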








FIGURE 4.20 Measuring the period of a wave represented by a small wave 
packet of duration roughly one period. 


The Frequency-Time Uncertainty Relationship 


We can take a different approach to uncertainty for classical waves by imagining a 
measurement of the period rather than the wavelength of the wave that comprises 
our wave packet. Suppose we have a timing device that we use to measure 
the duration of the wave packet, as in Figure 4.20. Here we are plotting the 
wave disturbance as a function of time rather than location. The “size” of the 
wave packet is now its duration in time, which is roughly one period T for this 
wave packet, so that Δt ≈ T. Whatever measuring device we use, we have some 
difficulty locating exactly the start and end of one cycle, so we have an uncertainty 
ΔT in measuring the period. As before, we’ll assume this uncertainty is some 
small fraction of the period: ΔT ~ εT. To examine the competition between the 
duration of the wave packet and our ability to measure its period, we calculate the 
product of Δt and ΔT: 

$$\Delta t\,\Delta T \sim \varepsilon T^2 \tag{4.5}$$


This is the second of our uncertainty relationships for classical waves. It shows that 
for a wave of a given period, the smaller the duration of the wave packet, the larger 
is the uncertainty in our measurement of the period. Note the similarity between 
Eqs. 4.4 and 4.5, one representing relationships in space and the other in time. 

It will turn out to be more useful if we write Eq. 4.5 in terms of frequency 
instead of period. Given that period T and frequency f are related by f = 1/T, how 
is Δf related to ΔT? The correct relationship is certainly not Δf = 1/ΔT, which 
would imply that a very small uncertainty in the period would lead to a very large 
uncertainty in the frequency. Instead, they should be directly related: the better 
we know the period, the better we know the frequency. Here is how we obtain the 
relationship: Beginning with f = 1/T, we take differentials on both sides: 

$$df = -\frac{1}{T^2}\,dT$$

Next we convert the infinitesimal differentials to finite intervals, and because we 
are interested only in the magnitude of the uncertainties we can ignore the minus 
sign: 

$$\Delta f = \frac{1}{T^2}\,\Delta T \tag{4.6}$$

Combining Eqs. 4.5 and 4.6, we obtain 

$$\Delta f\,\Delta t \sim \varepsilon \tag{4.7}$$


Equation 4.7 shows that the longer the duration of the wave packet, the more 
precisely we can measure its frequency. 
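
The step from differentials to finite intervals in Eq. 4.6 is an approximation that holds when ΔT is small compared with T. The Python sketch below, added as an illustration, compares the exact change in f = 1/T with the linearized form ΔT/T². The 1-ms period and the function names are hypothetical choices made only for this illustration.

```python
def delta_f_exact(period, d_period):
    """Exact magnitude of the change in f = 1/T when T changes by d_period."""
    return abs(1.0 / period - 1.0 / (period + d_period))

def delta_f_linear(period, d_period):
    """Linearized form used in Eq. 4.6: delta_f = delta_T / T**2."""
    return d_period / period**2

period_s = 1.0e-3   # hypothetical period of 1 ms (f = 1000 Hz)
for eps in (0.1, 0.01, 0.001):
    d_period = eps * period_s
    exact = delta_f_exact(period_s, d_period)
    linear = delta_f_linear(period_s, d_period)
    # With delta_t ~ T, the product delta_f * delta_t reduces to eps (Eq. 4.7).
    product = linear * period_s
    print(f"eps = {eps}: exact = {exact:.4f} Hz, "
          f"linear = {linear:.4f} Hz, delta_f * delta_t = {product:.4f}")
```

The two forms agree more closely as ε decreases, which is consistent with treating Eq. 4.6 as an order-of-magnitude relationship.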


Example 4.4 

An electronics salesman offers to sell you a frequency-measuring 
device. When hooked up to a sinusoidal signal, it automatically 
displays the frequency of the signal, and to account for frequency 
variations, the frequency is remeasured once each second and the 
display is updated. The salesman claims the device to be accurate 
to 0.01 Hz. Is this claim valid? 


Solution 

Based on Eq. 4.7, and again estimating ε to be about 0.1, we 
know that a measurement of frequency in a time Δt = 1 s 
must have an associated uncertainty of about 

$$\Delta f \sim \frac{\varepsilon}{\Delta t} = \frac{0.1}{1\ \text{s}} = 0.1\ \text{Hz}$$

It appears that the salesman may be exaggerating the 
precision of this device. 





4.4 HEISENBERG UNCERTAINTY RELATIONSHIPS 


The uncertainty relationships discussed in the previous section apply to all waves, 
and we should therefore apply them to de Broglie waves. We can use the basic de 
Broglie relationship p = h/λ to relate the uncertainty in the momentum Δp to the 
uncertainty in wavelength Δλ, using the same procedure that we used to obtain 
Eq. 4.6. Starting with p = h/λ, we take differentials on both sides and obtain 
dp = (−h/λ²)dλ. Now we change the differentials into differences, ignoring the 
minus sign: 

$$\Delta p = \frac{h}{\lambda^2}\,\Delta\lambda \tag{4.8}$$
Werner Heisenberg (1901-1976, Germany). Best known for the uncertainty 
principle, he also developed a complete formulation of the quantum theory 
based on matrices. 
















FIGURE 4.21 Single-slit diffraction of electrons. A wide beam of electrons 
is incident on a narrow slit. The electrons that pass through the slit acquire 
a component of momentum in the x direction. 


An uncertainty in the momentum of the particle is directly related to the uncertainty 
in the wavelength associated with the particle’s de Broglie wave packet. 
Combining Eq. 4.8 with Eq. 4.4, we obtain 

$$\Delta x\,\Delta p \sim \varepsilon h \tag{4.9}$$

Just like Eq. 4.4, this equation suggests an inverse relationship between Δx and 
Δp. The smaller the size of the wave packet of the particle, the larger is the 
uncertainty in its momentum (and thus in its velocity). 

Quantum mechanics provides a formal procedure for calculating Δx and Δp 
for wave packets corresponding to different physical situations and for different 
schemes for confining a particle. One outcome of these calculations gives the 
wave packet with the smallest possible value of the product ΔxΔp, which turns 
out to be h/4π, as we will discuss in the next chapter. Thus ε = 1/4π in this case. 
All other wave packets will have larger values for ΔxΔp. 

The combination h/2π occurs frequently in quantum mechanics and is given 
the special symbol ħ (“h-bar”): 

$$\hbar = \frac{h}{2\pi} = 1.05 \times 10^{-34}\ \text{J}\cdot\text{s} = 6.58 \times 10^{-16}\ \text{eV}\cdot\text{s}$$

In terms of ħ, we can write the uncertainty relationship as 

$$\Delta x\,\Delta p_x \geq \tfrac{1}{2}\hbar \tag{4.10}$$

The x subscript has been added to the momentum to remind us that Eq. 4.10 
applies to motion in a given direction and relates the uncertainties in position and 
momentum in that direction only. Similar and independent relationships can be 
applied in the other directions as necessary; thus ΔyΔp_y ≥ ħ/2 or ΔzΔp_z ≥ ħ/2. 

Equation 4.10 is the first of the Heisenberg uncertainty relationships. It sets the 
limit of the best we can possibly do in an experiment to measure simultaneously the 
location and the momentum of a particle. Another way of interpreting this equation 
is to say that the more we try to confine a particle, the less we know about its 
momentum. 

Because the limit of ħ/2 represents the minimum value of the product ΔxΔp_x, 
in most cases we will do worse than this limit. It is therefore quite acceptable to 
take 

$$\Delta x\,\Delta p_x \sim \hbar \tag{4.11}$$

as a rough estimate of the relationship between the uncertainties in location and 
momentum. 

As an example, let’s consider a beam of electrons incident on a single slit, as in 
Figure 4.21. We know this experiment as single-slit diffraction, which produces 
the characteristic diffraction pattern illustrated in Figure 4.1. We’ll assume that 
the particles are initially moving in the y direction and that we know their 
momentum in that direction as precisely as possible. If the electrons initially have 
no component of their momentum in the x direction, we know p_x exactly (it is 
exactly zero), so that Δp_x = 0; thus we know nothing about the x coordinates of 
the electrons (Δx = ∞). This situation represents a very wide beam of electrons, 
only a small fraction of which pass through the slit. 

At the instant that some of the electrons pass through the slit, we know quite 
a bit more about their x location. In order to pass through the slit, the uncertainty 
in their x location is no larger than a, the width of the slit; thus Δx = a. This 
improvement in our knowledge of the electron’s location comes at the expense of 
our knowledge of its momentum, however. According to Eq. 4.11, the uncertainty 
in the x component of its momentum is now Δp_x ~ ħ/a. Measurements beyond 
the slit no longer show the particle moving precisely in the y direction (for 
which p_x = 0); the momentum now has a small x component as well, with values 
distributed about zero but now with a range of roughly ±ħ/a. In passing through 
the slit, a particle acquires on the average an x component of momentum of 
roughly ħ/a, according to the uncertainty principle. 

Let us now find the angle θ that specifies where a particle with this value of p_x 
lands on the screen. For small angles, sin θ ≈ tan θ and so 

$$\sin\theta \approx \tan\theta = \frac{p_x}{p_y} = \frac{\hbar/a}{p_y} = \frac{\lambda}{2\pi a}$$

using λ = h/p_y for the de Broglie wavelength of the electrons. The first minimum 
of the diffraction pattern of a single slit is located at sin θ = λ/a, which is larger 
than the spread of angles into which most of the particles are diffracted. The 
calculation shows that the distribution of transverse momentum given by the 
uncertainty principle is roughly equivalent to the spreading of the beam into the 
central diffraction peak, and it illustrates again the close connection between wave 
behavior and uncertainty in particle location. 

The diffraction (spreading) of a beam following passage through a slit is just 
the effect of the uncertainty principle on our attempt to specify the location of the 
particle. As we make the slit narrower, p_x increases and the beam spreads even 
more. In trying to obtain more precise knowledge of the location of the particle 
by making the slit narrower, we have lost knowledge of the direction of its travel. 
This trade-off between observations of position and momentum is the essence of 
the Heisenberg uncertainty principle. 
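
The single-slit estimate can be made concrete with a short Python sketch, added here as an illustration. It compares the angular spread implied by Δp_x ~ ħ/a with the position of the first diffraction minimum, sin θ = λ/a. The wavelength (0.167 nm, as for the 54-eV electrons discussed earlier) and the slit width (10 nm) are hypothetical values chosen only for illustration, and the function name is likewise hypothetical.

```python
import math

HC_EV_NM = 1240.0                         # hc in eV·nm (rounded)
HBARC_EV_NM = HC_EV_NM / (2.0 * math.pi)  # hbar*c in eV·nm

def slit_spread(wavelength_nm, slit_width_nm):
    """Return (tan(theta) from the uncertainty estimate, sin(theta) at first minimum)."""
    py_c = HC_EV_NM / wavelength_nm        # p_y * c from lambda = h / p_y
    dpx_c = HBARC_EV_NM / slit_width_nm    # delta_p_x * c ~ hbar * c / a  (Eq. 4.11)
    tan_theta_uncertainty = dpx_c / py_c   # equals lambda / (2 pi a)
    sin_theta_first_min = wavelength_nm / slit_width_nm
    return tan_theta_uncertainty, sin_theta_first_min

wavelength_nm = 0.167   # hypothetical: 54-eV electrons
slit_width_nm = 10.0    # hypothetical slit width
tan_unc, sin_min = slit_spread(wavelength_nm, slit_width_nm)
print(f"uncertainty estimate: tan(theta) = {tan_unc:.5f}")
print(f"first minimum:        sin(theta) = {sin_min:.5f}")
print(f"ratio = {tan_unc / sin_min:.3f}")
```

The ratio is 1/2π for any wavelength and slit width, so the spread estimated from the uncertainty principle always falls inside the central diffraction peak. Halving the slit width doubles both angles.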

We can also apply the second of our classical uncertainty relationships (Eq. 4.7) 
to de Broglie waves. If we assume the energy-frequency relationship for light, 
E = hf, can be applied to particles, then we immediately obtain ΔE = hΔf. 
Combining this with Eq. 4.7, we obtain 

$$\Delta E\,\Delta t \sim \varepsilon h \tag{4.12}$$

Once again, the minimum uncertainty wave packet gives ε = 1/4π, and so 

$$\Delta E\,\Delta t \geq \tfrac{1}{2}\hbar \tag{4.13}$$

This is the second of the Heisenberg uncertainty relationships. It tells us that 
the more precisely we try to determine the time coordinate of a particle, the less 
precisely we know its energy. For example, if a particle has a very short lifetime 
between its creation and decay (Δt → 0), a measurement of its rest energy (and 
thus its mass) will be very imprecise (ΔE → ∞). Conversely, the rest energy of 
a stable particle (one with an infinite lifetime, so that Δt = ∞) can in principle be 
measured with unlimited precision (ΔE = 0). 

As in the case of the first Heisenberg relationship, we can take 

$$\Delta E\,\Delta t \sim \hbar \tag{4.14}$$

as a reasonable estimate for most wave packets. 
The Heisenberg uncertainty relationships are the mathematical representations 
of the Heisenberg uncertainty principle, which states: 


It is not possible to make a simultaneous determination of the position and 
the momentum of a particle with unlimited precision, 


and 


It is not possible to make a simultaneous determination of the energy and 
the time coordinate of a particle with unlimited precision. 


These relationships give an estimate of the minimum uncertainty that can result 
from any experiment; measurement of the position and momentum of a particle 
will give a spread of values of widths Δx and Δp_x. We may, for other reasons, do 
much worse than Eqs. 4.10 and 4.13, but we can do no better. 

These relationships have a profound impact on our view of nature. It is quite 
acceptable to say that there is an uncertainty in locating the position of a water 
wave. It is quite another matter to make the same statement about a de Broglie 
wave, because there is an implied corresponding uncertainty in the position of the 
particle. Equations 4.10 and 4.13 say that nature imposes a limit on the accuracy 
with which we can do experiments. To emphasize this point, the Heisenberg 
relationships are sometimes called “indeterminacy” rather than “uncertainty” 
principles, because the idea of uncertainty may suggest an experimental limit 
that can be reduced by using better equipment or technique. In actuality, these 
coordinates are indeterminate to the limits provided by Eqs. 4.10 and 4.13: no 
matter how hard we try, it is simply not possible to measure more precisely. 


Example 4.5 

An electron moves in the x direction with a speed of 
$3.6 \times 10^{6}$ m/s. We can measure its speed to a precision of 
1%. With what precision can we simultaneously measure 
its x coordinate? 

Solution 
The electron's momentum is 

$$p_x = mv_x = (9.11 \times 10^{-31}\ \text{kg})(3.6 \times 10^{6}\ \text{m/s}) = 3.3 \times 10^{-24}\ \text{kg}\cdot\text{m/s}$$

The uncertainty $\Delta p_x$ is 1% of this value, or $3.3 \times 10^{-26}$ kg·m/s. 
The uncertainty in position is then 

$$\Delta x \sim \frac{\hbar}{\Delta p_x} = \frac{1.05 \times 10^{-34}\ \text{J}\cdot\text{s}}{3.3 \times 10^{-26}\ \text{kg}\cdot\text{m/s}} = 3.2\ \text{nm}$$

which is roughly 10 atomic diameters. 





Example 4.6 

Repeat the calculations of the previous example in the case 
of a pitched baseball (m = 0.145 kg) moving at a speed of 
95 mi/h (42.5 m/s). Again assume that its speed can be 
measured to a precision of 1%. 

Solution 
The baseball's momentum is 

$$p_x = mv_x = (0.145\ \text{kg})(42.5\ \text{m/s}) = 6.16\ \text{kg}\cdot\text{m/s}$$

The uncertainty in momentum is $6.16 \times 10^{-2}$ kg·m/s, and 
the corresponding uncertainty in position is 

$$\Delta x \sim \frac{\hbar}{\Delta p_x} = \frac{1.05 \times 10^{-34}\ \text{J}\cdot\text{s}}{6.16 \times 10^{-2}\ \text{kg}\cdot\text{m/s}} = 1.7 \times 10^{-33}\ \text{m}$$

This uncertainty is 19 orders of magnitude smaller than the 
size of an atomic nucleus. The uncertainty principle cannot 
be blamed for the batter missing the pitch! Once again 
we see that, because of the small magnitude of Planck's 
constant, quantum effects are not observable for ordinary 
objects. 
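
The arithmetic of Examples 4.5 and 4.6 can be checked with a short script. This is a minimal sketch that assumes the estimate $\Delta x \sim \hbar/\Delta p_x$ used in both examples; the function name `position_uncertainty` and the rounded value of $\hbar$ are illustrative choices, not part of the text.

```python
HBAR = 1.055e-34  # J*s, reduced Planck constant (rounded; an assumed value)


def position_uncertainty(mass_kg, speed_m_s, fractional_precision):
    """Estimate dx ~ hbar / dp when the speed is known to the given fraction."""
    delta_p = fractional_precision * mass_kg * speed_m_s  # momentum uncertainty
    return HBAR / delta_p


# Electron of Example 4.5 and baseball of Example 4.6, both measured to 1%.
electron_dx = position_uncertainty(9.11e-31, 3.6e6, 0.01)
baseball_dx = position_uncertainty(0.145, 42.5, 0.01)

print(f"electron: {electron_dx:.2e} m")
print(f"baseball: {baseball_dx:.2e} m")
```

The same function applied to the two masses makes the point of the examples explicit: only the size of the momentum changes, and that alone separates a nanometer-scale uncertainty from an unobservably small one.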


A Statistical Interpretation of Uncertainty 


A diffraction pattern, such as that shown in Figure 4.21, is the result of the 
passage of many particles or photons through the slit. So far, we have been 
discussing the behavior of only one particle. Let’s imagine that we do an 
experiment in which a large number of particles passes (one at a time) through 
the slit, and we measure the transverse (x component) momentum of each 
particle after it passes through the slit. We can do this experiment simply by 
placing a detector at different locations on the screen where we observe the 
diffraction pattern. Because the detector actually accepts particles over a finite 
region on the screen, it measures in a range of deflection angles or equivalently 
in a range of transverse momentum. The result of the experiment might look 
something like Figure 4.22. The vertical scale shows the number of particles with 
momentum in each interval corresponding to different locations of the detector 
on the screen. The values are symmetrically arranged about zero, which indicates 
that the mean or average value of $p_x$ is zero. The width of the distribution is 
characterized by $\Delta p_x$. 

Figure 4.22 resembles a statistical distribution, and in fact the precise definition 
of $\Delta p_x$ is similar to that of the standard deviation $\sigma_A$ of a quantity A that has a 
mean or average value $A_{\rm av}$: 

$$\sigma_A = \sqrt{(A^2)_{\rm av} - (A_{\rm av})^2}$$

If there are N individual measurements of A, then $A_{\rm av} = N^{-1}\sum A_i$ and 
$(A^2)_{\rm av} = N^{-1}\sum A_i^2$. 

By analogy, we can make a rigorous definition of the uncertainty in 
momentum as 

$$\Delta p_x = \sqrt{(p_x^2)_{\rm av} - (p_{x,{\rm av}})^2} \tag{4.15}$$


The average value of the transverse momentum for the situation shown in 
Figure 4.22 is zero, so 

$$\Delta p_x = \sqrt{(p_x^2)_{\rm av}} \tag{4.16}$$

which gives in effect a root-mean-square value of $p_x$. This can be taken to be 
a rough measure of the magnitude of $p_x$. Thus it is often said that $\Delta p_x$ gives a 
measure of the magnitude of the momentum of the particle. As you can see from 
Figure 4.22, this is indeed true.* 


*The relationship between the value of $\Delta p_x$ calculated from Eq. 4.16 and the width of the distribution 
shown in Figure 4.22 depends on the exact shape of the distribution. You should consider the value 
from Eq. 4.16 as a rough order-of-magnitude estimate of the width of the distribution. 
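
The definitions in Eqs. 4.15 and 4.16 can be applied directly to a list of measured values. The sketch below is only an illustration: the sample readings are hypothetical numbers in arbitrary units, and the function name `uncertainty` is an assumption made for the example.

```python
import math


def uncertainty(values):
    """Return sqrt((A^2)_av - (A_av)^2), the definition used in Eq. 4.15."""
    n = len(values)
    mean = sum(values) / n
    mean_sq = sum(v * v for v in values) / n
    # Guard against a tiny negative difference caused by rounding.
    return math.sqrt(max(0.0, mean_sq - mean * mean))


# Hypothetical transverse-momentum readings, symmetric about zero (arbitrary units).
samples = [-2.0, -1.0, 0.0, 1.0, 2.0]

delta_p = uncertainty(samples)
rms = math.sqrt(sum(v * v for v in samples) / len(samples))

print(delta_p, rms)  # equal, because the mean of the samples is zero (Eq. 4.16)
```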


FIGURE 4.22 Results that might be obtained from measuring the number 
of electrons in a given time interval at different locations on the screen of 
Figure 4.21. The distribution is centered around $p_x = 0$ and has a width 
that is characterized by $\Delta p_x$. (Vertical axis: number of particles recorded 
by detector. Horizontal axis: momentum.) 


Example 4.7 





In nuclear beta decay, electrons are observed to be ejected 
from the atomic nucleus. Suppose we assume that electrons 
are somehow trapped within the nucleus, and that 
occasionally one escapes and is observed in the laboratory. 
Take the diameter of a typical nucleus to be $1.0 \times 10^{-14}$ m, 
and use the uncertainty principle to estimate the range of 
kinetic energies that such an electron must have. 


Solution 
If the electron were trapped in a region of width $\Delta x \approx 10^{-14}$ m, 
the corresponding uncertainty in its momentum would be 

$$\Delta p_x \sim \frac{\hbar}{\Delta x} = \frac{1}{c}\,\frac{\hbar c}{\Delta x} = \frac{1}{c}\,\frac{197\ \text{MeV}\cdot\text{fm}}{10\ \text{fm}} = 19.7\ \text{MeV}/c$$

Note the use of $\hbar c = 197$ MeV·fm in this calculation. This 
momentum is clearly in the relativistic regime for electrons, 
so we must use the relativistic formula to find the kinetic 
energy for a particle of momentum 19.7 MeV/c: 

$$K = \sqrt{p^2c^2 + (mc^2)^2} - mc^2 = \sqrt{(19.7\ \text{MeV})^2 + (0.5\ \text{MeV})^2} - 0.5\ \text{MeV} = 19\ \text{MeV}$$

where we have used Eq. 4.16 to relate $\Delta p_x$ to $p_x^2$. This 
result gives the spread of kinetic energies corresponding to 
a spread in momentum of 19.7 MeV/c. 

Electrons emitted from the nucleus in nuclear beta 
decay typically have kinetic energies of about 1 MeV, 
much smaller than the typical spread in energy required by 
the uncertainty principle for electrons confined inside the 
nucleus. This suggests that beta-decay electrons of such low 
energies cannot be confined in a region of the size of the 
nucleus, and that another explanation must be found for the 
electrons observed in nuclear beta decay. (As we discuss in 
Chapter 12, these electrons cannot preexist in the nucleus, 
which would violate the uncertainty principle, but are 
“manufactured” by the nucleus at the instant of the decay.) 
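
The estimate of Example 4.7 is easy to repeat for other confinement widths. The following is a minimal sketch, assuming the same two steps as the example (momentum spread from $\hbar/\Delta x$, then the relativistic kinetic energy); the function name `confined_kinetic_energy` and the rest energy of 0.511 MeV are choices made here for illustration.

```python
import math

HBAR_C = 197.0         # MeV*fm, the value of hbar*c used in Example 4.7
ELECTRON_REST = 0.511  # MeV, electron rest energy (the example rounds this to 0.5)


def confined_kinetic_energy(width_fm, rest_energy_mev):
    """Kinetic energy (MeV) for a momentum spread of hbar/width."""
    pc = HBAR_C / width_fm  # momentum spread times c, in MeV
    return math.sqrt(pc**2 + rest_energy_mev**2) - rest_energy_mev


k_electron = confined_kinetic_energy(10.0, ELECTRON_REST)  # nucleus of width 10 fm
print(f"K is about {k_electron:.1f} MeV")
```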





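Example 4.8 

(a) A charged pi meson has a rest energy of 140 MeV 
and a lifetime of 26 ns. Find the energy uncertainty of the 
pi meson, expressed in MeV and also as a fraction of its 
rest energy. (b) Repeat for the uncharged pi meson, with 
a rest energy of 135 MeV and a lifetime of $8.3 \times 10^{-17}$ s. 
(c) Repeat for the rho meson, with a rest energy of 765 MeV 
and a lifetime of $4.4 \times 10^{-24}$ s. 

Solution 

(a) If the pi meson lives for 26 ns, we have only that much 
time in which to measure its rest energy, and Eq. 4.8 tells us 
that any energy measurement done in a time $\Delta t$ is uncertain 
by an amount of at least 

$$\Delta E = \frac{\hbar}{\Delta t} = \frac{6.58 \times 10^{-16}\ \text{eV}\cdot\text{s}}{26 \times 10^{-9}\ \text{s}} = 2.5 \times 10^{-8}\ \text{eV} = 2.5 \times 10^{-14}\ \text{MeV}$$

$$\frac{\Delta E}{E} = \frac{2.5 \times 10^{-14}\ \text{MeV}}{140\ \text{MeV}} = 1.8 \times 10^{-16}$$

(b) In a similar way, 

$$\Delta E = \frac{\hbar}{\Delta t} = \frac{6.58 \times 10^{-16}\ \text{eV}\cdot\text{s}}{8.3 \times 10^{-17}\ \text{s}} = 7.9\ \text{eV} = 7.9 \times 10^{-6}\ \text{MeV}$$

$$\frac{\Delta E}{E} = \frac{7.9 \times 10^{-6}\ \text{MeV}}{135\ \text{MeV}} = 5.9 \times 10^{-8}$$

(c) For the rho meson, 

$$\Delta E = \frac{\hbar}{\Delta t} = \frac{6.58 \times 10^{-16}\ \text{eV}\cdot\text{s}}{4.4 \times 10^{-24}\ \text{s}} = 1.5 \times 10^{8}\ \text{eV} = 150\ \text{MeV}$$

$$\frac{\Delta E}{E} = \frac{150\ \text{MeV}}{765\ \text{MeV}} = 0.20$$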


In the first case, the uncertainty principle does not give 
a large enough effect to be measured: particle masses 
cannot be measured to a precision of $10^{-16}$ (about $10^{-6}$ 
is the best precision that we can obtain). In the second 
example, the uncertainty principle contributes at about the 
level of $10^{-7}$, which approaches the limit of our measuring 
instruments and therefore might be observable in the laboratory. 
In the third example, we see that the uncertainty 
principle can contribute substantially to the precision of our 
knowledge of the rest energy of the rho meson; measurements 
of its rest energy will show a statistical distribution 
centered about 765 MeV with a spread of 150 MeV, and no 
matter how precise an instrument we use to measure the 
rest energy, we can never reduce that spread. 

The lifetime of a very short-lived particle such as the rho 
meson cannot be measured directly. In practice we reverse 
the procedure of the calculation of this example: we 
measure the rest energy, which gives a distribution similar 
to Figure 4.22, and from the “width” $\Delta E$ of the distribution 
we deduce the lifetime using Eq. 4.8. This procedure is 
discussed in Chapter 14. 
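
The three parts of Example 4.8 differ only in their input numbers, so they can be tabulated in one pass. This is a sketch under the same assumption as the example, $\Delta E = \hbar/\Delta t$ with $\Delta t$ taken as the lifetime; the dictionary layout and the function name `energy_width_mev` are illustrative, not part of the text.

```python
HBAR_EV_S = 6.58e-16  # eV*s, the value of hbar used in Example 4.8

# name: (rest energy in MeV, lifetime in s), as given in the example
mesons = {
    "charged pi": (140.0, 26e-9),
    "neutral pi": (135.0, 8.3e-17),
    "rho": (765.0, 4.4e-24),
}


def energy_width_mev(lifetime_s):
    """Energy uncertainty hbar / lifetime, converted from eV to MeV."""
    return HBAR_EV_S / lifetime_s / 1.0e6


results = {}
for name, (rest_mev, lifetime_s) in mesons.items():
    width = energy_width_mev(lifetime_s)
    results[name] = (width, width / rest_mev)  # (dE in MeV, dE/E)
    print(f"{name:10s}  dE = {width:.2e} MeV   dE/E = {width / rest_mev:.2e}")
```

Running the calculation in the other direction (a measured width divided into $\hbar$) gives the lifetime, which is the procedure the text describes for the rho meson.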





Example 4.9 

Estimate the minimum velocity that would be measured 
for a billiard ball ($m \approx 100$ g) confined to a billiard table 
of dimension 1 m. 

Solution 
For $\Delta x \approx 1$ m, we have 

$$\Delta p_x \sim \frac{\hbar}{\Delta x} = \frac{1.05 \times 10^{-34}\ \text{J}\cdot\text{s}}{1\ \text{m}} = 1 \times 10^{-34}\ \text{kg}\cdot\text{m/s}$$

so 

$$\Delta v_x = \frac{\Delta p_x}{m} = \frac{1 \times 10^{-34}\ \text{kg}\cdot\text{m/s}}{0.1\ \text{kg}} = 1 \times 10^{-33}\ \text{m/s}$$

Thus quantum effects might result in motion of the billiard 
ball with a speed distribution having a spread of about 
$1 \times 10^{-33}$ m/s. At this speed, the ball would move a distance 
of 1% of the diameter of an atomic nucleus in a time equal 
to the age of the universe! Once again, we see that quantum 
effects are not observable with macroscopic objects. 


4.5 WAVE PACKETS 


In Section 4.3, we described measurements of the wavelength or frequency of a 
wave packet, which we consider to be a finite group of oscillations of a wave. 
That is, the wave amplitude is large over a finite region of space or time and is 
very small outside that region. 

Before we begin our discussion, it is necessary to keep in mind that we are 
discussing traveling waves, which we imagine as moving in one direction with a 
uniform speed. (We'll discuss the speed of the wave packet later.) As the wave 
packet moves, individual locations in space will oscillate with the frequency or 
wavelength that characterizes the wave packet. When we show a static picture of 
a wave packet, it doesn't matter that some points within the packet appear to have 
positive displacement, some have negative displacement, and some may even 
have zero displacement. As the wave travels, those locations are in the process 
of oscillating, and our drawings may “freeze” that oscillation. What is important 
is the locations in space where the overall wave packet has a large oscillation 
amplitude and where it has a very small amplitude.* 

*By analogy, think of a radio wave traveling from the station to your receiver. At a particular instant 
of time, some points in space may have instantaneous electromagnetic field values of zero, but that 
doesn't affect your reception of the signal. What is important is the overall amplitude of the traveling 
wave. 

In this section we will examine how to build a wave packet by adding waves 
together. A pure sinusoidal wave is of no use in representing a particle: the wave 
extends from $-\infty$ to $+\infty$, so the particle could be found anywhere. We would like 
the particle to be represented by a wave packet that describes how the particle is 
localized to a particular region of space, such as an atom or a nucleus. 

The key to the process of building a wave packet involves adding together 
waves of different wavelength. We represent our waves as $A \cos kx$, where k is the 
wave number ($k = 2\pi/\lambda$) and A is the amplitude. For example, let's add together 
two waves: 

$$y(x) = A_1 \cos k_1x + A_2 \cos k_2x = A_1 \cos(2\pi x/\lambda_1) + A_2 \cos(2\pi x/\lambda_2) \tag{4.17}$$

This sum is illustrated in Figure 4.23a for the case $A_1 = A_2$ and $\lambda_1 = 9$, $\lambda_2 = 11$. 
This combined wave shows the phenomenon known as beats in the case of sound 
waves. So far we don't have a result that looks anything like the wave packet we 
are after, but you can see that by adding together two different waves we have 
reduced the amplitude of the wave packet at some locations. This pattern repeats 
endlessly from $-\infty$ to $+\infty$, so the particle is still not localized. 

Let's try a more detailed sum. Figure 4.23b shows the result of adding 5 waves 
with wavelengths 9, 9.5, 10, 10.5, 11. Here we have been a bit more successful 
in restricting the amplitude of the wave packet in some regions. By adding even 
more waves with a larger range of wavelengths, we can obtain still narrower 
regions of large amplitude: Figure 4.23c shows the result of adding 9 waves 
with wavelengths 8, 8.5, 9, ..., 12, and Figure 4.23d shows the result of adding 
13 waves of wavelengths 7, 7.5, 8, ..., 13. Unfortunately, all of these patterns 
(including the regions of large amplitude) repeat endlessly from $-\infty$ to $+\infty$, so 
even though we have obtained increasingly large regions where the wave packet 
has small amplitude, we haven't yet created a wave packet that might represent 
a particle localized to a particular region. If these wave packets did represent 
particles, then the particle would not be confined to any finite region. 
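
The sums described above can be evaluated numerically to see both the beats and the endless repetition. The sketch below assumes unit amplitudes for every component and uses a helper named `wave_sum`, which is an illustrative name only; the sample positions were chosen so that the cosines take simple values.

```python
import math


def wave_sum(x, wavelengths, amplitude=1.0):
    """Sum of equal-amplitude cosine waves, the form of Eq. 4.17."""
    return sum(amplitude * math.cos(2 * math.pi * x / lam) for lam in wavelengths)


two_waves = [9.0, 11.0]                     # the case of Figure 4.23a
five_waves = [9.0, 9.5, 10.0, 10.5, 11.0]   # the case of Figure 4.23b

center = wave_sum(0.0, two_waves)       # both maxima aligned: amplitude 2
node = wave_sum(24.75, two_waves)       # both cosines vanish: amplitude 0
next_beat = wave_sum(49.5, two_waves)   # both cosines equal -1: a new beat
repeat = wave_sum(99.0, two_waves)      # whole pattern repeats after 99 units

# The five-wave sum also repeats: 131670 is an integer multiple of every wavelength.
far_repeat = wave_sum(131670.0, five_waves)

print(center, node, next_beat, repeat, far_repeat)
```

However many discrete wavelengths are included, some distance exists at which all components realign, which is why a finite sum cannot localize the particle.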





















































FIGURE 4.23 (a) Adding two waves of wavelengths 9 and 11 gives beats. 
(b) Adding 5 waves with wavelengths ranging from 9 to 11. (c) Adding 9 
waves with wavelengths ranging from 8 to 12. (d) Adding 13 waves with 
wavelengths from 7 to 13. All of the patterns repeat from $-\infty$ to $+\infty$. 


The regions of large amplitude in Figures 4.23b,c,d do show how adding more 
waves of a greater range of wavelengths helps to restrict the size of the wave 
packet. The region of large amplitude in Figure 4.23b ranges from about x = −40 
to +40, while in Figure 4.23c it is from about x = −20 to +20 and in Figure 4.23d 
from about x = −15 to +15. This shows again the inverse relationship between $\Delta x$ 
and $\Delta\lambda$ expected for wave packets given by Eq. 4.4: as the range of wavelengths 
increases from 2 to 4 to 6, the size of the “allowed” regions decreases from about 
80 to 40 to 30. Once again we find that to restrict the size of the wave packet we 
must sacrifice the precise knowledge of the wavelength. 

Note that for all four of these wave patterns, the disturbance seems to have a 
wavelength of about 10, equal to the central wavelength of the range of values 
of the functions we constructed. We can therefore regard these functions as a 
cosine wave with a wavelength of 10 that is shaped or modulated by the other 
cosine waves included in the function. For example, for the case of $A_1 = A_2 = A$, 
Eq. 4.17 can be rewritten after a bit of trigonometric manipulation as 

$$y(x) = 2A \cos\left(\frac{\pi x}{\lambda_1} - \frac{\pi x}{\lambda_2}\right) \cos\left(\frac{\pi x}{\lambda_1} + \frac{\pi x}{\lambda_2}\right) \tag{4.18}$$

If $\lambda_1$ and $\lambda_2$ are close together (that is, if $\Delta\lambda = \lambda_2 - \lambda_1 \ll \lambda_1, \lambda_2$), this can be 
approximated as 

$$y(x) = 2A \cos\left(\frac{\Delta\lambda\,\pi x}{\lambda_{\rm av}^2}\right) \cos\left(\frac{2\pi x}{\lambda_{\rm av}}\right) \tag{4.19}$$

where $\lambda_{\rm av} = (\lambda_1 + \lambda_2)/2 \approx \lambda_1$ or $\lambda_2$. The second cosine term represents a wave 
with a wavelength of 10, and the first cosine term provides the shaping envelope 
that produces the beats. 

Any finite combination of waves with discrete wavelengths will produce 
patterns that repeat between $-\infty$ and $+\infty$, so this method of adding waves will 
not work in constructing a finite wave packet. To construct a wave packet with a 
finite width, we must replace the first cosine term in Eq. 4.19 with a function that 
is large in the region where we want to confine the particle but that falls to zero 
as $x \to \pm\infty$. For example, the simplest function that has this property is 1/x, so 
we might imagine a wave packet whose mathematical form is 

$$y(x) = \frac{2A}{x} \sin\left(\frac{\Delta\lambda\,\pi x}{\lambda_0^2}\right) \cos\left(\frac{2\pi x}{\lambda_0}\right) \tag{4.20}$$

Here $\lambda_0$ represents the central wavelength, replacing $\lambda_{\rm av}$. (In going from Eq. 4.19 
to Eq. 4.20, the cosine modulating term has been changed to a sine; otherwise the 
function would blow up at x = 0.) This function is plotted in Figure 4.24a. It looks 
more like the kind of function we are seeking: it has large amplitude only in a 
small region of space, and the amplitude drops rapidly to zero outside that region. 
Another function that has this property is the Gaussian modulating function: 

$$y(x) = A e^{-2(\Delta\lambda\,\pi x/\lambda_0^2)^2} \cos\left(\frac{2\pi x}{\lambda_0}\right) \tag{4.21}$$
which is shown in Figure 4.24b. 

FIGURE 4.24 (a) A wave packet in which the modulation envelope 
decreases in amplitude like 1/x. (b) A wave packet with a Gaussian 
modulating function. Both curves are drawn for $\lambda_0 = 10$ and $\Delta\lambda = 0.58$, 
which corresponds approximately to Figure 4.23b. 

Both of these functions show the characteristic inverse relationship between an 
arbitrarily defined size of the wave packet $\Delta x$ and the wavelength range parameter 
$\Delta\lambda$ that is used in constructing the wave packet. For example, consider the wave 
packet shown in Figure 4.24a. Let's arbitrarily define the width of the wave packet 
as the distance over which the amplitude of the central region falls by 1/2. That 
occurs roughly where the argument of the sine has the value $\pm\pi/2$, which gives 
$\Delta x\,\Delta\lambda \sim \lambda_0^2$, consistent with our classical uncertainty estimate. 

These wave packets can also be constructed by adding together waves of 
differing amplitude and wavelength, but the wavelengths form a continuous rather 
than a discrete set. It is a bit easier to illustrate this if we work with wave number 
$k = 2\pi/\lambda$ rather than wavelength. So far we have been adding waves in the form 
of $A \cos kx$, so that 

$$y(x) = \sum_i A_i \cos k_i x \tag{4.22}$$

where $k_i = 2\pi/\lambda_i$. The waves plotted in Figure 4.23 represent applications of the 
general formula of Eq. 4.22 carried out over different numbers of discrete waves. 
If we have a continuous set of wave numbers, the sum in Eq. 4.22 becomes an 
integral: 

$$y(x) = \int A(k) \cos kx \, dk \tag{4.23}$$


where the integral is carried out over whatever range of wave numbers is permitted 
(possibly infinite). 

For example, suppose we have a range of wave numbers from $k_0 - \Delta k/2$ to 
$k_0 + \Delta k/2$ that is a continuous distribution of wave numbers of width $\Delta k$ centered 
at $k_0$. If all of the waves have the same amplitude $A_0$, then from Eq. 4.23 the form 
of the wave packet can be shown to be (see Problem 24 at the end of the chapter) 

$$y(x) = \frac{2A_0}{x} \sin\left(\frac{\Delta k}{2}x\right) \cos k_0 x \tag{4.24}$$

This is identical with Eq. 4.20 with $k_0 = 2\pi/\lambda_0$ and $\Delta k = 2\pi\,\Delta\lambda/\lambda_0^2$. This 
relationship between $\Delta k$ and $\Delta\lambda$ follows from a procedure similar to what was used 
to obtain Eq. 4.6. With $k = 2\pi/\lambda$, taking differentials gives $dk = -(2\pi/\lambda^2)\,d\lambda$. 
Replacing the differentials with differences and ignoring the minus sign gives the 
relationship between $\Delta k$ and $\Delta\lambda$. 

A better approximation of the shape of the wave packet can be found by letting 
A(k) vary according to a Gaussian distribution $A(k) = A_0 e^{-(k-k_0)^2/2(\Delta k)^2}$. This 
gives a range of wave numbers that has its largest contribution at the central 
wave number $k_0$ and falls off to zero for larger or smaller wave numbers with a 
characteristic width of $\Delta k$. Applying Eq. 4.23 to this case, with k ranging from 
$-\infty$ to $+\infty$, gives (see Problem 25) 

$$y(x) = A_0\,\Delta k \sqrt{2\pi}\, e^{-(\Delta k\,x)^2/2} \cos k_0 x \tag{4.25}$$

which shows how the form of Eq. 4.21 originates. 

By specifying the distribution of wavelengths, we can construct a wave packet 
of any desired shape. A wave packet that restricts the particle to a region in space 
of width $\Delta x$ will have a distribution of wavelengths characterized by a width $\Delta\lambda$. 
The smaller we try to make $\Delta x$, the larger will be the spread $\Delta\lambda$ of the wavelength 
distribution. The mathematics of this process gives a result that is consistent with 
the uncertainty relationship for classical waves (Eq. 4.4). 
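
The integral of Eq. 4.23 can be checked numerically against the closed forms of Eqs. 4.24 and 4.25. The following is a rough sketch, not a substitute for Problems 24 and 25: it uses a simple midpoint rule, the parameters of Figure 4.24 ($\lambda_0 = 10$, $\Delta\lambda = 0.58$), an arbitrary test point x = 7, and a Gaussian integration range cut off at eight widths. The names `packet_by_integration`, `flat_*`, and `gauss_*` are assumptions made for the illustration.

```python
import math


def packet_by_integration(x, amplitude, k_lo, k_hi, n=20000):
    """Midpoint-rule estimate of Eq. 4.23: integral of A(k) cos(kx) dk."""
    h = (k_hi - k_lo) / n
    total = 0.0
    for i in range(n):
        k = k_lo + (i + 0.5) * h
        total += amplitude(k) * math.cos(k * x)
    return total * h


LAMBDA_0, DELTA_LAMBDA, A0 = 10.0, 0.58, 1.0
K0 = 2 * math.pi / LAMBDA_0                     # central wave number
DK = 2 * math.pi * DELTA_LAMBDA / LAMBDA_0**2   # width in wave number
x = 7.0                                         # arbitrary test position

# Uniform amplitude over [K0 - DK/2, K0 + DK/2], compared with Eq. 4.24.
flat_numeric = packet_by_integration(x, lambda k: A0, K0 - DK / 2, K0 + DK / 2)
flat_closed = (2 * A0 / x) * math.sin(DK * x / 2) * math.cos(K0 * x)


def gaussian_amplitude(k):
    """Gaussian A(k) centered at K0 with width DK."""
    return A0 * math.exp(-((k - K0) ** 2) / (2 * DK**2))


# Gaussian amplitude, compared with Eq. 4.25 (range truncated at 8 widths).
gauss_numeric = packet_by_integration(x, gaussian_amplitude, K0 - 8 * DK, K0 + 8 * DK)
gauss_closed = (A0 * DK * math.sqrt(2 * math.pi)
                * math.exp(-((DK * x) ** 2) / 2) * math.cos(K0 * x))

print(flat_numeric, flat_closed)
print(gauss_numeric, gauss_closed)
```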


4.6 THE MOTION OF A WAVE PACKET 


Let’s consider again the “beats” wave packet represented by Eq. 4.17 and 
illustrated in Figure 4.23a. We now want to turn our “static” waves into traveling 
waves. It will again be more convenient for this discussion to work with the wave 
number k instead of the wavelength. To turn a static wave $y(x) = A \cos kx$ into a 
traveling wave moving in the positive x direction, we replace $kx$ with $kx - \omega t$, so 
that the traveling wave is written as $y(x, t) = A \cos(kx - \omega t)$. (For motion in the 
negative x direction, we would replace $kx$ with $kx + \omega t$.) Here $\omega$ is the circular 
frequency of the wave: $\omega = 2\pi f$. The combined traveling wave then would be 
represented as 


$$y(x, t) = A_1 \cos(k_1x - \omega_1t) + A_2 \cos(k_2x - \omega_2t) \tag{4.26}$$


For any individual wave, the wave speed is related to its frequency and wavelength 
according to $v = \lambda f$. In terms of the wave number and circular frequency, we can 
write this as $v = (2\pi/k)(\omega/2\pi)$, so $v = \omega/k$. This quantity is sometimes called 
the phase speed and represents the speed of one particular phase or component 
of the wave packet. In general, each individual component may have a different 
phase speed. As a result, the shape of the wave packet may change with time. 

For Figure 4.23a, we chose $A_1 = A_2$ and $\lambda_1 = 9$, $\lambda_2 = 11$. Let's choose $v_1 = 6$ 
units/s and $v_2 = 4$ units/s. Figure 4.25 shows the waveform at a time of t = 1 s. In 
that time, wave 1 will have moved a distance of 6 units in the positive x direction 
and wave 2 will have moved a distance of 4 units in the positive x direction. 
However, the combined wave moves a much greater distance in that time: the 
center of the beat that was formerly at x = 0 has moved to x = 15 units. How is 
it possible that the combined waveform moves faster than either of its component 
waves? 




















FIGURE 4.25 The solid line shows the waveform of Figure 4.23a 
at t = 1 s, and the dashed line shows the same waveform at t = 0. 
Note that the peak that was originally at x = 0 has moved to x = 15 
at t = 1 s. 


To produce the peak at x = 0 and t = 0, the two component waves were exactly 
in phase: their two maxima lined up exactly to produce the combined maximum 
of the wave. At x = 15 units and t = 1 s, two individual maxima line up once 
again to produce a combined maximum. They are not the same two maxima that 
lined up to produce the maximum at t = 0, but it happens that two other maxima 
are in phase at x = 15 units and t = 1 s to produce the combined maximum. If we 
were to watch an animation of the wave, we would see the maximum originally 
at x = 0 move gradually to x = 15 between t = 0 and t = 1 s. 

We can understand how this occurs by writing Eq. 4.26 in a form similar to 
Eq. 4.18 using trigonometric identities. The result is (again assuming $A_1 = A_2 = A$, 
as we did in Eq. 4.18): 

$$y(x, t) = 2A \cos\left(\frac{\Delta k}{2}x - \frac{\Delta\omega}{2}t\right) \cos\left(\frac{k_1 + k_2}{2}x - \frac{\omega_1 + \omega_2}{2}t\right) \tag{4.27}$$

As in Eq. 4.18, the second term in Eq. 4.27 represents the rapid variation of the 
wave within the envelope given by the first term. It is the first term that dictates the 
overall shape of the waveform, so it is this term that determines the speed of travel 
of the waveform. For a wave that is written as $\cos(kx - \omega t)$, the speed is $\omega/k$. For 
this wave envelope, the speed is $(\Delta\omega/2)/(\Delta k/2) = \Delta\omega/\Delta k$. This speed is called 
the group speed of the wave packet. As we have seen, the group speed of the 
wave packet can be very different from the phase speed of the component waves. 
For more complicated situations than the two-component “beat” waveform, the 
group speed can be generalized by turning the differences into differentials: 

$$v_{\rm group} = \frac{d\omega}{dk} \tag{4.28}$$

The group speed depends on the relationship between frequency and wavelength 
for the component waves. If the phase speed of all component waves is the same 
and is independent of frequency or wavelength (as, for example, light waves 
in empty space), then the group speed is identical to the phase speed and the 
wave packet keeps its original shape as it travels. In general, the propagation 
of a component wave depends on the properties of the medium, and different 
component waves will travel with different speeds. Light waves in glass or sound 
waves in most solids travel with a speed that varies with frequency or wavelength, 
and so their wave packets change shape as they travel. De Broglie waves in general 
have different phase speeds, so that their wave packets expand as they travel. 


Example 4.10 

Certain ocean waves travel with a phase velocity 
$v_{\rm phase} = \sqrt{g\lambda/2\pi}$, where g is the acceleration due to gravity. What 
is the group velocity of a “wave packet” of these waves? 

Solution 
With $k = 2\pi/\lambda$, we can write the phase velocity as a 
function of k as 

$$v_{\rm phase} = \sqrt{g/k}$$

But with $v_{\rm phase} = \omega/k$, we have $\omega/k = \sqrt{g/k}$, so $\omega = \sqrt{gk}$ 
and Eq. 4.28 gives 

$$v_{\rm group} = \frac{d\omega}{dk} = \frac{d}{dk}\sqrt{gk} = \frac{1}{2}\sqrt{\frac{g}{k}} = \frac{1}{2}\sqrt{\frac{g\lambda}{2\pi}}$$

Note that the group speed of the wave packet increases as 
the wavelength increases. 


The Group Speed of de Broglie Waves 


Suppose we have a localized particle, represented by a group of de Broglie 
waves. For each component wave, the energy of the particle is related to the 
frequency of the de Broglie wave by $E = hf = \hbar\omega$, and so $dE = \hbar\,d\omega$. Similarly, 
the momentum of the particle is related to the wavelength of the de Broglie wave 
by $p = h/\lambda = \hbar k$, so $dp = \hbar\,dk$. The group speed of the de Broglie wave then can 
be expressed as 

$$v_{\rm group} = \frac{d\omega}{dk} = \frac{dE/\hbar}{dp/\hbar} = \frac{dE}{dp} \tag{4.29}$$

For a classical particle having only kinetic energy $E = K = p^2/2m$, we can find 
dE/dp as 

$$\frac{dE}{dp} = \frac{d}{dp}\left(\frac{p^2}{2m}\right) = \frac{p}{m} = v \tag{4.30}$$

which is the velocity of the particle. 
Combining Eqs. 4.29 and 4.30 we obtain an important result: 

$$v_{\rm group} = v_{\rm particle} \tag{4.31}$$


The speed of a particle is equal to the group speed of the corresponding wave 
packet. The wave packet and the particle move together—wherever the particle 
goes, its de Broglie wave packet moves along with it like a shadow. If we do a 
wave-type experiment on the particle, the de Broglie wave packet is always there 
to reveal the wave behavior of the particle. A particle can never escape its wave 
nature! 
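
The group-speed results of this section can be verified numerically. The sketch below covers three cases: the two-wave beat of Figure 4.25, the ocean waves of Example 4.10, and a nonrelativistic de Broglie wave. It assumes g = 9.8 m/s², an arbitrary ocean wavelength of 100 m, and the electron momentum from Example 4.5; the helper `derivative` is a plain central difference introduced only for this illustration.

```python
import math


def derivative(f, x, h):
    """Central-difference estimate of df/dx at x with step h."""
    return (f(x + h) - f(x - h)) / (2 * h)


# (1) Two-wave beat: wavelengths 9 and 11, phase speeds 6 and 4 units/s.
k1, k2 = 2 * math.pi / 9, 2 * math.pi / 11
w1, w2 = 6 * k1, 4 * k2                       # omega = (phase speed) * k
beat_group_speed = (w1 - w2) / (k1 - k2)      # delta omega / delta k

# (2) Ocean waves of Example 4.10: omega = sqrt(g k).
G = 9.8                                       # m/s^2 (assumed value)
k_water = 2 * math.pi / 100.0                 # 100 m wavelength (assumed)
water_group = derivative(lambda k: math.sqrt(G * k), k_water, 1e-6)
water_phase = math.sqrt(G / k_water)

# (3) de Broglie wave of a nonrelativistic electron: E = p^2 / 2m.
M_E = 9.11e-31                                # kg
p = 3.3e-24                                   # kg*m/s, as in Example 4.5
particle_group = derivative(lambda q: q * q / (2 * M_E), p, 1e-6 * p)

print(beat_group_speed)            # the beat envelope speed
print(water_group / water_phase)   # ratio of group to phase speed
print(particle_group, p / M_E)     # group speed versus particle speed
```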


The Spreading of a Moving Wave Packet 


Suppose we have a wave packet that represents a confined particle at t = 0. For 
example, the particle might have passed through a single-slit apparatus. Its initial 
uncertainty in position is $\Delta x_0$ and its initial uncertainty in momentum is $\Delta p_{x0}$. 
The wave packet moves in the x direction with velocity $v_x$, but that velocity is 
not precisely known: the uncertainty in its momentum gives a corresponding 
uncertainty in velocity, $\Delta v_{x0} = \Delta p_{x0}/m$. Because there is an uncertainty in the 
velocity of the wave packet, we can't be sure where it will be located at time 
t. That is, its location at time t is $x = v_x t$, with velocity $v_x = v_{x0} \pm \Delta v_{x0}$. Thus 
there are two contributions to the uncertainty in its location at time t: the initial 
uncertainty $\Delta x_0$ and an additional amount equal to $\Delta v_{x0}\,t$ that represents the 
spreading of the wave packet. We'll assume that these two contributions add 
quadratically, like experimental uncertainties, so that the total uncertainty in the 
location of the particle is 

$$\Delta x = \sqrt{(\Delta x_0)^2 + (\Delta v_{x0}\,t)^2} = \sqrt{(\Delta x_0)^2 + (\Delta p_{x0}\,t/m)^2} \tag{4.32}$$

Taking $\Delta p_{x0} = \hbar/\Delta x_0$ according to the uncertainty principle, we have 

$$\Delta x = \sqrt{(\Delta x_0)^2 + (\hbar t/m\,\Delta x_0)^2} \tag{4.33}$$

If we try to make the wave packet very small at t = 0 ($\Delta x_0$ is small), then the second 
term under the square root makes the wave packet expand rapidly, because $\Delta x_0$ 
appears in the denominator of that term. The more successful we are at confining 
a wave packet, the more quickly it spreads. This reminds us of the single-slit 
experiment discussed in Section 4.4: the narrower we make the slit, the more the 
waves diverge after passing through the slit. Figure 4.26 shows how the size of the 
wave packet expands with time for two different initial sizes, and you can see that 
the smaller initial wave packet grows more rapidly than the larger initial packet. 
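
Equation 4.33 can be evaluated directly to compare two initial packet sizes. This is a minimal sketch for an electron; the initial widths (0.1 nm and 1 nm) and the elapsed time (1 fs) are hypothetical values chosen for illustration, as is the function name `packet_width`.

```python
import math

HBAR = 1.055e-34  # J*s (rounded)
M_E = 9.11e-31    # kg, electron mass


def packet_width(dx0, t, mass=M_E):
    """Width from Eq. 4.33: sqrt(dx0^2 + (hbar t / (m dx0))^2)."""
    return math.sqrt(dx0**2 + (HBAR * t / (mass * dx0)) ** 2)


narrow0, wide0 = 1.0e-10, 1.0e-9   # hypothetical initial widths, in m
t = 1.0e-15                        # hypothetical elapsed time, in s

narrow_t = packet_width(narrow0, t)
wide_t = packet_width(wide0, t)

print(f"started at {narrow0:.1e} m, now {narrow_t:.2e} m")
print(f"started at {wide0:.1e} m, now {wide_t:.2e} m")
```

With these numbers the packet that started ten times narrower has already become the wider of the two, which is the behavior described in the text.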


FIGURE 4.26 The smaller the initial wave packet, the more quickly it 
grows. (Horizontal axis: t.) 


4.7 PROBABILITY AND RANDOMNESS 


Any single measurement of the position or momentum of a particle can be made 
with as much precision as our experimental skill permits. How then does the 
wavelike behavior of a particle become observable? How does the uncertainty in 
position or momentum affect our experiment? 

Suppose we prepare an atom by attaching an electron to a nucleus. (For 
this example we regard the nucleus as being fixed in space.) Some time after 
preparing our atom, we measure the position of the electron. We then repeat the 
procedure, preparing the atom in an identical way, and find that a remeasurement 
of the position of the electron yields a value different from that found in our 
first measurement. In fact, each time we repeat the measurement, we may obtain 
a different outcome. If we repeat the measurement a large number of times, we 
find ourselves led to a conclusion that runs counter to a basic notion of classical 
physics—systems that are prepared in identical ways do not show identical 
subsequent behavior. What hope do we then have of constructing a mathematical 
theory that has any usefulness at all in predicting the outcome of a measurement, 
if that outcome is completely random? 

The solution to this dilemma lies in the consideration of the probability of 
obtaining any given result from an experiment whose possible results are subject 
to the laws of statistics. We cannot predict the outcome of a single flip of a 
coin or roll of the dice, because any single result is as likely as any other single 
result. We can, however, predict the distribution of a large number of individual 
measurements. For example, on a single flip of a coin, we cannot predict whether 
the outcome will be “heads” or “tails”; the two are equally likely. If we make a 
large number of trials, we expect that approximately 50% will turn up “heads” and 
50% will yield “tails”; even though we cannot predict the result of any single toss 
of the coin, we can predict reasonably well the result of a large number of tosses. 
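
The coin-toss argument can be illustrated with a simulated coin. The sketch below uses Python's pseudorandom generator as a stand-in for a fair coin, which is an assumption of the example rather than a statement about real coins; the function name `heads_fraction` and the fixed seed are illustrative.

```python
import random


def heads_fraction(n_flips, seed=0):
    """Fraction of simulated fair-coin flips that come up heads."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    heads = sum(rng.random() < 0.5 for _ in range(n_flips))
    return heads / n_flips


for n in (10, 1000, 100000):
    print(n, heads_fraction(n))

large = heads_fraction(100000)
```

No single flip is predictable, but the fraction of heads settles toward one half as the number of flips grows.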

Our study of systems governed by the laws of quantum physics leads us to a 
similar situation. We cannot predict the outcome of any single measurement of 
the position of the electron in the atom we prepared, but if we do a large number 
of measurements, we ought to find a statistical distribution of results. We cannot 
develop a mathematical theory that predicts the result of a single measurement, but 
we do have a mathematical theory that predicts the statistical behavior of a system 
(or of a large number of identical systems). The quantum theory provides this 
mathematical procedure, which enables us to calculate the average or probable 
outcome of measurements and the distribution of individual outcomes about the 
average. This is not such a disadvantage as it may seem, for in the realm of 
quantum physics, we seldom do measurements with, for example, a single atom. 
If we were studying the emission of light by a radiant system or the properties 
of a solid or the scattering of nuclear particles, we would be dealing with a large 
number of atoms, and so our concept of statistical averages is very useful. 

In fact, such concepts are not as far removed from our daily lives as we might 
think. For example, what is meant when the TV weather forecaster “predicts” 
a 50% chance of rain tomorrow? Will it rain 50% of the time, or over 50% 
of the city? The proper interpretation of the forecast is that the existing set of 
atmospheric conditions will, in a large number of similar cases, result in rain in 
about half the cases. A surgeon who asserts that a patient has a 50% chance of 
surviving an operation means exactly the same thing—experience with a large 
number of similar cases suggests recovery in about half. 

Quantum mechanics uses similar language. For example, if we say that the 
electron in a hydrogen atom has a 50% probability of circulating in a clockwise 
direction, we mean that in observing a large collection of similarly prepared atoms 
we find 50% to be circulating clockwise. Of course a single measurement shows 
either clockwise or counterclockwise circulation. (Similarly, it either rains or it 
doesn’t; the patient either lives or dies.) 

Of course, one could argue that the flip of a coin or the roll of the dice is not 
a random process, but that the apparently random nature of the outcome simply 
reflects our lack of knowledge of the state of the system. For example, if we knew 
exactly how the dice were thrown (magnitude and direction of initial velocity, 
initial orientation, rotational speed) and precisely what the laws are that govern 
their bouncing on the table, we should be able to predict exactly how they would 
land. (Similarly, if we knew a great deal more about atmospheric physics or 
physiology, we could predict with certainty whether or not it will rain tomorrow 
or an individual patient will survive.) When we instead analyze the outcomes in 
terms of probabilities, we are really admitting our inability to do the analysis 
exactly. There is a school of thought that asserts that the same situation exists in 
quantum physics. According to this interpretation, we could predict exactly the 
behavior of the electron in our atom if only we knew the nature of a set of so-called 
“hidden variables” that determine its motion. However, experimental evidence 
disagrees with this theory, and so we must conclude that the random behavior of 
a system governed by the laws of quantum physics is a fundamental aspect of 
nature and not a result of our limited knowledge of the properties of the system. 


The Probability Amplitude 


What does the amplitude of the de Broglie wave represent? In any wave 
phenomenon, a physical quantity such as displacement or pressure varies with 
location and time. What is the physical property that varies as the de Broglie wave 
propagates? 

A localized particle is represented by a wave packet. If a particle is confined 
to a region of space of dimension Δx, its wave packet has large amplitude only in 
a region of space of dimension Δx and has small amplitude elsewhere. That is, 
the amplitude is large where the particle is likely to be found and small where the 
particle is less likely to be found. The probability of finding the particle at any 
point depends on the amplitude of its de Broglie wave at that point. In analogy 
with classical physics, in which the intensity of any wave is proportional to the 
square of its amplitude, we have 


probability to observe particles ∝ |de Broglie wave amplitude|² 

Compare this with the similar relationship for photons discussed in Section 3.6: 

probability to observe photons ∝ |electric field amplitude|² 


Just as the electric field amplitude of an electromagnetic wave indicates regions 
of high and low probability for observing photons, the de Broglie wave performs 
the same function for particles. Figure 4.27 illustrates this effect, as individual 
electrons in a double-slit type of experiment eventually produce the characteristic 
interference fringes. The path of each electron is guided by its de Broglie wave 
toward the allowed regions of high probability. This statistical effect is not 
apparent for a small number of electrons, but it becomes quite apparent when a 
large number of electrons has been detected. 

FIGURE 4.27 The buildup of an electron interference pattern as increasing numbers of electrons are detected: (a) 100 
electrons; (b) 3000 electrons; (c) 70,000 electrons. (Reprinted with permission from Akira Tonomura, Hitachi, Ltd, 
T. Matsuda and T. Kawasaki, Advanced Research Laboratory. From American Journal of Physics 57, 117. (Copyright 
1989) American Association of Physics Teachers.) 
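The gradual buildup of fringes from individual detections can be imitated numerically. The sketch below assumes an idealized pattern in which the detection probability is proportional to cos²(πx/s), with s a hypothetical fringe spacing. It ignores the single-slit envelope and is not a model of the actual apparatus of Figure 4.27. The function names are illustrative only.

```python
import math
import random

def sample_positions(n_electrons, fringe_spacing=1.0, half_width=3.0, seed=1):
    """Draw detection positions with probability density proportional to
    cos^2(pi * x / fringe_spacing) on [-half_width, half_width].

    Rejection sampling: propose a uniform position, then accept it with
    probability equal to the squared (normalized) wave amplitude there.
    """
    rng = random.Random(seed)
    hits = []
    while len(hits) < n_electrons:
        x = rng.uniform(-half_width, half_width)
        if rng.random() < math.cos(math.pi * x / fringe_spacing) ** 2:
            hits.append(x)
    return hits

def bright_fraction(hits, fringe_spacing=1.0):
    """Fraction of hits within a quarter spacing of a bright-fringe center."""
    near_max = 0
    for x in hits:
        phase = (x / fringe_spacing) % 1.0  # 0 at a maximum, 0.5 at a minimum
        if phase < 0.25 or phase > 0.75:
            near_max += 1
    return near_max / len(hits)

if __name__ == "__main__":
    for n in (100, 3000, 70000):
        print(n, bright_fraction(sample_positions(n)))
```

Each simulated electron arrives at a single point. For the assumed cos² pattern the fraction landing in the bright half of each fringe tends to 1/2 + 1/π (about 0.82) as the number of electrons grows, while for a hundred electrons the fringes are barely distinguishable from scatter.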

In the next chapter we discuss the mathematical framework for computing the 
wave amplitudes for a particle in various situations, and we also develop a more 
rigorous mathematical definition of the probability. 


Chapter Summary 

Questions 

1. When an electron moves with a certain de Broglie wavelength, does any 
aspect of the electron’s motion vary with that wavelength? 

2. Imagine a different world in which the laws of quantum physics still 
apply, but which has h = 1 J·s. What might be some of the difficulties of 
life in such a world? (See Mr. Tompkins in Paperback by George Gamow for 
an imaginary account of such a world.) 

3. Suppose we try to measure an unknown frequency f by listening for beats 
between f and a known (and controllable) frequency f′. (We assume f′ is 
known to arbitrarily small uncertainty.) The beat frequency is |f′ − f|. 
If we hear no beats, then we conclude that f = f′. (a) How long must we 
listen to hear “no” beats? (b) If we hear no beats in one second, how 
accurately have we determined f? (c) If we hear no beats in 10 s, how 
accurately? In 100 s? (d) How is this experiment related to Eq. 4.7? 

4. What difficulties does the uncertainty principle cause in trying to 
pick up an electron with a pair of forceps? 

5. Does the uncertainty principle apply to nature itself or only to the 
results of experiments? That is, is it the position and momentum that are 
really uncertain, or merely our knowledge of them? What is the difference 
between these two interpretations? 

6. The uncertainty principle states in effect that the more we try to 
confine an object, the faster we are likely to find it moving. Is this why 
you can’t seem to keep money in your pocket for long? Make a numerical 
estimate. 

7. Consider a collection of gas molecules trapped in a container. As we 
move the walls of the container closer together (compressing the gas) the 
molecules move faster (the temperature increases). Does the gas behave 
this way because of the uncertainty principle? Justify your answer with 
some numerical estimates. 

8. Many nuclei are unstable and undergo radioactive decay to other nuclei. 
The lifetimes for these decays are typically of the order of days to 
years. Do you expect that the uncertainty principle will cause a 
measurable effect in the precision to which we can measure the masses of 
atoms of these nuclei? 

9. Just as the classical limit of relativity can be achieved by letting 
c → ∞, the classical limit of quantum behavior is achieved by letting 
h → 0. Consider the following in the h → 0 limit and explain how they 
behave classically: the size of the energy quantum of an electromagnetic 
wave, the de Broglie wavelength of an electron, the Heisenberg uncertainty 
relationships. 

10. Assume the electron beam in a television tube is accelerated through a 
potential difference of 25 kV and then passes through a deflecting 
capacitor of interior width 1 cm. Are diffraction effects important in 
this case? Justify your answer with a calculation. 

11. The structure of crystals can be revealed by X-ray diffraction 
(Figures 3.7 and 3.8), electron diffraction (Figure 4.2), and neutron 
diffraction (Figure 4.7). In what ways do these experiments reveal similar 
structure? In what ways are they different? 

12. Often it happens in physics that great discoveries are made 
inadvertently. What would have happened if Davisson and Germer had their 
accelerating voltage set below 32 V? 

13. Suppose we cover one slit in the two-slit electron experiment with a 
very thin sheet of fluorescent material that emits a photon of light 
whenever an electron passes through. We then fire electrons one at a time 
at the double slit; whether or not we see a flash of light tells us which 
slit the electron went through. What effect does this have on the 
interference pattern? Why? 

14. In another attempt to determine through which slit the electron 
passes, we suspend the double slit itself from a very fine spring balance 
and measure the “recoil” momentum of the slit as a result of the passage 
of the electron. Electrons that strike the screen near the center must 
cause recoils in opposite directions depending on which slit they pass 
through. Sketch such an apparatus and describe its effect on the 
interference pattern. (Hint: Consider the uncertainty principle 
Δx Δp_x ~ h as applied to the motion of the slits suspended from the 
spring. How precisely do we know the position of the slit?) 

15. Is it possible for v_phase to be greater than c? Can v_group be 
greater than c? 

16. In a nondispersive medium, v_group = v_phase; this is another way of 
saying that all waves travel with the same phase velocity, no matter what 
their wavelengths. Is this true for (a) de Broglie waves? (b) Light waves 
in glass? (c) Light waves in vacuum? (d) Sound waves in air? What 
difficulties would be encountered in attempting communication (by speech 
or by radio signals for example) in a strongly dispersive medium? 


4.1 De Broglie’s Hypothesis 

1. Find the de Broglie wavelength of (a) a 5-MeV proton; (b) a 50-GeV 
electron; (c) an electron moving at v = 1.00 × 10° m/s. 

2. The neutrons produced in a reactor are known as thermal neutrons, 
because their kinetic energies have been reduced (by collisions) until 
K = (3/2)kT, where T is room temperature (293 K). (a) What is the kinetic 
energy of such neutrons? (b) What is their de Broglie wavelength? Because 
this wavelength is of the same order as the lattice spacing of the atoms 
of a solid, neutron diffraction (like X-ray and electron diffraction) is a 
useful means of studying solid lattices. 

3. By doing a nuclear diffraction experiment, you measure the de Broglie 
wavelength of a proton to be 9.16 fm. (a) What is the speed of the proton? 
(b) Through what potential difference must it be accelerated to achieve 
that speed? 

4. A proton is accelerated from rest through a potential difference of 
−2.36 × 10° V. What is its de Broglie wavelength? 


4.2 Experimental Evidence for de Broglie Waves 


5. Find the potential difference through which electrons must be 
accelerated (as in an electron microscope, for example) if we wish to 
resolve: (a) a virus of diameter 12 nm; (b) an atom of diameter 0.12 nm; 
(c) a proton of diameter 1.2 fm. 

6. In an electron microscope we wish to study particles of diameter about 
0.10 μm (about 1000 times the size of a single atom). (a) What should be 
the de Broglie wavelength of the electrons? (b) Through what potential 
difference should the electrons be accelerated to have that de Broglie 
wavelength? 

7. In order to study the atomic nucleus, we would like to observe the 
diffraction of particles whose de Broglie wavelength is about the same 
size as the nuclear diameter, about 14 fm for a heavy nucleus such as 
lead. What kinetic energy should we use if the diffracted particles are 
(a) electrons? (b) Neutrons? (c) Alpha particles (m = 4 u)? 

8. In the double-slit interference pattern for helium atoms (Figure 4.13), 
the kinetic energy of the beam of atoms was 0.020 eV. (a) What is the de 
Broglie wavelength of a helium atom with this kinetic energy? (b) Estimate 
the de Broglie wavelength of the atoms from the fringe spacing in 
Figure 4.13, and compare your estimate with the value obtained in part 
(a). The distance from the double slit to the scanning slit is 64 cm. 

9. Suppose we wish to do a double-slit experiment with a beam of the smoke 
particles of Example 4.1c. Assume we can construct a double slit whose 
separation is about the same size as the particles. Estimate the 
separation between the fringes if the double slit and the screen were on 
opposite coasts of the United States. 

10. In the Davisson-Germer experiment using a Ni crystal, a second-order 
beam is observed at an angle of 55°. For what accelerating voltage does 
this occur? 

11. A certain crystal is cut so that the rows of atoms on its surface are 
separated by a distance of 0.352 nm. A beam of electrons is accelerated 
through a potential difference of 175 V and is incident normally on the 
surface. If all possible diffraction orders could be observed, at what 
angles (relative to the incident beam) would the diffracted beams be 
found? 


4.3 Uncertainty Relationships for Classical Waves 


12. Suppose a traveling wave has a speed v (where v = λf). Instead of 
measuring waves over a distance Δx, we stay in one place and count the 
number of wave crests that pass in a time Δt. Show that Eq. 4.7 is 
equivalent to Eq. 4.4 for this case. 

13. Sound waves travel through air at a speed of 330 m/s. A whistle blast 
at a frequency of about 1.0 kHz lasts for 2.0 s. (a) Over what distance in 
space does the “wave train” representing the sound extend? (b) What is the 
wavelength of the sound? (c) Estimate the precision with which an observer 
could measure the wavelength. (d) Estimate the precision with which an 
observer could measure the frequency. 

14. A stone tossed into a body of water creates a disturbance at the point 
of impact that lasts for 4.0 s. The wave speed is 25 cm/s. (a) Over what 
distance on the surface of the water does the group of waves extend? 
(b) An observer counts 12 wave crests in the group. Estimate the precision 
with which the wavelength can be determined. 

15. A radar transmitter emits a pulse of electromagnetic radiation with 
wavelength 0.225 m. The pulses have a duration of 1.17 μs. The receiver is 
set to accept a range of frequencies about the central frequency. To what 
range of frequencies should the receiver be set? 

16. Estimate the signal processing time that would be necessary if you 
want to design a device to measure frequencies to a precision of no worse 
than 10,000 Hz. 


4.4 Heisenberg Uncertainty Relationships 


17. The speed of an electron is measured to within an uncertainty of 
2.0 × 10⁴ m/s. What is the size of the smallest region of space in which 
the electron can be confined? 

18. An electron is confined to a region of space of the size of an atom 
(0.1 nm). (a) What is the uncertainty in the momentum of the electron? 
(b) What is the kinetic energy of an electron with a momentum equal to Δp? 
(c) Does this give a reasonable value for the kinetic energy of an 
electron in an atom? 

19. The Σ* particle has a rest energy of 1385 MeV and a lifetime of 
2.0 × 10⁻²³ s. What would be a typical range of outcomes of measurements 
of the Σ* rest energy? 

20. A pi meson (pion) and a proton can briefly join together to form a Δ 
particle. A measurement of the energy of the πp system (Figure 4.28) shows 
a peak at 1236 MeV, corresponding to the rest energy of the Δ particle, 
with an experimental spread of 120 MeV. What is the lifetime of the Δ? 

FIGURE 4.28 Problem 20. 

21. A nucleus emits a gamma ray of energy 1.0 MeV from a state that has a 
lifetime of 1.2 ns. What is the uncertainty in the energy of the gamma 
ray? The best gamma-ray detectors can measure gamma-ray energies to a 
precision of no better than a few eV. Will this uncertainty be directly 
measurable? 

22. In special conditions (see Section 12.9), it is possible to measure 
the energy of a gamma-ray photon to 1 part in 10¹⁵. For a photon energy of 
50 keV, estimate the maximum lifetime that could be determined by a direct 
measurement of the spread of the photon energy. 

23. Alpha particles are emitted in nuclear decay processes with typical 
energies of 5 MeV. In analogy with Example 4.7, deduce whether the alpha 
particle can exist inside the nucleus. 


4.5 Wave Packets 


24. Use a distribution of wave numbers of constant amplitude in a range Δk 
about k₀: 

    A(k) = A₀    for k₀ − Δk/2 ≤ k ≤ k₀ + Δk/2 
         = 0     otherwise 

and obtain Eq. 4.24 from Eq. 4.23. 

25. Use the distribution of wave numbers A(k) = A₀ exp[−(k − k₀)²/2(Δk)²] 
for k = −∞ to +∞ to derive Eq. 4.25. 

26. Do the trigonometric manipulation necessary to obtain Eq. 4.18. 


4.6 The Motion of a Wave Packet 

27. Show that the data used in Figure 4.25 are consistent with Eq. 4.27; 
that is, use λ₁ = 9 and λ₂ = 11, v₁ = 6 and v₂ = 4 to show that 
v_group = 15. 

28. (a) Show that the group velocity and phase velocity are related by: 

    v_group = v_phase − λ (dv_phase/dλ) 

(b) When white light travels through glass, the phase velocity of each 
wavelength depends on the wavelength. (This is the origin of dispersion 
and the breaking up of white light into its component colors: different 
wavelengths travel at different speeds and have different indices of 
refraction.) How does v_phase depend on λ? Is dv_phase/dλ positive or 
negative? Therefore, is v_group > v_phase or < v_phase? 

29. Certain surface waves in a fluid travel with phase velocity √(b/λ), 
where b is a constant. Find the group velocity of a packet of surface 
waves, in terms of the phase velocity. 

30. By a calculation similar to that of Eq. 4.30, show that dE/dp = v 
remains valid when E represents the relativistic kinetic energy of the 
particle. 


General Problems 


31. A free electron bounces elastically back and forth in one dimension 
between two walls that are L = 0.50 nm apart. (a) Assuming that the 
electron is represented by a de Broglie standing wave with a node at each 
wall, show that the permitted de Broglie wavelengths are λₙ = 2L/n 
(n = 1, 2, 3, ...). (b) Find the values of the kinetic energy of the 
electron for n = 1, 2, and 3. 


32. A beam of thermal neutrons (see Problem 2) emerges from a nuclear 
reactor and is incident on a crystal as shown in Figure 4.29. The beam is 
Bragg scattered, as in Figure 3.5, from a crystal whose scattering planes 
are separated by 0.247 nm. From the continuous energy spectrum of the beam 
we wish to select neutrons of energy 0.0105 eV. Find the Bragg-scattering 
angle that results in a scattered beam of this energy. Will other energies 
also be present in the scattered beam at that angle? 

FIGURE 4.29 Problem 32. 

33. (a) Find the de Broglie wavelength of a nitrogen molecule in air at 
room temperature (293 K). (b) The density of air at room temperature and 
atmospheric pressure is 1.292 kg/m³. Find the average distance between air 
molecules at this temperature and compare with the de Broglie wavelength. 
What do you conclude about the importance of quantum effects in air at 
room temperature? (c) Estimate the temperature at which quantum effects 
might become important. 

34. In designing an experiment, you want a beam of photons and a beam of 
electrons with the same wavelength of 0.281 nm, equal to the separation of 
the Na and Cl ions in a crystal of NaCl. Find the energy of the photons 
and the kinetic energy of the electrons. 

35. A nucleus of helium with mass 5 u breaks up from rest into a nucleus 
of ordinary helium (mass = 4 u) plus a neutron (mass = 1 u). The rest 
energy liberated in the break-up is 0.89 MeV, which is shared (not 
equally) by the products. (a) Using energy and momentum conservation, find 
the kinetic energy of the neutron. (b) The lifetime of the original 
nucleus is 1.0 × 10⁻²¹ s. What range of values of the neutron kinetic 
energy might we measure in the laboratory as a result of the uncertainty 
relationship? 

36. In a metal, the conduction electrons are not attached to any one atom, 
but are relatively free to move throughout the entire metal. Consider a 
cube of copper measuring 1.0 cm on each edge. (a) What is the uncertainty 
in any one component of the momentum of an electron confined to the metal? 
(b) Estimate the average kinetic energy of an electron in the metal. 
(Assume Δp = [(Δp_x)² + (Δp_y)² + (Δp_z)²]^(1/2).) (c) Assuming the heat 
capacity of copper to be 24.5 J/mole·K, would the contribution of this 
motion to the internal energy of the copper be important at room 
temperature? What do you conclude from this? (See also Problem 38.) 

37. A proton or a neutron can sometimes “violate” conservation of energy 
by emitting and then reabsorbing a pi meson, which has a mass of 
135 MeV/c². This is possible as long as the pi meson is reabsorbed within 
a short enough time Δt consistent with the uncertainty principle. 
(a) Consider p → p + π. By what amount ΔE is energy conservation violated? 
(Ignore any kinetic energies.) (b) For how long a time Δt can the pi meson 
exist? (c) Assuming the pi meson to travel at very nearly the speed of 
light, how far from the proton can it go? (This procedure, as we discuss 
in Chapter 12, gives us an estimate of the range of the nuclear force, 
because protons and neutrons are held together in the nucleus by 
exchanging pi mesons.) 

38. In a crystal, the atoms are a distance L apart; that is, each atom 
must be localized to within a distance of at most L. (a) What is the 
minimum uncertainty in the momentum of the atoms of a solid that are 
0.20 nm apart? (b) What is the average kinetic energy of such an atom of 
mass 65 u? (c) What would a collection of such atoms contribute to the 
internal energy of a typical solid, such as copper? Is this contribution 
important at room temperature? (See also Problem 36.) 

39. An apparatus is used to prepare an atomic beam by heating a collection 
of atoms to a temperature T and allowing the beam to emerge through a hole 
of diameter d in one side of the oven. The beam then travels through a 
straight path of length L. Show that the uncertainty principle causes the 
diameter of the beam at the end of the path to be larger than d by an 
amount of order Lħ/(d√(3mkT)), where m is the mass of an atom. Make a 
numerical estimate for typical values of T = 1500 K, m = 7 u (lithium 
atoms), d = 3 mm, L = 2 m. 


Quantum mechanics provides a mathematical framework in which the description of a 
process often includes different and possibly contradictory outcomes. A favorite illustration 
of that situation is the case of Schrödinger’s cat. The cat is confined in a chamber with a 
radioactive atom, the decay of which will trigger the release of poison from a vial. Because 
we do not know exactly when the decay will occur, until an observation of the condition of 
the cat is made the quantum-mechanical description of the cat must include both “cat alive” 
and “cat dead” components. 

FIGURE 5.1 (a) A light wave in air is incident on a slab of glass, showing 
transmitted and reflected waves at the two boundaries (A and B). (b) A 
surface wave in water incident on a region of smaller depth similarly has 
transmitted and reflected waves. (c) The de Broglie waves of electrons 
moving from a region of constant zero potential to a region of constant 
negative potential V₀ also have transmitted and reflected components. 


The future behavior of a particle in a classical (nonrelativistic, nonquantum) 
situation may be predicted with absolute certainty using Newton’s laws. If a 
particle interacts with its environment through a known force F (which might 
be associated with a potential energy U), we can do the mathematics necessary 
to solve Newton’s second law, F = dp/dt (a second-order, linear differential 
equation), and find the particle’s location r(t) and velocity v(t) at all future times 
t. The mathematics may be difficult, and in fact it may not be possible to solve the 
equations in closed form (in which case an approximate solution can be obtained 
with the help of a computer). Aside from any such mathematical difficulties, the 
physics of the problem consists of writing down the original equation F = dp/dt 
and interpreting its solutions r(t) and v(t). For example, a satellite or planet 
moving under the influence of a 1/r² gravitational force can be shown, after the 
equations have been solved, to follow exactly an elliptical path. 

In the case of nonrelativistic quantum physics, the basic equation to be solved 
is a second-order differential equation known as the Schrödinger equation. Like 
Newton’s laws, the Schrödinger equation is written for a particle interacting with 
its environment, although we describe the interaction in terms of the potential 
energy rather than the force. Unlike Newton’s laws, the Schrödinger equation 
does not give the trajectory of the particle; instead, its solution gives the wave 
function of the particle, which carries information about the particle’s wavelike 
behavior. In this chapter we introduce the Schrödinger equation, obtain some of its 
solutions for certain potential energies, and learn how to interpret those solutions. 


5.1 BEHAVIOR OF A WAVE AT A BOUNDARY 


In studying wave motion, we often must analyze what occurs when a wave 
moves from one region or medium to a different region or medium in which the 
properties of the wave may change. For example, when a light wave moves from 
air into glass, its wavelength and the amplitude of its electric field both decrease. 
At every such boundary, a portion of the incident wave intensity is transmitted 
into the second medium and a portion is reflected back into the first medium. 

Let’s consider the case of a light wave incident on a glass plate, as in 
Figure 5.1a. At boundary A, the light wave moves from air (region 1) into glass 
(region 2), while at B the light wave moves from glass into air (region 3). The 
wavelength in air in region 3 is the same as the original wavelength of the incident 
wave in region 1, but the amplitude in region 3 is less than the amplitude in 
region 1, because some of the intensity is reflected at A and at B. 

Other types of waves show similar behavior. For example, Figure 5.1b shows 
a surface water wave that moves into a region of shallower depth. In that region, 
its wavelength is smaller (but its amplitude is larger) compared with the original 
incident wave. When the wave enters region 3, in which the depth is the same as 
in region 1, the wavelength returns to its original value, but the amplitude of the 
wave is smaller in region 3 than in region 1 because some of the intensity was 
reflected at the two boundaries. 

The same type of behavior occurs for de Broglie waves that characterize 
particles. Consider, for example, the apparatus shown in Figure 5.1c. Electrons 
are incident from the left and move inside a narrow metal tube that is at ground 
potential (V = 0). Another narrow tube in region 2 is connected to the negative 
terminal of a battery, which maintains it at a uniform potential of −V₀. Region 3 
is connected to region 1 at ground potential. The gaps between the tubes can in 
principle be made so small that we can regard the changes in potential at A and B as 
occurring suddenly. In region 1, the electrons have kinetic energy K, momentum 
p = √(2mK), and de Broglie wavelength λ = h/p. In region 2, the potential energy 
for the electrons is U = qV = (−e)(−V₀) = +eV₀. We assume that the original 
kinetic energy of the electrons in region 1 is greater than eV₀, so that the electrons 
move into region 2 with a smaller kinetic energy (equal to K − eV₀), a smaller 
momentum, and thus a greater wavelength. When the electrons move from region 
2 into region 3, they gain back the lost kinetic energy and move with their original 
kinetic energy K and thus with their original wavelength. As in the case of the 
light wave or the water wave, the amplitude of the de Broglie wave in region 3 
is smaller than in region 1, meaning that the current of electrons in region 3 is 
smaller than the incident current, because some of the electrons are reflected at 
the boundaries at A and B. 
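The change of wavelength from region to region can be checked with a short calculation. The sketch below uses the nonrelativistic relation λ = h/√(2mK) in the form λ = hc/√(2mc²K). The incident kinetic energy of 10 eV and the potential-energy step of 6 eV are hypothetical values chosen only for illustration, and the function name is likewise an assumption.

```python
import math

HC_EV_NM = 1239.84   # h*c in eV·nm
MC2_EV = 510999.0    # electron rest energy m*c^2 in eV

def de_broglie_nm(kinetic_ev):
    """Nonrelativistic de Broglie wavelength of an electron, in nm.

    lambda = h / sqrt(2 m K) = hc / sqrt(2 (mc^2) K)
    """
    if kinetic_ev <= 0:
        raise ValueError("kinetic energy must be positive in an allowed region")
    return HC_EV_NM / math.sqrt(2.0 * MC2_EV * kinetic_ev)

K = 10.0    # eV, assumed incident kinetic energy (regions 1 and 3)
eV0 = 6.0   # eV, assumed potential energy in region 2

lam1 = de_broglie_nm(K)        # region 1
lam2 = de_broglie_nm(K - eV0)  # region 2: smaller K, so longer wavelength
lam3 = de_broglie_nm(K)        # region 3: original wavelength restored

if __name__ == "__main__":
    print(lam1, lam2, lam3)
```

With these assumed numbers the wavelength in region 2 exceeds that in region 1 by the factor √(K/(K − eV₀)), and region 3 returns to the original value, as the text describes.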

We can thus identify a total of 5 waves moving in the three regions: (1) a wave 
moving to the right in region 1 (the incident wave); (2) a wave moving to the left 
in region 1 (representing the net combination of waves reflected from boundary A 
plus waves reflected from boundary B and then transmitted through boundary A 
back into region 1); (3) a wave moving to the right in region 2 (representing waves 
transmitted through boundary A plus waves reflected at B and then reflected again 
at A); (4) a wave moving to the left in region 2 (waves reflected at B); and (5) 
a wave moving to the right in region 3 (the transmitted waves at boundary B). 
Because we are assuming that waves are incident from region 1, it is not possible 
to have a wave moving to the left in region 3. 


Penetration of the Reflected Wave 


Another property of classical waves that carries over into quantum waves is 
penetration of a totally reflected wave into a forbidden region. When a light 
wave is completely reflected from a boundary, an exponentially decreasing wave 
called the evanescent wave penetrates into the second medium. Because 100% 
of the light wave intensity is reflected, the evanescent wave carries no energy 
and so cannot be directly observed in the second medium. But if we make the 
second medium very thin (perhaps equal to a few wavelengths of light) the light 
wave can emerge on the opposite side of the second medium. We’ll discuss this 
phenomenon in more detail at the end of this chapter. 

The same effect occurs with de Broglie waves. Suppose we increase the battery 
voltage in Figure 5.1c so that the potential energy in region 2 (equal to eV₀) 
is greater than the initial kinetic energy in region 1. The electrons do not have 
enough energy to enter region 2 (they would have negative kinetic energy there) and so 
all electrons are reflected back into region 1. 

Like light waves, de Broglie waves can also penetrate into the forbidden region 
with exponentially decreasing amplitudes. However, because de Broglie waves 
are associated with the motion of electrons, that means that electrons must also 
penetrate a short distance into the forbidden region. The electrons cannot be 
directly observed in that region, because they have negative kinetic energy there. 
Nor can we do any experiment that would reveal their “real” existence in the 
forbidden region, such as measuring the speed of their passage through that region 
or detecting the magnetic field that their motion might produce. 

One explanation for the penetration of the electrons into the forbidden region 
relies on the uncertainty principle: because we can’t know exactly the energy of 
the incident electrons, we can’t say with certainty that they don’t have enough 
kinetic energy to penetrate into the forbidden region. For short enough time Δt, the 
energy uncertainty ΔE ~ ħ/Δt might allow the electron to travel a short distance 
into the forbidden region, but this extra energy does not “belong to” the electron 
in any permanent sense. Later in this chapter we’ll discuss a more mathematical 
approach to this explanation of penetration into the forbidden region. 

FIGURE 5.2 (a) A discontinuous wave. (b) A continuous wave with a 
discontinuous slope. (c) Two sine waves join smoothly. (d) A sine wave and 
an exponential join smoothly. 


Continuity at the Boundaries 


When a wave such as a light wave or a water wave crosses a boundary as in 
Figure 5.1, the mathematical function that describes the wave must have two 
properties at each boundary: 


1. The wave function must be continuous. 
2. The slope of the wave function must be continuous, except when the boundary 
height is infinite. 


Figure 5.2a shows a discontinuous wave function; the wave displacement 
changes suddenly at a single location. This type of behavior is not allowed. 
Figure 5.2b shows a continuous wave function (there are no gaps) with a 
discontinuous slope. This type of behavior is also not allowed, unless the 
boundary is of infinite height. Figures 5.2c, d show how two sine curves and an 
exponential and a sine can be joined so that both the function and the slope are 
continuous. 

Across any non-infinite boundary, the wave must be smooth—no gaps in the 
function and no sharp changes in slope. When we solve for the mathematical 
form of a wave function, there are usually undetermined parameters, such as 
the amplitude and phase of the wave. In order to make the wave smooth at 
the boundary, we obtain the values of those coefficients by applying the two 
boundary conditions to make the function and its slope continuous. For example, 
at boundary A in Figure 5.1, we first evaluate the total wave function in region 1
at A and set it equal to the wave function in region 2 at A. This guarantees that the 
total wave function is continuous at A. We then take the derivative of the wave 
function in region 1, evaluate it at A, and set that equal to the derivative of the 
wave function in region 2 evaluated at A. This step makes the slope in region 1
match the slope in region 2 at boundary A. These two steps give us two equations 
relating the parameters of the waves and allow us to find relationships between 
the amplitudes and phases of the waves in regions 1 and 2. The process must
be repeated at every boundary, such as at B in Figure 5.1 to match the waves in 
regions 2 and 3. 

We can understand the exception to the continuity of the slope for infinite 
boundaries with an example from classical physics. Imagine a ball dropped from 
a height y = H above a stretched rubber sheet at y = 0. The ball falls freely
under gravity until it strikes the sheet, which we assume behaves like an elastic 
spring. The sheet stretches as the ball is brought to rest, after which the restoring 
force propels the ball upward. The motion of the ball might be represented by 
Figure 5.3. Above the sheet (y > 0) the motion is represented by parabolas, and 
while the ball is in contact with the sheet (y < 0) the motion is described by sine
curves. Note how the curves join smoothly at y = 0, and note how both y(t) and
its derivative v(t) are continuous. 

On the other hand, imagine a ball hitting a steel surface, which we assume to be 
perfectly rigid. The ball rebounds elastically, and at the instant it is in contact with 
the surface its velocity reverses direction. The motion of the ball is represented 
in Figure 5.4. At the points of contact with the surface, there is a sudden change
in the velocity, corresponding to an infinite acceleration and thus to an infinite 
force. The function y(t) is continuous, but its slope is not—the function has no 
gaps, but it does have sharp “points” where the slope changes suddenly. 

The assumption of the perfectly rigid surface is an idealization that we make 
to help us understand the situation and also to help simplify the mathematics. In 
reality the steel surface will flex slightly and ultimately behave somewhat like 
a much stiffer version of the rubber sheet. In quantum mechanics we will also 
sometimes use an assumption of a perfectly rigid or impenetrable boundary to 
help us understand and simplify the analysis of a more complicated physical 
situation.


In this section we have established several properties of classical waves that
also apply to quantum waves:

1. When a wave crosses a boundary between two regions, part of the wave
intensity is reflected and part is transmitted.

2. When a wave encounters a boundary to a region from which it is forbidden,
the wave will penetrate perhaps by a few wavelengths before reflecting.

3. At a finite boundary, the wave and its slope are continuous. At an infinite
boundary, the wave is continuous but its slope is discontinuous.

FIGURE 5.4 The position and velocity of a ball dropped from a height H
above a rigid surface.


Example 5.1

In the geometry of Figure 5.1, the wave in region 1 is
given by y₁(x) = C₁ sin(2πx/λ₁ − φ₁), where C₁ = 11.5,
λ₁ = 4.97 cm, and φ₁ = −65.3°. In region 2, the wavelength
is λ₂ = 10.5 cm. The boundary A is located at x = 0,
and the boundary B is located at x = L, where L = 20.0 cm.
Find the wave functions in regions 2 and 3.


Solution

The general form of the wave in region 2 can be represented
in a form similar to that of the wave in region 1:
y₂(x) = C₂ sin(2πx/λ₂ − φ₂). To find the complete wave
function in region 2, we must find the amplitude C₂ and
the phase φ₂ by applying the boundary conditions on
the function and its slope at boundary A (x = 0). Setting
y₁(x = 0) = y₂(x = 0) gives

$$ -C_1 \sin\phi_1 = -C_2 \sin\phi_2 $$

The slopes can be found from the derivative of the general
form dy/dx = (2π/λ)C cos(2πx/λ − φ) evaluated at x = 0:

$$ \frac{2\pi}{\lambda_1} C_1 \cos\phi_1 = \frac{2\pi}{\lambda_2} C_2 \cos\phi_2 $$

Dividing the first equation by the second eliminates the amplitudes
C₁ and C₂ and allows us to solve for φ₂:

$$ \phi_2 = \tan^{-1}\left(\frac{\lambda_1}{\lambda_2}\tan\phi_1\right) = \tan^{-1}\left(\frac{4.97\ \text{cm}}{10.5\ \text{cm}}\tan(-65.3^\circ)\right) = -45.8^\circ $$

We can solve for C₂ using the result from applying the first
boundary condition:

$$ C_2 = C_1\,\frac{\sin\phi_1}{\sin\phi_2} = 11.5\,\frac{\sin(-65.3^\circ)}{\sin(-45.8^\circ)} = 14.6 $$

To find the wave function in region 3, which we assume
to have the same form y₃(x) = C₃ sin(2πx/λ₁ − φ₃), we
must apply the boundary conditions on y₂ and y₃ at x = L.
Applying the two boundary conditions in the same way we
did at x = 0, we obtain

$$ C_2 \sin\left(\frac{2\pi L}{\lambda_2} - \phi_2\right) = C_3 \sin\left(\frac{2\pi L}{\lambda_1} - \phi_3\right) $$

$$ \frac{2\pi}{\lambda_2} C_2 \cos\left(\frac{2\pi L}{\lambda_2} - \phi_2\right) = \frac{2\pi}{\lambda_1} C_3 \cos\left(\frac{2\pi L}{\lambda_1} - \phi_3\right) $$

Proceeding as we did before, we divide these two
equations to find φ₃ = −14.6°, and then from either
equation obtain C₃ = 7.36. Our two solutions are
then y₂(x) = 14.6 sin(2πx/10.5 + 45.8°) and y₃(x) =
7.36 sin(2πx/4.97 + 14.6°), with x measured in cm.
Figure 5.5 shows the wave in all three regions. Note
how the waves join smoothly at the boundaries.

How is it possible that the amplitude of y₂ can be greater
than the amplitude of y₁? Keep in mind that y₁ represents
the total wave in region 1, which includes the incident wave
and the reflected wave. Depending on the phase difference
between them, when the incident and reflected waves are
added to obtain y₁, the amplitude of the resultant can be
smaller than the amplitude of either wave.
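
The matching procedure of Example 5.1 can be checked numerically. The following is a minimal sketch, not part of the original example: the helper name `match_wave` is hypothetical, and it assumes every region carries a wave of the form C sin(2πx/λ − φ) with the positive-amplitude branch chosen on the far side of each boundary.

```python
import math


def match_wave(c_in, lam_in, phi_in_deg, lam_out, x):
    """Match value and slope of y = C sin(2*pi*x/lam - phi) at position x.

    Returns (C_out, phi_out_deg) for the wave of wavelength lam_out on the
    far side of the boundary, with the phase reduced to [-180, 180) degrees.
    """
    theta_in = 2 * math.pi * x / lam_in - math.radians(phi_in_deg)
    value = c_in * math.sin(theta_in)  # y at the boundary
    # Slope continuity: (2*pi/lam_in) C_in cos(theta_in) = (2*pi/lam_out) C_out cos(theta_out)
    c_cos_out = (lam_out / lam_in) * c_in * math.cos(theta_in)
    # Far side: C_out sin(theta_out) = value and C_out cos(theta_out) = c_cos_out
    c_out = math.hypot(value, c_cos_out)
    theta_out = math.atan2(value, c_cos_out)
    phi_out = math.degrees(2 * math.pi * x / lam_out - theta_out)
    phi_out = (phi_out + 180.0) % 360.0 - 180.0
    return c_out, phi_out


# Boundary A at x = 0 (region 1 -> region 2), lengths in cm
c2, phi2 = match_wave(11.5, 4.97, -65.3, 10.5, 0.0)
# Boundary B at x = L = 20.0 cm (region 2 -> region 3)
c3, phi3 = match_wave(c2, 10.5, phi2, 4.97, 20.0)

print(f"C2 = {c2:.2f}, phi2 = {phi2:.1f} deg")
print(f"C3 = {c3:.2f}, phi3 = {phi3:.1f} deg")
```

Using `atan2` together with `hypot` solves both boundary equations at once, which avoids dividing by a sine or cosine that happens to be close to zero.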

















FIGURE 5.5 Example 5.1. 


5.2 CONFINING A PARTICLE 


A free particle (that is, a particle on which no forces act anywhere) is by definition 
not confined, so it can be located anywhere. It has, as we discussed in Chapter 4, 
a definite wavelength, momentum, and energy (for which we can choose any 
value). 

A confined particle, on the other hand, is represented by a wave packet that 
makes it likely to be found only in a region of space of size Δx. We construct
such a wave packet by adding together different sine or cosine waves to obtain 
the desired mathematical shape. 

In quantum mechanics, we often want to analyze the behavior of confined 
particles, for example an electron that is attached to a specific atom or molecule. 
We’ll consider the properties of atomic electrons beginning in Chapter 6, but for 
now let’s look at a simpler problem: an electron moving in one dimension and 
confined by a series of electric fields. Figure 5.6 shows how the apparatus of
Figure 5.1c might be modified for this purpose. The center section is grounded
(so that V = 0) and the two side sections are connected to batteries so that they
are at potentials of −V₀ relative to the center section. As before, we assume that
the gaps between the center section and the side sections can be made as narrow
as possible, so we can regard the potential energy as changing instantaneously at
the boundaries A and B. This arrangement is often called a potential energy well.

The potential energy of an electron in this situation is then 0 in the center
section and U₀ = qV = (−e)(−V₀) = +eV₀ in the two side sections as shown in
Figure 5.6. To confine the electron, we want to consider cases in which it moves
in the center section with a kinetic energy K that is less than U₀. For example, the
electron might have a kinetic energy of 5 eV in the center section, and the side
sections might have potential energies of 10 eV. The electron thus does not have
enough energy to “climb” the potential energy hill between the center section and
the side sections, and (at least from the classical point of view) the electron is
confined to the center section.

FIGURE 5.6 (a) Apparatus for confining an electron to the center region
of length L. (b) The potential energy of an electron in this apparatus.

We’ll discuss the full solution to this problem later in this chapter, but for now
let’s simplify even further and consider the case of an infinitely high potential
energy barrier at A and B. This is a good approximation to the situation in
which the kinetic energy of the electron in the center section is much smaller
than the potential energy supplied by the batteries. In this case the penetration
into the forbidden region, which we discussed in Section 5.1, cannot occur. The
probability to find the electron in either of the side regions is therefore precisely
zero everywhere in those regions, and thus the wave amplitude is zero everywhere
in those regions, including at the boundaries (locations A and B). For the wave
function to be continuous, the wave function in the center section must have
values of zero at A and B.

Of all the possible waves that might be used to describe the particle in this 
center section, the continuity condition restricts us to waves that have zero 
amplitude at the boundaries. Some of those waves are illustrated in Figure 5.7. 
Note that the wave function is continuous, but its slope is not (there are sharp 
points in the function at locations A and B). This is an example of the exception 
to the second boundary condition—the slope may be discontinuous at an infinite 
barrier. 

In contrast to the free particle for which the wavelength could have any value, 
only certain values of the wavelength are allowed. The de Broglie relationship then 
tells us that only certain values of the momentum are allowed, and consequently 
only certain values of the energy are allowed. The energy is not a continuous 
variable, free to take on any arbitrary value; instead, the energy is a discrete 
variable that is restricted to a certain set of values. This is known as quantization 
of energy. 

You can see directly from Figure 5.7 that the allowed wavelengths are
2L, L, 2L/3, ..., where L is the length of the center section. We can write these
wavelengths as

$$ \lambda_n = \frac{2L}{n} \qquad n = 1, 2, 3, \ldots \tag{5.1} $$

This set of wavelengths is identical to the wavelengths of the classical problem
of standing waves on a string stretched between two points. From the de Broglie
relationship λ = h/p we obtain

$$ p_n = n\,\frac{h}{2L} \tag{5.2} $$


The energy of the particle in the center section is only kinetic energy p²/2m,
and so

$$ E_n = n^2\,\frac{h^2}{8mL^2} \tag{5.3} $$

These are the allowed or quantized values of the energy of the electron.

A wave packet describing the electron in this region must be a combination of
waves with the allowed values of the wavelengths. However, it is not necessary
to construct a wave packet from a combination of waves to describe this confined
particle. Even a single one of these waves represents the confined particle, because
the wave function must be zero in the forbidden regions. So the waveforms shown
in Figure 5.7 can represent wave packets of this confined electron, each wave
packet consisting of only a single wave.

The appearance of energy quantization accompanies every attempt to confine
a particle to a finite region of space. Quantization of energy is one of the principal
features of the quantum theory, and studying the quantized energy levels of
systems (such as by observing the energies of emitted photons) is an important
technique of experimental physics that gives us information about the properties
of atoms and nuclei.
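
To get a feeling for the sizes involved, the allowed wavelengths, momenta, and energies of Eqs. 5.1 through 5.3 can be evaluated for an electron. The sketch below is only an illustration: the well width of 0.10 nm is an assumed, atom-sized value that does not come from the text, and the function names are hypothetical. It uses the combinations hc and mc² so that energies come out directly in eV.

```python
HC_EV_NM = 1239.84   # h*c in eV*nm
MC2_EV = 510_999.0   # electron rest energy m*c^2 in eV


def allowed_wavelength(n, width_nm):
    """Eq. 5.1: lambda_n = 2L/n, in nm."""
    return 2.0 * width_nm / n


def allowed_pc(n, width_nm):
    """Eq. 5.2 multiplied by c: p_n c = n h c / (2L), in eV."""
    return n * HC_EV_NM / (2.0 * width_nm)


def allowed_energy(n, width_nm):
    """Eq. 5.3: E_n = n^2 h^2 / (8 m L^2) = n^2 (hc)^2 / (8 mc^2 L^2), in eV."""
    return n**2 * HC_EV_NM**2 / (8.0 * MC2_EV * width_nm**2)


WELL_WIDTH_NM = 0.10  # assumed width, roughly the size of an atom

for n in range(1, 4):
    print(
        f"n = {n}: lambda = {allowed_wavelength(n, WELL_WIDTH_NM):.4f} nm, "
        f"pc = {allowed_pc(n, WELL_WIDTH_NM):.0f} eV, "
        f"E = {allowed_energy(n, WELL_WIDTH_NM):.1f} eV"
    )
```

For this assumed width the lowest energy is a few tens of eV, and the energies grow as n², so the spacing between neighboring levels increases with n.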
FIGURE 5.7 Some possible waves that might be used to describe an electron
confined by an infinite potential energy barrier to a region of length L.

Applying the Uncertainty Principle to a Confined Particle


In Chapter 4 we constructed wave packets and showed how the uncertainty 
principle related the size of the wave packet to the range of wavelengths that 
was used in its construction. Let’s now see how the Heisenberg uncertainty 
relationships apply in the case of a confined particle. 

In the arrangement of Figure 5.6 (with infinitely high barriers on each side),
the particle is known to be somewhere in the center section of the apparatus, and
thus Δx ~ L is a reasonable estimate of the uncertainty in its location. To find the
uncertainty in its momentum, we use the rigorous definition of uncertainty given
in Eq. 4.15: $\Delta p_x = \sqrt{(p_x^2)_{av} - (p_{x,av})^2}$. The particle moving in the center section
can be considered to be moving to the left or to the right with equal probability
(just as the classical standing-wave problem can be analyzed as the superposition
of identical waves moving to the left and to the right). Thus $p_{x,av} = 0$. If the
particle is moving with a momentum given by Eq. 5.2, $(p_x^2)_{av} = (nh/2L)^2$ and so
$\Delta p_x = nh/2L$. Combining the uncertainties in position and momentum, we have

$$ \Delta x\,\Delta p_x \sim L\,\frac{nh}{2L} = \frac{nh}{2} \tag{5.4} $$

The product of the uncertainties is certainly greater than ħ/2, and so the result
of confining the particle is entirely consistent with the Heisenberg uncertainty
relationship. Note that even the smallest possible value of the product of the
uncertainties (which is obtained for n = 1) is still much larger than the minimum
value given by the uncertainty principle.

Later in this chapter, we will evaluate the uncertainty in position more rigorously,
using a formula similar to Eq. 4.15, and we will find that the result does not
differ very much from the estimate of Eq. 5.4.
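
As a rough numerical illustration of this estimate, the sketch below compares the product Δx Δp with the minimum value ħ/2. It assumes Δx = L and takes Δp equal to the magnitude of the momentum in Eq. 5.2, nh/2L; the well width used is an arbitrary illustrative value, and the function name is hypothetical.

```python
import math

H = 6.62607015e-34          # Planck constant in J*s
HBAR = H / (2.0 * math.pi)  # reduced Planck constant in J*s


def uncertainty_product(n, width_m):
    """Estimate of Delta-x * Delta-p for level n of a well of width L (meters)."""
    delta_x = width_m              # the particle is somewhere in the well
    delta_p = n * H / (2.0 * width_m)  # magnitude of the momentum from Eq. 5.2
    return delta_x * delta_p


WELL_WIDTH_M = 1.0e-10  # assumed width; it cancels out of the product

ratios = [uncertainty_product(n, WELL_WIDTH_M) / (HBAR / 2.0) for n in (1, 2, 3)]
for n, ratio in zip((1, 2, 3), ratios):
    print(f"n = {n}: (dx dp) / (hbar/2) = {ratio:.2f}")
```

Under these assumptions the ratio is 2πn, independent of L, so even the n = 1 estimate exceeds the minimum by a factor of about six.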


5.3 THE SCHRÖDINGER EQUATION


The differential equation whose solution gives us the wave behavior of particles is
called the Schrödinger equation. It was developed in 1926 by Austrian physicist
Erwin Schrödinger. The equation cannot be derived from any previous laws
or postulates; like Newton’s equations of motion or Maxwell’s equations of
electromagnetism, it is a new and independent result whose correctness can
be determined only by comparing its predictions with experimental results.
For nonrelativistic motion, the Schrödinger equation gives results that correctly
account for observations at the atomic and subatomic level.

Erwin Schrödinger (1887-1961, Austria). Although he disagreed with the
probabilistic interpretation that was later given to his work, he developed
the mathematical theory of wave mechanics that for the first time permitted
the wave behavior of physical systems to be calculated.

We can justify the form of the Schrödinger equation by examining the solution
expected for the free particle, which should give a wave whose shape at any
particular time, specified by the wave function ψ(x), is that of a simple de
Broglie wave, such as ψ(x) = A sin kx, where A is the amplitude of the wave and
k = 2π/λ. If we are looking for a differential equation, then we need to take some
derivatives:

$$ \frac{d\psi}{dx} = kA\cos kx \qquad \frac{d^2\psi}{dx^2} = -k^2 A\sin kx = -k^2\psi(x) $$

Note that the second derivative gives the original function again. With the kinetic
energy K = p²/2m = (h/λ)²/2m = ħ²k²/2m, we can then write

$$ \frac{d^2\psi}{dx^2} = -k^2\psi(x) = -\frac{2m}{\hbar^2}K\psi(x) = -\frac{2m}{\hbar^2}(E - U)\psi(x) $$

where E = K + U is the nonrelativistic total energy of the particle. For a free
particle, U = 0 so E = K; however, we are using the free particle solution to try
to extend to the more general case in which there is a potential energy U(x). The
equation then becomes

$$ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + U(x)\psi(x) = E\psi(x) \tag{5.5} $$

Equation 5.5 is the time-independent Schrödinger equation for one-dimensional
motion.
The solution to Eq. 5.5 gives the shape of the wave at time t = 0. The
mathematical function that describes a one-dimensional traveling wave must
involve both x and t. This wave is represented by the function Ψ(x,t):

$$ \Psi(x,t) = \psi(x)\,e^{-i\omega t} \tag{5.6} $$

The time dependence is given by the complex exponential function $e^{-i\omega t}$ with
ω = E/ħ. (You can find a few useful formulas involving complex numbers in
Appendix B.) We’ll discuss the time-dependent part later in this chapter. For now,
we’ll concentrate on the time-independent function ψ(x).

We assume that we know the potential energy U(x), and we wish to obtain the 
wave function ψ(x) and the energy E for that potential energy. This is a general
example of a type of problem known as an eigenvalue problem; we find that it is 
possible to obtain solutions to the equation only for particular values of E, which 
are known as the energy eigenvalues. 

The general procedure for solving the Schrödinger equation is as follows:


1. Begin by writing Eq. 5.5 with the appropriate U(x). Note that if the potential
energy changes discontinuously [U(x) may be represented by a discontinuous
function; ψ(x) may not], we may need to write different equations for different
regions of space. Examples of this sort are given in Section 5.4.

2. Using general mathematical techniques suited to the form of the equation, find
a mathematical function ψ(x) that is a solution to the differential equation.
Because there is no one specific technique for solving differential equations,
we will study several examples to learn how to find solutions.

3. In general, several solutions may be found. By applying boundary conditions
some of these may be eliminated and some arbitrary constants may be
determined. It is generally the application of the boundary conditions that
selects out the allowed energies.

4. If you are seeking solutions for a potential energy that changes discontinuously,
you must apply the continuity conditions on ψ(x) (and usually on dψ/dx) at
the boundary between different regions.


Because the Schrödinger equation is linear, any constant multiplying a solution
is also a solution. The method to determine the amplitude of the wave function is
discussed in the next section.
FIGURE 5.8 The probability to find the particle in a small region of width
dx is equal to the area of the strip under the |ψ(x)|² curve. The total probability
to find the particle between x₁ and x₂ is the sum of the areas of the strips, equal
to the integral between those limits.




Probabilities and Normalization 


The remaining steps in the procedure for applying the Schrödinger recipe depend
on the physical interpretation of the solution to the differential equation. Our
original goal in solving the Schrödinger equation was to obtain the wave properties
of the particle. What does the amplitude of ψ(x) represent, and what is the physical
variable that is waving? It is certainly not a displacement, as in the case of a water
wave or a wave on a stretched piano wire, nor is it a pressure wave, as in the case
of sound. It is a very different kind of wave, whose squared absolute amplitude
gives the probability for finding the particle in a given region of space.

If we define P(x) as the probability density (probability per unit length, in one
dimension), then according to the Schrödinger recipe

$$ P(x)\,dx = |\psi(x)|^2\,dx \tag{5.7} $$

as indicated in Figure 5.8. In Eq. 5.7, |ψ(x)|² dx gives the probability to find the
particle in the interval dx at x (that is, between x and x + dx).* Because the wave
function ψ(x) might be a complex function, it is necessary to square its absolute
magnitude to make sure that the probability is a positive real number.

The squared magnitude of the general time-dependent wave function
(Eq. 5.6) is:

$$ |\Psi(x,t)|^2 = |\psi(x)|^2\,|e^{-i\omega t}|^2 = |\psi(x)|^2 \tag{5.8} $$

where the last step can be taken because the magnitude of the time-dependent
factor is 1. For this reason, the probability density associated with a solution to
the Schrödinger equation (for any allowed value of E) is independent of time.
These special quantum states are called stationary states.
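
The time independence expressed by Eq. 5.8 is easy to confirm with complex arithmetic. In this small sketch the value of ψ at a fixed point, the angular frequency, and the sample times are arbitrary illustrative numbers, and the function name is hypothetical.

```python
import cmath


def full_wave_function(psi_x, omega, t):
    """Eq. 5.6: Psi(x, t) = psi(x) * exp(-i * omega * t) at one fixed x."""
    return psi_x * cmath.exp(-1j * omega * t)


psi_x = 0.6 + 0.8j   # assumed complex value of psi at some point x
omega = 2.5          # assumed angular frequency E/hbar, arbitrary units

# Probability density |Psi|^2 at several different times
densities = [abs(full_wave_function(psi_x, omega, t)) ** 2 for t in (0.0, 0.3, 1.7, 42.0)]
print(densities)
```

The wave function itself changes with time (its phase rotates), but every entry of `densities` equals |ψ(x)|², which is the defining feature of a stationary state.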

This interpretation of |ψ(x)|² helps us to understand the continuity condition
of ψ(x). We must not allow the probability to change discontinuously; like
any well-behaved wave, the probability to locate the particle varies smoothly and
continuously.

This interpretation of ψ(x) now permits us to complete the Schrödinger recipe
and to illustrate how to use the wave function to calculate quantities that we can
measure in the laboratory. Steps 1 through 4 were given previously; the recipe
continues:


5. For a wave function describing a single particle, the probability summed over
all locations must give 100%; that is, the particle must be located somewhere
between x = −∞ and x = +∞. The probability to find the particle in a small
interval was given in Eq. 5.7. The total probability to find the particle in all
such intervals must be exactly 1:

$$ \int_{-\infty}^{+\infty} |\psi(x)|^2\,dx = 1 \tag{5.9} $$

The Schrödinger equation is linear, which means that if ψ(x) is a solution
then any constant times ψ(x) is also a solution. For the probability to be a
meaningful concept, this constant must be chosen so that Eq. 5.9 is satisfied.
A wave function with its multiplicative constant chosen in this way is said to
be normalized, and Eq. 5.9 is known as the normalization condition.

*It is not correct to speak of “the probability to find the particle at the point x.” A single point is a
mathematical abstraction with no physical dimension. The probability of finding a particle at a point
is zero, but there can be a nonzero probability of finding the particle in an interval.
Because the solution to the Schrödinger equation represents a probability,
any solution that becomes infinite must be discarded; it makes no sense
to have an infinite probability to find a particle in any interval. In practice,
we “discard” a solution by setting its multiplicative constant equal to zero.
For example, if the mathematical solution to the differential equation yields
$\psi(x) = Ae^{kx} + Be^{-kx}$ for the entire region x > 0, then we must require A = 0
for the solution to be physically meaningful; otherwise |ψ(x)|² would become
infinite as x goes to infinity. On the other hand, if this solution is to be valid
in the entire region x < 0, then we must set B = 0. However, if the solution is
to be valid only in a small portion of the range of x (say, 0 < x < L), then
we cannot set either A = 0 or B = 0.

Suppose the interval between two points x₁ and x₂ is divided into a series of
infinitesimal intervals of width dx (Figure 5.8). To find the total probability for
the particle to be located between x₁ and x₂, which we represent as P(x₁:x₂),
we calculate the sum of all the probabilities P(x) dx in each interval dx. This
sum can be expressed as an integral:

$$ P(x_1{:}x_2) = \int_{x_1}^{x_2} P(x)\,dx = \int_{x_1}^{x_2} |\psi(x)|^2\,dx \tag{5.10} $$

If the wave function has been properly normalized, Eq. 5.10 will always yield
a probability that lies between 0 and 1.
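
The normalization condition (Eq. 5.9) and the interval probability (Eq. 5.10) can be illustrated numerically. The sketch below is an assumption-laden example rather than anything from the text: it uses a hypothetical wave function proportional to e^(−|x|), in arbitrary length units, replaces the infinite limits with ±40 (where the function is negligible), and integrates with a simple trapezoid rule.

```python
import math


def trapezoid(f, a, b, steps=20000):
    """Approximate the integral of f from a to b with the trapezoid rule."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h


def psi_raw(x):
    """Hypothetical unnormalized wave function, chosen only for illustration."""
    return math.exp(-abs(x))


# Eq. 5.9: choose the multiplicative constant so the total probability is 1
norm_integral = trapezoid(lambda x: psi_raw(x) ** 2, -40.0, 40.0)
amplitude = 1.0 / math.sqrt(norm_integral)


def psi(x):
    """Normalized wave function."""
    return amplitude * psi_raw(x)


total_probability = trapezoid(lambda x: psi(x) ** 2, -40.0, 40.0)
# Eq. 5.10: probability to find the particle between x1 = -1 and x2 = +1
prob_inner = trapezoid(lambda x: psi(x) ** 2, -1.0, 1.0)

print(f"normalization constant = {amplitude:.4f}")
print(f"total probability      = {total_probability:.4f}")
print(f"P(-1:1)                = {prob_inner:.4f}")
```

For this particular function the integrals can also be done by hand, giving P(−1:1) = 1 − e⁻², which the numerical result reproduces closely.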

Because we can no longer speak with certainty about the position of the
particle, we can no longer guarantee the outcome of a single measurement of
any physical quantity that depends on its position. Instead, we can find the
average outcome of a large number of measurements. For example, suppose
we wish to find the average location of a particle by measuring its coordinate
x. From a large number of measurements, we find the value x₁ a certain
number of times n₁, x₂ a number of times n₂, etc., and in the usual way we
can find the average value

$$ x_{av} = \frac{n_1 x_1 + n_2 x_2 + \cdots}{n_1 + n_2 + \cdots} = \frac{\sum n_i x_i}{\sum n_i} \tag{5.11} $$

The number of times nᵢ that we measure each xᵢ is proportional to the
probability P(xᵢ) dx to find the particle in the interval dx at xᵢ. Making this
substitution and changing the sums to integrals, we have

$$ x_{av} = \frac{\int_{-\infty}^{+\infty} P(x)\,x\,dx}{\int_{-\infty}^{+\infty} P(x)\,dx} = \int_{-\infty}^{+\infty} |\psi(x)|^2\,x\,dx \tag{5.12} $$

where the last step can be made if the wave function is normalized, in which
case the denominator of Eq. 5.12 is equal to one.

By analogy, the average value of any function of x can be found:

$$ [f(x)]_{av} = \int_{-\infty}^{+\infty} P(x)\,f(x)\,dx = \int_{-\infty}^{+\infty} |\psi(x)|^2 f(x)\,dx \tag{5.13} $$

Average values calculated according to Eq. 5.12 or 5.13 are known as
expectation values.
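
Expectation values can be estimated the same way. The following sketch evaluates Eqs. 5.12 and 5.13 in their ratio form, so it works whether or not the wave function is normalized. The probability density used, |ψ|² = 4x²e^(−2x) for x ≥ 0, is a hypothetical example in arbitrary length units, and the upper limit of 40 stands in for infinity.

```python
import math


def trapezoid(f, a, b, steps=20000):
    """Approximate the integral of f from a to b with the trapezoid rule."""
    h = (b - a) / steps
    total = 0.5 * (f(a) + f(b))
    for i in range(1, steps):
        total += f(a + i * h)
    return total * h


def prob_density(x):
    """Hypothetical |psi(x)|^2 for psi = 2 x exp(-x) on x >= 0 (zero for x < 0)."""
    return 4.0 * x * x * math.exp(-2.0 * x)


def expectation(f, density, a=0.0, b=40.0):
    """Eq. 5.13 with the normalizing denominator of Eq. 5.12 kept explicit."""
    numerator = trapezoid(lambda x: density(x) * f(x), a, b)
    denominator = trapezoid(density, a, b)
    return numerator / denominator


x_av = expectation(lambda x: x, prob_density)
x2_av = expectation(lambda x: x * x, prob_density)
delta_x = math.sqrt(x2_av - x_av**2)   # same form as the uncertainty definition

print(f"x_av = {x_av:.4f}, (x^2)_av = {x2_av:.4f}, delta_x = {delta_x:.4f}")
```

For this density the integrals can be done analytically, giving x_av = 3/2 and (x²)_av = 3, which provides a check on the numerical values.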
5.4 APPLICATIONS OF THE SCHRÖDINGER EQUATION


Solutions for Constant Potential Energy 


First let’s examine the solutions to the Schrödinger equation for the special case
of a constant potential energy, equal to U₀. Then Eq. 5.5 becomes

$$ -\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + U_0\psi(x) = E\psi(x) \tag{5.14} $$

or (assuming for now that E > U₀)

$$ \frac{d^2\psi}{dx^2} = -k^2\psi(x) \quad \text{with} \quad k = \sqrt{\frac{2m(E - U_0)}{\hbar^2}} \tag{5.15} $$

The parameter k in this equation is equal to the wave number 2π/λ.
The solution to Eq. 5.15 is a function of x that, when differentiated twice, gives
back the original function multiplied by the negative constant −k². The function
that has this property is sin kx or cos kx. The most general solution to the equation is

$$ \psi(x) = A\sin kx + B\cos kx \tag{5.16} $$

The constants A and B must be determined by applying the continuity and
normalization requirements. We can demonstrate that Eq. 5.16 satisfies Eq. 5.15
by taking two derivatives:

$$ \frac{d\psi}{dx} = kA\cos kx - kB\sin kx $$

$$ \frac{d^2\psi}{dx^2} = -k^2 A\sin kx - k^2 B\cos kx = -k^2(A\sin kx + B\cos kx) = -k^2\psi(x) $$

so the original equation is indeed satisfied.

To analyze the penetration of a particle into a forbidden region, we must
consider the case in which the energy E of the particle is smaller than the potential
energy U₀. For this case we can rewrite Eq. 5.14 as

$$ \frac{d^2\psi}{dx^2} = k'^2\psi(x) \quad \text{with} \quad k' = \sqrt{\frac{2m(U_0 - E)}{\hbar^2}} \tag{5.17} $$

In this case the general solution in the forbidden regions is

$$ \psi(x) = Ae^{k'x} + Be^{-k'x} \tag{5.18} $$

Once again, we can demonstrate that Eq. 5.18 is a solution of Eq. 5.17 by taking
two derivatives:

$$ \frac{d\psi}{dx} = k'Ae^{k'x} - k'Be^{-k'x} $$

$$ \frac{d^2\psi}{dx^2} = k'^2Ae^{k'x} + k'^2Be^{-k'x} = k'^2\left(Ae^{k'x} + Be^{-k'x}\right) = k'^2\psi(x) $$

We will use Eqs. 5.16 and 5.18 as our solutions to the Schrödinger equation for
constant potential energy in the allowed (E > U₀) and forbidden (E < U₀) regions.
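
The two solutions can also be checked numerically, and the constants k and k′ evaluated for an electron. The sketch below borrows the 5 eV kinetic energy and 10 eV barrier mentioned in Section 5.2 as illustrative inputs; the amplitudes A and B, the sample point, and the function names are arbitrary assumptions. The second derivative is approximated by a central finite difference.

```python
import math

HBAR_C = 197.327    # hbar*c in eV*nm
MC2 = 510_999.0     # electron rest energy m*c^2 in eV


def wave_number(energy, u0):
    """Eq. 5.15: k = sqrt(2m(E - U0))/hbar for E > U0, in 1/nm."""
    return math.sqrt(2.0 * MC2 * (energy - u0)) / HBAR_C


def decay_constant(energy, u0):
    """Eq. 5.17: k' = sqrt(2m(U0 - E))/hbar for E < U0, in 1/nm."""
    return math.sqrt(2.0 * MC2 * (u0 - energy)) / HBAR_C


def second_derivative(f, x, h=1.0e-4):
    """Central-difference estimate of f''(x)."""
    return (f(x + h) - 2.0 * f(x) + f(x - h)) / h**2


k = wave_number(5.0, 0.0)          # allowed region: E = 5 eV, U0 = 0
k_prime = decay_constant(5.0, 10.0)  # forbidden region: E = 5 eV, U0 = 10 eV

A, B = 1.0, 2.0   # arbitrary illustrative amplitudes
X0 = 0.03         # arbitrary sample point in nm


def psi_allowed(x):
    return A * math.sin(k * x) + B * math.cos(k * x)          # Eq. 5.16


def psi_forbidden(x):
    return A * math.exp(k_prime * x) + B * math.exp(-k_prime * x)  # Eq. 5.18


lhs_allowed = second_derivative(psi_allowed, X0)
rhs_allowed = -k**2 * psi_allowed(X0)
lhs_forbidden = second_derivative(psi_forbidden, X0)
rhs_forbidden = k_prime**2 * psi_forbidden(X0)

print(f"wavelength 2*pi/k = {2 * math.pi / k:.4f} nm")
print(f"decay length 1/k' = {1 / k_prime:.4f} nm")
print(f"allowed:   {lhs_allowed:.4f} vs {rhs_allowed:.4f}")
print(f"forbidden: {lhs_forbidden:.4f} vs {rhs_forbidden:.4f}")
```

With these inputs the length 1/k′ over which the exponential changes by a factor of e is a little under 0.1 nm, which gives a sense of how far a wave can extend into a forbidden region.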
The Free Particle


For a free particle, the force is zero and so the potential energy is constant. We
may choose any value for that constant, so for convenience we’ll choose U₀ = 0.
The solution is given by Eq. 5.16, ψ(x) = A sin kx + B cos kx. The energy of the
particle is

$$ E = \frac{\hbar^2 k^2}{2m} \tag{5.19} $$

Our solution has placed no restrictions on k, so the energy is permitted to have
any value (in the language of quantum physics, we say that the energy is not
quantized). We note that Eq. 5.19 is the kinetic energy of a particle with momentum
p = ħk or, equivalently, p = h/λ. This is as we would have expected, because
the free particle can be represented by a de Broglie wave with any wavelength.

Solving for A and B presents some difficulties because the normalization
integral, Eq. 5.9, cannot be evaluated from −∞ to +∞ for this wave function.
We therefore cannot determine probabilities for the free particle from the wave
function of Eq. 5.16.

It is also instructive to write the wave function in terms of complex exponentials,
using $\sin kx = (e^{ikx} - e^{-ikx})/2i$ and $\cos kx = (e^{ikx} + e^{-ikx})/2$:

$$ \psi(x) = A\left(\frac{e^{ikx} - e^{-ikx}}{2i}\right) + B\left(\frac{e^{ikx} + e^{-ikx}}{2}\right) = A'e^{ikx} + B'e^{-ikx} \tag{5.20} $$

where A′ = A/2i + B/2 and B′ = −A/2i + B/2. To interpret this solution in terms
of waves we form the complete time-dependent wave function using Eq. 5.6:

$$ \Psi(x,t) = \left(A'e^{ikx} + B'e^{-ikx}\right)e^{-i\omega t} = A'e^{i(kx - \omega t)} + B'e^{-i(kx + \omega t)} \tag{5.21} $$

The dependence of the first term on kx − ωt identifies this term as representing
a wave moving to the right (in the positive x direction) with amplitude A′, and
the second term involving kx + ωt represents a wave moving to the left (in the
negative x direction) with amplitude B′.

If we want the wave to represent a beam of particles moving in the +x direction,
then we must set B′ = 0. The probability density associated with this wave is
then, according to Eq. 5.7,

$$ P(x) = |\psi(x)|^2 = |A'|^2 e^{ikx}e^{-ikx} = |A'|^2 \tag{5.22} $$

The probability density is constant, meaning the particles are equally likely to be
found anywhere along the x axis. This is consistent with our discussion of the
free-particle de Broglie wave in Chapter 4: a wave of precisely defined wavelength
extends from x = −∞ to x = +∞ and thus gives a completely unlocalized
particle.
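
The equivalence of the trigonometric and exponential forms, and the constant probability density of a single traveling wave, can be verified with a few lines of complex arithmetic. In this sketch the values of k, A, and B and the sample points are arbitrary illustrative choices, and the function names are hypothetical.

```python
import cmath
import math


def psi_trig(x, k, a, b):
    """Eq. 5.16: A sin kx + B cos kx."""
    return a * math.sin(k * x) + b * math.cos(k * x)


def exponential_amplitudes(a, b):
    """A' = A/2i + B/2 and B' = -A/2i + B/2 from Eq. 5.20."""
    return a / 2j + b / 2, -a / 2j + b / 2


def psi_exp(x, k, a, b):
    """Eq. 5.20: A' exp(ikx) + B' exp(-ikx)."""
    a_prime, b_prime = exponential_amplitudes(a, b)
    return a_prime * cmath.exp(1j * k * x) + b_prime * cmath.exp(-1j * k * x)


k, A, B = 3.0, 1.2, -0.7            # arbitrary illustrative values
xs = [-2.0, -0.5, 0.0, 0.8, 3.1]    # arbitrary sample points

# The two forms of the wave function should agree at every point
max_mismatch = max(abs(psi_exp(x, k, A, B) - psi_trig(x, k, A, B)) for x in xs)

# Right-moving wave only (B' set to zero): density should not depend on x
a_prime, _ = exponential_amplitudes(A, B)
densities = [abs(a_prime * cmath.exp(1j * k * x)) ** 2 for x in xs]

print(f"largest mismatch between the two forms: {max_mismatch:.2e}")
print("densities of the right-moving wave:", [round(d, 6) for d in densities])
```

Each entry of `densities` equals |A′|², in line with Eq. 5.22, while the standing-wave combination of Eq. 5.16 has a density that does vary with x.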


Infinite Potential Energy Well 


Now we’ll consider the formal solution to the problem we discussed in Section 5.2: 
a particle is trapped in the region between x = 0 and x = L by infinitely high 
potential energy barriers. Imagine an apparatus like that of Figure 5.6, in which the 
particle moves freely in this region and makes elastic collisions with the perfectly 
rigid barriers that confine it. This problem is sometimes called “a particle in a 
box.” For now we’ll assume that the particle moves in only one dimension; later 
we’ll expand to two and three dimensions.
FIGURE 5.9 The potential energy of a particle that moves freely (U = 0) in
the region 0 < x < L but is completely excluded (U = ∞) from the regions
x < 0 and x > L.


The potential energy may be expressed as:

$$ U(x) = \begin{cases} 0 & 0 < x < L \\ \infty & x < 0,\ x > L \end{cases} \tag{5.23} $$


The potential energy is shown in Figure 5.9. We are free to choose any constant 
value for U in the region 0 < x < L; we choose it to be zero for convenience. 

Because the potential energy is different in the regions inside and outside the
well, we must find separate solutions in each region. We can analyze the outside
region in either of two ways. If we examine Eq. 5.5 for the region outside the well,
we find that the only way to keep the equation from becoming meaningless when
U → ∞ is to require ψ = 0, so that Uψ will not become infinite. Alternatively,
we can go back to the original statement of the problem. If the walls at the
boundaries of the well are perfectly rigid, the particle must always be in the well,
and the probability for finding it outside must be zero. To make the probability
zero everywhere outside the well, we must make ψ = 0 everywhere outside. Thus
we have

$$ \psi(x) = 0 \qquad x < 0,\ x > L \tag{5.24} $$


The Schrödinger equation for 0 < x < L, when U(x) = 0, is identical with Eq. 5.14
with U₀ = 0 and has the same solution:

$$ \psi(x) = A\sin kx + B\cos kx \qquad 0 < x < L \tag{5.25} $$

with

$$ k = \sqrt{\frac{2mE}{\hbar^2}} \tag{5.26} $$

Our solution is not yet complete, for we have not evaluated A or B, nor have
we found the allowed values of the energy E. To do this, we must apply the
requirement that ψ(x) is continuous across any boundary. In this case, we require
that our solutions for x < 0 and x > 0 match up at x = 0; similarly, the solutions
for x > L and x < L must match at x = L.

Let us begin at x = 0. At x < 0, we have found that ψ = 0, and so we must set
ψ(x) of Eq. 5.25 to zero at x = 0.


$$ \psi(0) = A\sin 0 + B\cos 0 = 0 \tag{5.27} $$

which gives B = 0. Because ψ = 0 for x > L, the second boundary condition is
ψ(L) = 0, so

$$ \psi(L) = A\sin kL + B\cos kL = 0 \tag{5.28} $$

We have already found B = 0, so we must now have A sin kL = 0. Either
A = 0, in which case ψ = 0 everywhere, ψ² = 0 everywhere, and there is no
particle (a meaningless solution) or else sin kL = 0, which is true only when
kL = π, 2π, 3π, ..., or

$$ kL = n\pi \qquad n = 1, 2, 3, \ldots \tag{5.29} $$
With k = 2π/λ, we have λ = 2L/n; this is identical with the result obtained in
introductory mechanics for the wavelengths of the standing waves in a string of
length L fixed at both ends, which we already obtained in Section 5.2 (Eq. 5.1).
Thus the solution to the Schrödinger equation for a particle trapped in a linear
region of length L is a series of standing de Broglie waves! Not all wavelengths
are permitted; only certain values, determined from Eq. 5.29, may occur.

From Eq. 5.26 we find that, because only certain values of k are permitted by 
Eq. 5.29, only certain values of E may occur—the energy is quantized! Solving 
Eq. 5.29 for k and substituting into Eq. 5.26, we obtain 


$$E_n = \frac{\hbar^2 k^2}{2m} = \frac{\hbar^2 \pi^2 n^2}{2mL^2} = \frac{h^2 n^2}{8mL^2} \qquad n = 1, 2, 3, \ldots \tag{5.30}$$

FIGURE 5.10 The first four energy levels in a one-dimensional infinite potential energy well: $E_1 = E_0$, $E_2 = 4E_0$, $E_3 = 9E_0$, $E_4 = 16E_0$. 

For convenience, let $E_0 = \hbar^2\pi^2/2mL^2 = h^2/8mL^2$; this unit of energy is 
determined by the mass of the particle and the width of the well. Then $E_n = n^2 E_0$, 
and the only allowed energies for the particle are $E_0, 4E_0, 9E_0, 16E_0$, etc. All 
intermediate values, such as $3E_0$ or $6.2E_0$, are forbidden. Figure 5.10 shows the 
allowed energy levels. The lowest energy state, for which $n = 1$, is known as the 
ground state, and the states with higher energies ($n > 1$) are known as excited 
states. 
Because the energy is purely kinetic in this case, our result means that only 
certain speeds are permitted for the particle. This is very different from the case 
of the classical trapped particle, in which the particle can be given any initial 
velocity and will move forever, back and forth, at the same speed. In the quantum 
case, this is not possible; only certain initial speeds can result in sustained states of 
motion; these special conditions represent the “stationary states.” Average values 
calculated according to Eq. 5.13 likewise do not change with time. 
From one energy state, the particle can make jumps or transitions to another 
energy state by absorbing or releasing an amount of energy equal to the energy 
difference between the two states. By absorbing energy the particle will move to 
a higher energy state, and by releasing energy it moves to a lower energy state. 
A similar effect occurs for electrons in atoms, in which the absorbed or released 
energy is usually in the form of a photon of visible light or other electromagnetic 
radiation. For example, from the state with $n = 3$ ($E_3 = 9E_0$), the particle might 
absorb an energy of $\Delta E = 7E_0$ and jump upward to the $n = 4$ state ($E_4 = 16E_0$) 
or might release energy of $\Delta E = 5E_0$ and jump downward to the $n = 2$ state 
($E_2 = 4E_0$). 


Example 5.2 

An electron is trapped in a one-dimensional region of length 
$1.00 \times 10^{-10}$ m (a typical atomic diameter). (a) Find the 
energies of the ground state and first two excited states. 
(b) How much energy must be supplied to excite the electron 
from the ground state to the second excited state? 
(c) From the second excited state, the electron drops down 
to the first excited state. How much energy is released in this 
process? 

Solution 
(a) The basic quantity of energy needed for this calculation is 

$$E_0 = \frac{h^2}{8mL^2} = \frac{(hc)^2}{8mc^2L^2} = \frac{(1240\ \text{eV}\cdot\text{nm})^2}{8(511{,}000\ \text{eV})(0.100\ \text{nm})^2} = 37.6\ \text{eV}$$

With $E_n = n^2 E_0$, we can find the energy of the states: 

$n = 1$: $E_1 = E_0 = 37.6$ eV 
$n = 2$: $E_2 = 4E_0 = 150.4$ eV 
$n = 3$: $E_3 = 9E_0 = 338.4$ eV 

(b) The energy difference between the ground state and 
the second excited state is 

$$\Delta E = E_3 - E_1 = 338.4\ \text{eV} - 37.6\ \text{eV} = 300.8\ \text{eV}$$

This is the energy that must be absorbed for the electron to 
make this jump. 

(c) The energy difference between the second and first 
excited states is 

$$\Delta E = E_3 - E_2 = 338.4\ \text{eV} - 150.4\ \text{eV} = 188.0\ \text{eV}$$

This is the energy that is released when the electron makes 
this jump. 
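The arithmetic of Example 5.2 can be reproduced with a short script. This is a minimal sketch rather than part of the text: the function names (`ground_unit`, `level`, `transition`) are illustrative choices, and the constants are the rounded values used in the example ($hc = 1240$ eV·nm, $mc^2 = 511{,}000$ eV).

```python
# Energy levels of an electron in a one-dimensional infinite well (Eq. 5.30).
# Constants are the rounded values quoted in Example 5.2 (an assumption of this sketch).
HC_EV_NM = 1240.0      # hc in eV*nm
MC2_EV = 511_000.0     # electron rest energy mc^2 in eV


def ground_unit(width_nm, mc2_ev=MC2_EV):
    """Return E0 = h^2 / (8 m L^2) = (hc)^2 / (8 mc^2 L^2) in eV."""
    return HC_EV_NM**2 / (8.0 * mc2_ev * width_nm**2)


def level(n, width_nm):
    """Return E_n = n^2 * E0 in eV for n = 1, 2, 3, ..."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return n**2 * ground_unit(width_nm)


def transition(n_from, n_to, width_nm):
    """Energy change in eV: positive means absorbed, negative means released."""
    return level(n_to, width_nm) - level(n_from, width_nm)


if __name__ == "__main__":
    L = 0.100  # well width in nm
    for n in (1, 2, 3):
        print(f"n={n}: E = {level(n, L):.1f} eV")
    print(f"ground -> second excited: {transition(1, 3, L):.1f} eV absorbed")
    print(f"second -> first excited:  {-transition(3, 2, L):.1f} eV released")
```

Because the script carries $E_0$ at full precision instead of rounding it to 37.6 eV first, its results can differ from the values in the example by about 0.1 eV in the last digit.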





To complete the solution for $\psi(x)$, we must determine the constant $A$ by using 
the normalization condition given in Eq. 5.9, $\int_{-\infty}^{+\infty} |\psi(x)|^2\,dx = 1$. The integrand 
is zero in the regions $-\infty < x < 0$ and $L < x < +\infty$, so all that remains is 

$$\int_0^L A^2 \sin^2 \frac{n\pi x}{L}\,dx = 1 \tag{5.31}$$

from which we find $A = \sqrt{2/L}$. The complete wave function for $0 < x < L$ is 
then 

$$\psi_n(x) = \sqrt{\frac{2}{L}} \sin \frac{n\pi x}{L} \qquad n = 1, 2, 3, \ldots \tag{5.32}$$

In Figure 5.11, the wave functions and probability densities $\psi^2$ are illustrated for 
the lowest several states. 

In the ground state, the particle has the greatest probability to be found near 
the middle of the well (x = L/2), and the probability falls off to zero between 
the center and the sides of the well. This is very different from the behavior 
of a classical particle—a classical particle moving at constant speed would be 
found with equal probability at every location inside the well. The quantum 
particle also has constant speed but yet is still found with differing probability 
at various locations in the well. It is the wave nature of the quantum particle that 
is responsible for this very nonclassical behavior. 



































FIGURE 5.11 The wave functions (solid lines) and probability densities (shaded regions) of the first four 
states in the one-dimensional infinite potential energy well. 
Another example of nonclassical behavior occurs for the first excited state. 
The probability density has a maximum at $x = L/4$ and another maximum at 
$x = 3L/4$. Between the two maxima, there is zero probability to find the particle 
in the center of the well at $x = L/2$. How can the particle travel from $x = L/4$ to 
$x = 3L/4$ without ever being at $x = L/2$? Of course, no classical particle could 
behave in such a way, but it is a common behavior for waves. For example, 
the first overtone of a vibrating string has a node at its midpoint and antinodes 
(vibrational maxima) at the 1/4 and 3/4 locations. 

The calculation of probabilities and average values is illustrated by the 
following examples. 





Example 5.3 

Consider again an electron trapped in a one-dimensional 
region of length $1.00 \times 10^{-10}$ m = 0.100 nm. (a) In the 
ground state, what is the probability of finding the electron 
in the region from $x = 0.0090$ nm to 0.0110 nm? (b) In the 
first excited state, what is the probability of finding the 
electron between $x = 0$ and $x = 0.025$ nm? 

Solution 
(a) When the interval is small, it is often simpler to 
use Eq. 5.7 to find the probability, instead of using 
the integration method. The width of the small interval 
is $dx = 0.0110\ \text{nm} - 0.0090\ \text{nm} = 0.0020\ \text{nm}$. Evaluating 
the wave function at the midpoint of the interval 
($x = 0.0100$ nm), we can use the $n = 1$ wave function with 
Eq. 5.7 to find 

$$P(x)\,dx = |\psi_1(x)|^2\,dx = \frac{2}{L} \sin^2 \frac{\pi x}{L}\,dx = \frac{2}{0.100\ \text{nm}} \sin^2\!\left(\frac{\pi(0.0100\ \text{nm})}{0.100\ \text{nm}}\right)(0.0020\ \text{nm}) = 0.0038 = 0.38\%$$

(b) For this wide interval, we must use the integration 
method to find the probability: 

$$P(x_1{:}x_2) = \int_{x_1}^{x_2} |\psi_2(x)|^2\,dx = \frac{2}{L} \int_{x_1}^{x_2} \sin^2 \frac{2\pi x}{L}\,dx$$

Evaluating this expression using the limits $x_1 = 0$ and 
$x_2 = 0.025$ nm gives a probability of 0.25 or 25%. This 
result is of course what we would expect by inspection of 
the graph of $\psi^2$ for $n = 2$ in Figure 5.11. The interval from 
$x = 0$ to $x = L/4$ contains 25% of the total area under the 
$\psi^2$ curve. 
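Both parts of this example can be checked numerically. The sketch below integrates the probability density of Eq. 5.32 with a simple midpoint rule; the function names and the choice of integration rule are assumptions made for illustration, not something the text prescribes.

```python
import math


def density(n, x, L):
    """Probability density |psi_n(x)|^2 = (2/L) sin^2(n*pi*x/L) inside the well."""
    return (2.0 / L) * math.sin(n * math.pi * x / L) ** 2


def probability(n, x1, x2, L, steps=10_000):
    """Probability of finding the particle between x1 and x2 (midpoint rule)."""
    dx = (x2 - x1) / steps
    return sum(density(n, x1 + (i + 0.5) * dx, L) for i in range(steps)) * dx


if __name__ == "__main__":
    L = 0.100  # well width in nm
    # (a) narrow interval: the single-point estimate P(x) dx versus the full integral
    print(density(1, 0.0100, L) * 0.0020, probability(1, 0.0090, 0.0110, L))
    # (b) wide interval in the first excited state
    print(probability(2, 0.0, 0.025, L))
```

For the narrow interval in part (a), the single-point estimate and the integral agree to about two significant figures, which is why the shortcut of Eq. 5.7 is adequate there.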





Example 5.4 

Show that the average value of $x$ is $L/2$, independent of the 
quantum state. 

Solution 
We use Eq. 5.12; because $\psi = 0$ except for $0 < x < L$, the 
limits of integration are 0 and $L$: 

$$x_{\rm av} = \int_0^L |\psi(x)|^2\, x\,dx = \frac{2}{L} \int_0^L x \sin^2 \frac{n\pi x}{L}\,dx$$

This can be integrated by parts or found in integral tables; 
the result is 

$$x_{\rm av} = \frac{L}{2}$$

Note that, as required, this result is independent of $n$. Thus 
a measurement of the average position of the particle yields 
no information about its quantum state. 
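For readers who want to see the integration carried out, here is one possible route. It uses the identity $\sin^2\theta = (1 - \cos 2\theta)/2$ followed by one integration by parts; this particular choice of steps is an assumption of the sketch, since the text only quotes the result.

```latex
\begin{aligned}
x_{\rm av} &= \frac{2}{L}\int_0^L x \sin^2\frac{n\pi x}{L}\,dx
  = \frac{1}{L}\int_0^L x\,dx \;-\; \frac{1}{L}\int_0^L x\cos\frac{2n\pi x}{L}\,dx \\
 &= \frac{L}{2} \;-\; \frac{1}{L}\left[\frac{L}{2n\pi}\,x\sin\frac{2n\pi x}{L}
    + \frac{L^2}{4n^2\pi^2}\cos\frac{2n\pi x}{L}\right]_0^L \\
 &= \frac{L}{2} - 0 = \frac{L}{2}
\end{aligned}
```

The bracket vanishes because $\sin 2n\pi = 0$ and the cosine term takes the same value at both limits, which is why no dependence on $n$ survives.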
FIGURE 5.12 The potential energy of a particle that is confined to the region 
$0 < x < L$ by finite barriers $U_0$ at $x = 0$ and $x = L$. 


Let's now look at how the uncertainty principle applies to the motion of this 
trapped particle. By solving Problems 34 and 35, you will find that the uncertainties 
in position and momentum for a particle in an infinite potential well are 
$\Delta x = L\sqrt{1/12 - 1/2\pi^2 n^2}$ and $\Delta p = hn/2L$. The product of the uncertainties is 

$$\Delta x\,\Delta p = L\sqrt{\frac{1}{12} - \frac{1}{2\pi^2 n^2}} \times \frac{hn}{2L} = \frac{hn}{2}\sqrt{\frac{1}{12} - \frac{1}{2\pi^2 n^2}}$$

Clearly the product of the uncertainties grows as $n$ grows. The minimum value 
occurs for $n = 1$, in which case $\Delta x\,\Delta p = 0.090h = 0.57\hbar$. The ground state 
represents a fairly "compact" wave packet, but it is somewhat less compact than 
the minimum possible limit of $0.50\hbar$ (Eq. 4.10). You can see from Figure 5.11 
how the wave becomes less compact (spreads out more) as $n$ increases. Even for 
$n = 2$, the product of the uncertainties grows quickly to $1.67\hbar$. 
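The quoted values of the uncertainty product follow directly from the two expressions for $\Delta x$ and $\Delta p$. The sketch below evaluates the product in units of $\hbar$; the function name is an illustrative choice, and the only input is the pair of formulas stated above.

```python
import math


def uncertainty_product_in_hbar(n):
    """Return (Delta x)(Delta p) / hbar for state n of the infinite well."""
    # Delta x / L from the expression quoted in the text
    dx_over_L = math.sqrt(1.0 / 12.0 - 1.0 / (2.0 * math.pi**2 * n**2))
    # Delta p = h n / (2L) = pi * hbar * n / L, so L cancels in the product
    dp_times_L_over_hbar = math.pi * n
    return dx_over_L * dp_times_L_over_hbar


if __name__ == "__main__":
    for n in (1, 2, 3, 4):
        print(n, round(uncertainty_product_in_hbar(n), 3))
```

The width $L$ drops out of the product, so the result depends only on the quantum number $n$.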








Finite Potential Energy Well 


Because the infinite potential energy well is an idealization of a technique for 
confining a particle, we should examine the solution when the barriers at the 
sides of the well are finite rather than infinite. The potential energy well can be 
described by 


$$U(x) = \begin{cases} 0 & 0 < x < L \\ U_0 & x < 0,\; x > L \end{cases} \tag{5.33}$$


and is sketched in Figure 5.12. We look for solutions in which the particle is 
confined to this well, and thus the energies that we deduce for the particle must 
be less than $U_0$. 

The solution in the center region (between x = 0 and x = L) is exactly the 
same as it was for the infinite well (Eq. 5.25): 


$$\psi(x) = A \sin kx + B \cos kx \qquad (0 < x < L) \tag{5.34}$$


although the values that we deduced previously for the coefficients $A$ and $B$ are 
not valid in this calculation. The region $x < 0$ is an example of a situation in 
which the energy $E$ of the particle is less than the potential energy $U_0$, and so we 
must use the solution in the form of Eq. 5.18, $\psi(x) = Ce^{k'x} + De^{-k'x}$, with $k'$ given 
in Eq. 5.17. This region includes $x = -\infty$, for which the term with the coefficient 
$D$ becomes infinite. Because we cannot allow the probability to become infinite, 
we must discard this term by setting $D = 0$. The solution for $x < 0$ is then 

$$\psi(x) = Ce^{k'x} \qquad (x < 0) \tag{5.35}$$

In the region $x > L$, the energy $E$ is once again smaller than $U_0$, and so the solution is 
also in the form of Eq. 5.18, $\psi(x) = Fe^{k'x} + Ge^{-k'x}$. Here the region now includes 
$x = +\infty$, for which the term with the coefficient $F$ would become infinite. We 
must prevent that possibility by setting $F = 0$, so the solution in this region is 

$$\psi(x) = Ge^{-k'x} \qquad (x > L) \tag{5.36}$$

We now have 4 coefficients to determine ($A$, $B$, $C$, $G$) along with the energy $E$. 
For this determination, we have 4 equations from the boundary conditions (the 
continuity of both $\psi$ and $d\psi/dx$ at both $x = 0$ and $x = L$) and one equation 
from the normalization condition. As you might imagine, solving 5 equations 
in 5 unknowns presents a straightforward but very tedious algebraic challenge. 
Moreover, the resulting solution for the energy values cannot be obtained in 
terms of a direct equation such as Eq. 5.30, but instead must be found numerically 
by solving a transcendental equation. The result is a series of increasing energy 
values, but the number of energy values is finite rather than infinite, because the 
energy cannot be allowed to exceed the value of $U_0$. 

As we did for the infinite potential energy well in Example 5.2, let's consider a 
well of width $L = 0.100$ nm. We'll choose the depth of the well to be $U_0 = 400$ eV. 
Applying the boundary conditions at $x = 0$ and $x = L$, we can eliminate all of the 
coefficients and find an equation that involves only $k$ and $k'$ (both of which depend 
on the energy $E$). Solving that equation numerically, we find four possible values 
of the energy: $E_1 = 26$ eV, $E_2 = 104$ eV, $E_3 = 227$ eV, $E_4 = 375$ eV. Here the 
subscript just numbers the energy values, starting at the ground state; there is no 
simple functional dependence of the energies on the quantum number $n$ as there 
was for the infinite well. The allowed energy levels are shown in Figure 5.13. 
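The numerical solution mentioned above can be sketched in a few lines. The text does not write out its transcendental equation, so this sketch assumes one standard equivalent form of the matching conditions, $kL = n\pi - 2\arcsin(k/k_0)$ with $k_0 = \sqrt{2mU_0}/\hbar$, and solves it by bisection. The function name and the constant $\hbar c = 197.327$ eV·nm are choices made here.

```python
import math

HBARC_EV_NM = 197.327   # hbar*c in eV*nm (assumed constant for this sketch)
MC2_EV = 511_000.0      # electron rest energy in eV


def finite_well_levels(width_nm, depth_ev, mc2_ev=MC2_EV):
    """Bound-state energies (eV) of a finite square well, lowest first."""
    k0 = math.sqrt(2.0 * mc2_ev * depth_ev) / HBARC_EV_NM  # wave number at E = U0
    levels = []
    n = 1
    # A root for index n exists only while (n - 1) * pi < k0 * L.
    while (n - 1) * math.pi < k0 * width_nm:
        def f(k, n=n):
            # Zero of f gives the allowed k; f increases monotonically with k.
            return k * width_nm + 2.0 * math.asin(k / k0) - n * math.pi

        lo, hi = 0.0, k0              # f(lo) < 0 and f(hi) > 0 inside this loop
        for _ in range(100):          # bisection
            mid = 0.5 * (lo + hi)
            if f(mid) < 0.0:
                lo = mid
            else:
                hi = mid
        k = 0.5 * (lo + hi)
        levels.append((HBARC_EV_NM * k) ** 2 / (2.0 * mc2_ev))  # E = (hbar k)^2 / 2m
        n += 1
    return levels


if __name__ == "__main__":
    for i, energy in enumerate(finite_well_levels(0.100, 400.0), start=1):
        print(f"E_{i} = {energy:.0f} eV")
```

For $L = 0.100$ nm and $U_0 = 400$ eV this form yields four bound states, in line with the four values quoted in the text. Bisection is used because each branch has exactly one root and the function is monotonic, so no starting guess is needed.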

The probability densities (square of the wave functions) for these four states are 
shown in Figure 5.14. In some ways they are similar to the probability densities in 
the infinite well—note that each state has n maxima in its probability density, just 
like the infinite well (see Figure 5.11). Unlike the infinite well, these probability 
densities show the property of penetration into the classically forbidden region. 
Look carefully at the continuity of the wave function and its slope at x = 0 and 
x = L; see how smoothly the sine and cosine function inside the well joins the 
exponentials in the forbidden regions. 

The energy levels of the finite well are smaller than those of the infinite well of 
the same width (38 eV, 150 eV, 338 eV, 602 eV), and the differences increase as 
we go to higher states. This is consistent with the uncertainty principle—because 
of the penetration into the forbidden region, $\Delta x$ is larger for the finite well and 
thus $\Delta p_x$ must be smaller. As a result, the kinetic energies are smaller for the 
finite well. From Figure 5.14 we see that the penetration distance increases as we 
go up in energy, so the difference between $\Delta x$ for the finite well and the infinite 
well increases and the energy discrepancy also increases. 


FIGURE 5.13 The energy levels in a potential energy well of depth $U_0 = 400$ eV: 
$E_1 = 26$ eV, $E_2 = 104$ eV, $E_3 = 227$ eV, $E_4 = 375$ eV. There are only four 
energy states in this well. 
FIGURE 5.14 The probability densities of the four states in the one-dimensional potential energy well of width 
0.100 nm and depth 400 eV. 

FIGURE 5.15 A particle moves freely in the two-dimensional region $0 < x < L$, 
$0 < y < L$, but encounters infinite barriers beyond that region. 


For an energy close to the top of the well such as $E_4$, a smaller uncertainty 
$\Delta E$ is necessary to reach the top of the well, giving a larger $\Delta t \sim \hbar/\Delta E$ and 
thus a larger penetration distance. At the bottom of the well, the state $E_1$ requires 
much more energy to reach the top of the well and thus needs a much larger $\Delta E$; 
the smaller resulting $\Delta t$ gives a smaller penetration distance into the forbidden 
region. 


Two-Dimensional Infinite Potential 
Energy Well 


When we extend the previous calculation to two and three dimensions, the 
principal features of the solution remain the same, but an important new feature 
is introduced. In this section we show how this occurs; this new feature, known 
as degeneracy, will turn out to be very important in our study of atomic physics. 

To begin with, we need a Schrödinger equation that is valid in more than one 
dimension; our previous version, Eq. 5.5, included only one spatial dimension. If 
the potential energy is a function of $x$ and $y$, we expect that $\psi$ also depends on 
both $x$ and $y$, and the derivatives with respect to $x$ must be replaced by derivatives 
with respect to $x$ and $y$. In two dimensions, we then have* 

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2 \psi(x,y)}{\partial x^2} + \frac{\partial^2 \psi(x,y)}{\partial y^2}\right) + U(x,y)\,\psi(x,y) = E\,\psi(x,y) \tag{5.37}$$

The two-dimensional potential energy well is: 

$$U(x,y) = \begin{cases} 0 & 0 < x < L,\; 0 < y < L \\ \infty & \text{otherwise} \end{cases} \tag{5.38}$$


The particle is confined by infinitely high barriers to the square region with the 
vertices $(x, y) = (0, 0), (L, 0), (L, L), (0, L)$, as shown in Figure 5.15. A classical 
analog might be a small disk sliding without friction on a tabletop and colliding 
elastically with walls at $x = 0$, $x = L$, $y = 0$, and $y = L$. (For simplicity, we have 
made the allowed region square; we could have made it rectangular by setting 
$U = 0$ when $0 < x < a$ and $0 < y < b$.) 

Solving partial differential equations requires a technique more involved than 
we need to consider, so we will not give the details of the solution. We suspect 
that, as in the previous case, $\psi(x, y) = 0$ outside the allowed region, in order to 
make the probability zero there. Inside the well, we consider solutions that are 
separable; that is, our function of $x$ and $y$ can be expressed as the product of one 
function that depends only on $x$ and another that depends only on $y$: 

$$\psi(x, y) = f(x)\,g(y) \tag{5.39}$$

where the functions $f$ and $g$ are similar to Eq. 5.16: 

$$f(x) = A \sin k_x x + B \cos k_x x, \qquad g(y) = C \sin k_y y + D \cos k_y y \tag{5.40}$$

*The first two terms on the left side of this equation require partial derivatives; for well-behaved 
functions, these involve taking the derivative with respect to one variable while keeping the other 
constant. Thus if $f(x, y) = x^2 + xy + y^2$, then $\partial f/\partial x = 2x + y$ and $\partial f/\partial y = x + 2y$. 

The wave number $k$ of the one-dimensional problem has become the separate 
wave numbers $k_x$ for $f(x)$ and $k_y$ for $g(y)$. We show later how these are related. 
(See also Problem 18 at the end of this chapter.) 

The continuity condition on $\psi(x, y)$ requires that the solutions inside and 
outside match at the boundary. Because $\psi = 0$ everywhere outside, the continuity 
condition then requires that $\psi = 0$ everywhere on the boundary. That is, 

$$\psi(0, y) = 0 \quad\text{and}\quad \psi(L, y) = 0 \qquad \text{for all } y$$
$$\psi(x, 0) = 0 \quad\text{and}\quad \psi(x, L) = 0 \qquad \text{for all } x$$


In analogy with the one-dimensional problem, the condition at $x = 0$ gives 
$f(0) = 0$, which requires $B = 0$ in Eq. 5.40. Similarly, the condition at $y = 0$ 
gives $g(0) = 0$, which requires $D = 0$. The condition $f(L) = 0$ requires that 
$\sin k_x L = 0$, and thus that $k_x L$ be an integer multiple of $\pi$; the condition $g(L) = 0$ 
similarly requires that $k_y L$ be an integer multiple of $\pi$. These two integers do not 
necessarily need to be the same, so we call them $n_x$ and $n_y$. Making all these 
substitutions into Eq. 5.39, we obtain 

$$\psi(x, y) = A' \sin \frac{n_x \pi x}{L} \sin \frac{n_y \pi y}{L} \tag{5.41}$$

where we have combined $A$ and $C$ into $A'$. The coefficient $A'$ is once again found 
by the normalization condition, which in two dimensions becomes 

$$\iint \psi^2\,dx\,dy = 1 \tag{5.42}$$

For our case this gives 

$$\int_0^L dy \int_0^L A'^2 \sin^2 \frac{n_x \pi x}{L} \sin^2 \frac{n_y \pi y}{L}\,dx = 1 \tag{5.43}$$

from which follows 

$$A' = \frac{2}{L} \tag{5.44}$$

(The solutions to this problem, which are standing de Broglie waves on a two-dimensional 
surface, are similar to the solutions of the classical problem of the 
vibrations of a stretched membrane such as a drumhead.) 

Finally, we can substitute our solution for $\psi(x, y)$ back into the Schrödinger 
equation, Eq. 5.37, to find the energy: 

$$E = \frac{\hbar^2 \pi^2}{2mL^2}\left(n_x^2 + n_y^2\right) = \frac{h^2}{8mL^2}\left(n_x^2 + n_y^2\right) \tag{5.45}$$

Compare this result with Eq. 5.30. Once again we let $E_0 = \hbar^2\pi^2/2mL^2 = h^2/8mL^2$ 
so that $E = E_0(n_x^2 + n_y^2)$. In Figure 5.16 the energies of the excited states are shown. 
You can see how different the energies are from those of the one-dimensional 
case shown in Figure 5.10. 
Figure 5.17 shows the probability density $\psi^2$ for several different combinations 
of the quantum numbers $n_x$ and $n_y$. The probability has maxima and minima, just 
like the probability in the one-dimensional problem. For example, if we gave 


FIGURE 5.16 The lower permitted energy levels of a particle confined to 
an infinite two-dimensional potential energy well. 

$(n_x, n_y)$          $E$ 
(1,1)                 $2E_0$ 
(2,1) or (1,2)        $5E_0$ 
(2,2)                 $8E_0$ 
(3,1) or (1,3)        $10E_0$ 
(3,2) or (2,3)        $13E_0$ 
(4,1) or (1,4)        $17E_0$ 
(3,3)                 $18E_0$ 
(4,2) or (2,4)        $20E_0$ 
(4,3) or (3,4)        $25E_0$ 
(5,1) or (1,5)        $26E_0$ 
(5,2) or (2,5)        $29E_0$ 
FIGURE 5.17 The probability density for some of the lower energy levels of a particle confined to the infinite 
two-dimensional potential energy well. The individual plots are labeled with the quantum numbers $(n_x, n_y)$ and 
with the value of the energy $E$. 

FIGURE 5.18 A ring of iron atoms on a copper surface forms a "corral" 
within which the probability density of trapped electrons is clearly visible. 
This image was obtained with a scanning tunneling electron microscope. 
(Image originally created by IBM Corporation.) 


the particle an energy of $8E_0$ and then made a large number of measurements 
of its position, we would expect to find it most often near the four points 
$(x, y) = (L/4, L/4), (L/4, 3L/4), (3L/4, L/4)$, and $(3L/4, 3L/4)$; we expect never 
to find it near $x = L/2$ or $y = L/2$. The shape of the probability density tells us 
something about the quantum numbers and therefore about the energy. Thus if we 
measured the probability density and found six maxima, as shown in Figure 5.17, 
we would deduce that the particle had an energy of $13E_0$ with $n_x = 2$ and $n_y = 3$, 
or else $n_x = 3$, $n_y = 2$. 

Recently it has become possible to photograph the probability densities of 
electrons confined in a two-dimensional region. The tip of an electron microscope 
was used to place 48 individual iron atoms on a metal surface in a ring or 
“corral” of radius 7.13 nm that formed the walls of a potential well, as shown in 
Figure 5.18. Inside the ring, the waves of probability density for electrons trapped 
in the potential well are clearly visible. The potential well is circular, rather than 
square, but otherwise the analysis follows the procedures described in this section; 
when the Schrödinger equation is solved in cylindrical polar coordinates with the 
potential energy for a circular well, the calculated probability density gives a close 
match with the observed one. These beautiful results are a dramatic confirmation 
of the wave functions obtained for the two-dimensional potential energy well. 


Degeneracy Occasionally it happens that two different sets of quantum numbers 
$n_x$ and $n_y$ have exactly the same energy. This situation is known as degeneracy, 
and the energy levels are said to be degenerate. For example, the energy level 
at $E = 13E_0$ is degenerate, because both $n_x = 2$, $n_y = 3$ and $n_x = 3$, $n_y = 2$ have 
$E = 13E_0$. This degeneracy arises from interchanging $n_x$ and $n_y$ (which is the 
same as interchanging the $x$ and $y$ axes), so the probability distributions in the 
two cases are not very different. However, consider the state with $E = 50E_0$, for 
which there are three sets of quantum numbers: $n_x = 7$, $n_y = 1$; $n_x = 1$, $n_y = 7$; 
and $n_x = 5$, $n_y = 5$. The first two sets of quantum numbers result from the interchange 
of $n_x$ and $n_y$ and so have similar probability distributions, but the third 
represents a very different state of motion, as shown in Figure 5.19. The level 
at $E = 13E_0$ is said to be two-fold degenerate, while the level at $E = 50E_0$ is 
three-fold degenerate; we could also say that one level has a degeneracy of 2, 
while the other has a degeneracy of 3. 

FIGURE 5.19 Two very different probability densities with exactly the same energy. 

Degeneracy occurs in general whenever a system is labeled by two or more 
quantum numbers; as we have seen in the above calculation, different combinations 
of quantum numbers often can give the same value of the energy. The number 
of different quantum numbers required by a given physical problem turns out 
to be exactly equal to the number of dimensions in which the problem is 
being solved—one-dimensional problems need only one quantum number, two- 
dimensional problems need two, and so forth. When we get to three dimensions, 
as in Problem 19 at the end of this chapter and especially in the hydrogen atom in 
Chapter 7, we find that the effects of degeneracy become more significant; in the 
case of atomic physics, the degeneracy is a major contributor to the structure and 
properties of atoms. 
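The degeneracies quoted above can be found by brute force from Eq. 5.45. The sketch below tabulates $E/E_0 = n_x^2 + n_y^2$ for all pairs up to a cutoff; the function name and the cutoff `n_max` are illustrative assumptions, and the table is complete only for energies up to $(n_{\rm max}^2 + 1)E_0$.

```python
from collections import defaultdict


def levels_2d(n_max):
    """Map E/E0 = nx^2 + ny^2 to the list of (nx, ny) pairs with 1 <= nx, ny <= n_max."""
    table = defaultdict(list)
    for nx in range(1, n_max + 1):
        for ny in range(1, n_max + 1):
            table[nx * nx + ny * ny].append((nx, ny))
    return dict(table)


if __name__ == "__main__":
    table = levels_2d(7)
    for energy in sorted(table):
        if energy <= 50:  # entries above n_max^2 + 1 may be missing some pairs
            print(f"{energy:3d} E0  degeneracy {len(table[energy])}  {table[energy]}")
```

Running it with `n_max = 7` lists every level up to $50E_0$, including the two-fold level at $13E_0$ and the three-fold level at $50E_0$ discussed in the text.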


5.5 THE SIMPLE HARMONIC OSCILLATOR 


Another situation that can be analyzed using the Schrödinger equation is the 
one-dimensional simple harmonic oscillator. The classical oscillator is an object 
of mass $m$ attached to a spring of force constant $k$. The spring exerts a restoring 
force $F = -kx$ on the object, where $x$ is the displacement from its equilibrium 
position. Using Newton's laws, we can analyze the oscillator and show that it has 
a (circular or angular) frequency $\omega_0 = \sqrt{k/m}$ and a period $T = 2\pi\sqrt{m/k}$. The 
maximum distance of the oscillating object from its equilibrium position is $x_0$, 
the amplitude of the oscillation. The oscillator has its maximum kinetic energy at 
$x = 0$; its kinetic energy vanishes at the turning points $x = \pm x_0$. At the turning 
points the oscillator comes to rest for an instant and then reverses its direction of 
motion. The motion is, of course, confined to the region $-x_0 < x < +x_0$. 

FIGURE 5.20 The probability density for the ground state of the simple harmonic 
oscillator. The classical turning points are at $x = \pm x_0$. 


Why analyze the motion of such a system using quantum mechanics? Although 
we never find in nature an example of a one-dimensional quantum oscillator, there 
are systems that behave approximately as one—a vibrating diatomic molecule, 
for example. In fact, any system in a smoothly varying potential energy well near 
its minimum behaves approximately like a simple harmonic oscillator. 

A force $F = -kx$ has the associated potential energy $U = \frac{1}{2}kx^2$, and so we 
have the Schrödinger equation: 

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} + \frac{1}{2}kx^2\psi = E\psi \tag{5.46}$$

(Because we are working in one dimension, $U$ and $\psi$ are functions only of $x$.) 
There are no boundaries between different regions of potential energy here, so 
the wave function must fall to zero for both $x \to +\infty$ and $x \to -\infty$. The simplest 
function that satisfies these conditions, which turns out to be the correct ground 
state wave function, is $\psi(x) = Ae^{-ax^2}$. The constant $a$ and the energy $E$ can be 
found by substituting this function into Eq. 5.46. We begin by evaluating $d^2\psi/dx^2$. 

$$\frac{d\psi}{dx} = -2ax\,(Ae^{-ax^2})$$

$$\frac{d^2\psi}{dx^2} = -2a(Ae^{-ax^2}) - 2ax(-2ax)Ae^{-ax^2} = (-2a + 4a^2x^2)Ae^{-ax^2}$$

Substituting into Eq. 5.46 and canceling the common factor $Ae^{-ax^2}$ yields 

$$\frac{\hbar^2 a}{m} - \frac{2a^2\hbar^2}{m}x^2 + \frac{1}{2}kx^2 = E \tag{5.47}$$

Equation 5.47 is not an equation to be solved for $x$, because we are looking for 
a solution that is valid for any $x$, not just for one specific value. In order for this 
to hold for any $x$, the coefficients of $x^2$ must cancel and the remaining constants 
must be equal. (That is, consider the equation $bx^2 = c$. It will be true for any and 
all $x$ only if both $b = 0$ and $c = 0$.) Thus 

$$-\frac{2a^2\hbar^2}{m} + \frac{1}{2}k = 0 \quad\text{and}\quad \frac{\hbar^2 a}{m} = E \tag{5.48}$$

which yield 

$$a = \frac{\sqrt{km}}{2\hbar} \quad\text{and}\quad E = \frac{1}{2}\hbar\sqrt{k/m} \tag{5.49}$$

We can also write the energy in terms of the classical frequency $\omega_0 = \sqrt{k/m}$ as 

$$E = \frac{1}{2}\hbar\omega_0 \tag{5.50}$$

The coefficient $A$ is found from the normalization condition (see Problem 20 at the 
end of the chapter). The result, which is valid only for this ground-state wave function, 
is $A = (m\omega_0/\hbar\pi)^{1/4}$. The complete wave function of the ground state is then 

$$\psi(x) = \left(\frac{m\omega_0}{\hbar\pi}\right)^{1/4} e^{-(\sqrt{km}/2\hbar)x^2} \tag{5.51}$$

The probability density for this wave function is illustrated in Figure 5.20. Note 
that, as in the case of the finite potential energy well, the probability density can 
penetrate into the forbidden region beyond the classical turning points at $x = \pm x_0$ 
(in this region the potential energy is greater than $E$). 
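The claim that $\psi = Ae^{-ax^2}$ solves Eq. 5.46 only for the particular $a$ and $E$ of Eq. 5.49 can be checked numerically. The sketch below works in units where $\hbar = m = k = 1$ (an assumption made here for convenience, giving $a = 1/2$ and $E = 1/2$) and approximates the second derivative with a central difference; the function name is illustrative.

```python
import math


def residual(x, a=0.5, energy=0.5, h=1e-4):
    """Left side minus right side of Eq. 5.46 for psi = exp(-a x^2), with hbar = m = k = 1."""
    def psi(s):
        return math.exp(-a * s * s)

    # Central-difference approximation to the second derivative of psi at x
    d2psi = (psi(x + h) - 2.0 * psi(x) + psi(x - h)) / h**2
    return -0.5 * d2psi + 0.5 * x * x * psi(x) - energy * psi(x)


if __name__ == "__main__":
    for x in (0.0, 0.5, 1.0, 2.0):
        print(x, residual(x), residual(x, a=1.0))
```

With $a = 1/2$ and $E = 1/2$ the residual is zero to within the accuracy of the finite difference at every $x$, while a different value such as $a = 1$ leaves a residual that depends on $x$. This is the numerical counterpart of requiring the coefficient of $x^2$ in Eq. 5.47 to vanish.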
The solution we have found corresponds only to the ground state of the 
oscillator. The general solution is of the form $\psi_n(x) = A f_n(x) e^{-ax^2}$, where $f_n(x)$ is 
a polynomial in which the highest power of $x$ is $x^n$. The corresponding energies are 

$$E_n = \left(n + \frac{1}{2}\right)\hbar\omega_0 \qquad n = 0, 1, 2, \ldots \tag{5.52}$$

These levels are shown in Figure 5.21. Note that they are uniformly spaced, 
in contrast to the one-dimensional infinite potential energy well. Probability 
densities are shown in Figure 5.22. All of the solutions have the property of 
penetration of probability density into the forbidden region beyond the classical 
turning points. The probability density oscillates, somewhat like a sine wave, 
between the turning points, and decreases like $e^{-2ax^2}$ to zero beyond the turning 
points. Note the great similarity between the probability densities for the quantum 
oscillator and those of the finite potential energy well (Figure 5.14). 
A sequence of vibrational excited states similar to Figure 5.21 is commonly 
found in diatomic molecules such as HCl (see Chapter 9). The spacing between 
the states is typically 0.1 to 1 eV; the states are observed when photons (in the 
infrared region of the spectrum) are emitted or absorbed as the molecule jumps 
from one state to another. A similar sequence is observed in nuclei, where the 
spacing is 0.1 to 1 MeV and the radiations are in the gamma-ray region of the 
spectrum. 

FIGURE 5.21 Energy levels of the simple harmonic oscillator. Note that 
the levels have equal spacings and that the distance between the classical 
turning points increases with energy. 

FIGURE 5.22 Probability densities for the simple harmonic oscillator. Note how the distance between the classical 
turning points (marked by the short vertical lines) increases with energy. Compare with the probability densities for 
the finite potential energy well (Figure 5.14). 


Example 5.5 

An electron is bound to a region of space by a springlike 
force with an effective spring constant of $k = 95.7$ eV/nm$^2$. 
(a) What is its ground-state energy? (b) How much energy 
must be absorbed for the electron to jump from the ground 
state to the second excited state? 

Solution 
(a) The ground-state energy is 

$$E = \frac{1}{2}\hbar\omega_0 = \frac{1}{2}\hbar\sqrt{\frac{k}{m}} = \frac{1}{2}\hbar c\sqrt{\frac{k}{mc^2}} = \frac{1}{2}(197\ \text{eV}\cdot\text{nm})\sqrt{\frac{95.7\ \text{eV/nm}^2}{0.511 \times 10^6\ \text{eV}}} = 1.35\ \text{eV}$$

(b) The difference between adjacent energy levels is 
$\hbar\omega_0 = 2.70$ eV for all energy levels, so the energy that 
must be absorbed to go from the ground state to the second 
excited state is $\Delta E = 2 \times 2.70\ \text{eV} = 5.40\ \text{eV}$. 
Example 5.6 

For the electron of Example 5.5 in its ground state, what 
is the probability to find it in a narrow interval of width 
0.004 nm located halfway between the equilibrium position 
and the classical turning point? 

Solution 
First we need to find the location of the turning point. At 
the classical turning points $x = \pm x_0$, the kinetic energy is 
zero and so the total energy is all potential. Thus $E = \frac{1}{2}kx_0^2$, 
and so 

$$x_0 = \sqrt{\frac{2E}{k}} = \sqrt{\frac{2(1.35\ \text{eV})}{95.7\ \text{eV/nm}^2}} = 0.168\ \text{nm}$$

Evaluating the parameters of the wave function $Ae^{-ax^2}$ (the 
normalization constant $A$ and the exponential coefficient 
$a$), we have 

$$A = \left(\frac{m\omega_0}{\hbar\pi}\right)^{1/4} = \left(\frac{mc^2\,\hbar\omega_0}{\hbar^2c^2\pi}\right)^{1/4} = \left(\frac{(0.511 \times 10^6\ \text{eV})(2.70\ \text{eV})}{(197\ \text{eV}\cdot\text{nm})^2\,\pi}\right)^{1/4} = 1.83\ \text{nm}^{-1/2}$$

$$a = \frac{\sqrt{km}}{2\hbar} = \frac{\sqrt{k\,mc^2}}{2\hbar c} = \frac{\sqrt{(95.7\ \text{eV/nm}^2)(0.511 \times 10^6\ \text{eV})}}{2(197\ \text{eV}\cdot\text{nm})} = 17.74\ \text{nm}^{-2}$$

The probability in the interval $dx = 0.004$ nm at $x = x_0/2 = 0.084$ nm is then 

$$P(x)\,dx = |\psi(x)|^2\,dx = A^2 e^{-2ax^2}\,dx = (1.83\ \text{nm}^{-1/2})^2\, e^{-2(17.74\ \text{nm}^{-2})(0.084\ \text{nm})^2}\,(0.004\ \text{nm}) = 0.0104 = 1.04\%$$
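The numbers in Examples 5.5 and 5.6 can be reproduced with a few lines of arithmetic. This is a sketch under the same rounded constants the examples use ($\hbar c = 197$ eV·nm, $mc^2 = 0.511 \times 10^6$ eV); the variable and function names are chosen here for illustration.

```python
import math

HBARC = 197.0     # hbar*c in eV*nm (rounded, as in the examples)
MC2 = 0.511e6     # electron rest energy in eV
K = 95.7          # effective spring constant in eV/nm^2

hbar_omega = HBARC * math.sqrt(K / MC2)                 # level spacing, eV
e_ground = 0.5 * hbar_omega                             # Eq. 5.50
x0 = math.sqrt(2.0 * e_ground / K)                      # classical turning point, nm
a = math.sqrt(K * MC2) / (2.0 * HBARC)                  # exponential coefficient, nm^-2
A = (MC2 * hbar_omega / (HBARC**2 * math.pi)) ** 0.25   # normalization, nm^-1/2


def ground_state_probability(x_nm, dx_nm):
    """P(x) dx = A^2 exp(-2 a x^2) dx for the ground state."""
    return A**2 * math.exp(-2.0 * a * x_nm**2) * dx_nm


if __name__ == "__main__":
    print(f"E = {e_ground:.2f} eV, spacing = {hbar_omega:.2f} eV")
    print(f"x0 = {x0:.3f} nm, a = {a:.2f} nm^-2, A = {A:.2f} nm^-1/2")
    print(f"P = {ground_state_probability(x0 / 2.0, 0.004):.4f}")
```

One feature worth noticing is that at $x = x_0/2$ the exponent $2ax^2$ equals exactly $1/4$ whatever the values of $k$ and $m$, because $ax_0^2 = 1/2$ for the ground state.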








As we did in the case of the infinite potential energy well, let's look at the 
application of the uncertainty principle to the wave packet represented by the 
harmonic oscillator. Using the results of Problems 22 and 23 for the uncertainties 
in position and momentum for the ground state of the oscillator, $\Delta x = \sqrt{\hbar/2m\omega_0}$ 
and $\Delta p = \sqrt{\hbar\omega_0 m/2}$, the product of the uncertainties is $\Delta x\,\Delta p = \hbar/2$. This is the 
minimum possible value for this product, according to Eq. 4.10. The ground state 
of the oscillator thus represents the most "compact" wave packet in which the 
product of the uncertainties has its smallest value. You can see from Figure 5.22 
that the excited states of the oscillator are much less compact (more spread out) 
than the ground state. 


5.6 STEPS AND BARRIERS 


In this general type of problem, we analyze what happens when a particle moving 
(again in one dimension) in a region of constant potential energy suddenly moves 
into a region of different, but also constant, potential energy. We will not discuss 
in detail the solutions to these problems, but the methods of solution of each are 
so similar that we can outline the steps to take in the solution. In this discussion, 
we let $E$ be the (fixed) total energy of the particle and $U_0$ be the value of 
the constant potential energy. In these calculations, the particle is not confined, 
so the energy is not quantized—we are free to choose any value for the particle 
energy. 


Potential Energy Step, $E > U_0$ 


Consider the potential energy step shown in Figure 5.23:

$$U(x) = \begin{cases} 0 & x < 0 \\ U_0 & x > 0 \end{cases} \tag{5.53}$$

If the total energy E of the particle is greater than $U_0$, then we can write the
solutions to the Schrödinger equation in the two regions based on the general
form of Eq. 5.16:

$$\psi_0(x) = A\sin k_0x + B\cos k_0x \qquad k_0 = \sqrt{\frac{2mE}{\hbar^2}} \qquad x < 0 \tag{5.54a}$$

$$\psi_1(x) = C\sin k_1x + D\cos k_1x \qquad k_1 = \sqrt{\frac{2m}{\hbar^2}(E - U_0)} \qquad x > 0 \tag{5.54b}$$

Relationships among the four coefficients, A, B, C, and D, may be found by
applying the condition that $\psi(x)$ and $\psi'(x) = d\psi/dx$ must be continuous at the
boundary; thus $\psi_0(0) = \psi_1(0)$ and $\psi_0'(0) = \psi_1'(0)$. A typical solution might look
like Figure 5.24. Note the smooth transition between the solutions at x = 0, which
results from applying the continuity conditions.

The coefficients A, B, C, and D are in general complex, so to visualize the
complete wave we need both the real and imaginary parts of $\psi$. We can use the
equation $e^{i\theta} = \cos\theta + i\sin\theta$ to transform these solutions from sines and cosines
to complex exponentials:

$$\psi_0(x) = A'e^{ik_0x} + B'e^{-ik_0x} \qquad x < 0 \tag{5.55a}$$

$$\psi_1(x) = C'e^{ik_1x} + D'e^{-ik_1x} \qquad x > 0 \tag{5.55b}$$


The coefficients $A'$, $B'$, $C'$, $D'$ can be found from the coefficients A, B, C, D. The
time-dependent wave functions are obtained by multiplying each term by $e^{-i\omega t}$,
which gives

$$\Psi_0(x,t) = A'e^{i(k_0x - \omega t)} + B'e^{-i(k_0x + \omega t)} \tag{5.56a}$$

$$\Psi_1(x,t) = C'e^{i(k_1x - \omega t)} + D'e^{-i(k_1x + \omega t)} \tag{5.56b}$$

FIGURE 5.23 A step of height $U_0$. Particles are incident from the left with
energy E. The kinetic energy is equal to E in the region x < 0 and is reduced
to $E - U_0$ in the region x > 0.



































FIGURE 5.24 Wave function for electrons incident from the left on a potential energy step for
$E > U_0$. The probability density and the real and imaginary parts of the wave function are shown for
(a) t = 0 and (b) t = 1/4 period. The vertical line marks the location of the step.

We can then make the following identification of the component waves, recalling
that $(kx - \omega t)$ is the phase of a wave moving in the positive x direction, while
$(kx + \omega t)$ is the phase of a wave moving in the negative x direction, and
assuming that the squared magnitude of each coefficient gives the intensity of
the corresponding component wave. In the region x < 0, Eq. 5.56a describes
the superposition of a wave $e^{i(k_0x - \omega t)}$ of intensity $|A'|^2$ moving in the positive x
direction (from $-\infty$ to 0) and a wave $e^{-i(k_0x + \omega t)}$ of intensity $|B'|^2$ moving in the
negative x direction. Suppose we had intended our solution to describe particles
that are incident from the left on this step. Then $|A'|^2$ gives the intensity of the
incident wave (more exactly, the de Broglie wave describing the incident beam of
particles) and $|B'|^2$ gives the intensity of the reflected wave. The ratio $|B'|^2/|A'|^2$
tells us the reflected fraction of the incident wave intensity.

In the region x > 0, Eq. 5.56b describes the transmitted wave $e^{i(k_1x - \omega t)}$ of
intensity $|C'|^2$ moving to the right and a wave $e^{-i(k_1x + \omega t)}$ of intensity $|D'|^2$ moving
to the left. If particles are incident from $-\infty$, it is not possible to have particles in
the region x > 0 moving to the left, so in this particular experimental situation we 
are justified in setting D’ to zero. 

Figure 5.24a shows that the probability density has the same value everywhere 
in the region x > 0. You can see this immediately from Eq. 5.56b with $D' = 0$;
taking the squared magnitude of the remaining term gives a constant result,
independent of x and t. This is consistent with what we expect for the de Broglie
wave of free particles; the particles can be found anywhere in the region x > 0 
with equal probability. 

In the region x < 0, the incident and reflected waves combine to produce a 
standing wave, for which the probability density has fixed maxima and minima. 
The probability density in this region does not vary with time, as suggested by the 
plots for the two different times (t = 0 and t = 1/4 period) shown in Figure 5.24.

To illustrate the propagation of the de Broglie wave, it is instructive also 
to plot the real and imaginary parts of the wave function, which are shown in 
Figure 5.24. Here you can see the change in wavelength (corresponding to the 
change in kinetic energy or momentum) in crossing the step. You can also see 
something of the time dependence—the wave propagates in both regions, but it 
does so in a way that the real and imaginary parts combine to give a probability 
density that remains unchanged in time. 
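
The continuity conditions for this case can be worked through numerically. The Python sketch below is a minimal illustration under stated assumptions: it takes $D' = 0$ and $A' = 1$, solves $A' + B' = C'$ and $k_0(A' - B') = k_1C'$ for the ratios $B'/A'$ and $C'/A'$, and evaluates the probability density from Eq. 5.56. The electron mass, the rounded constant $\hbar c = 197.327$ eV·nm, the energies in the usage lines, and the function names are illustrative choices, not values from the text. The transmitted fraction is computed as $(k_1/k_0)|C'|^2/|A'|^2$; the factor $k_1/k_0$ accounts for the different particle speeds in the two regions and is a standard result that is assumed here, not derived in this section.

```python
import cmath
import math

HBARC = 197.327   # hbar*c in eV nm (rounded)
MC2 = 0.511e6     # electron rest energy in eV (illustrative particle)

def step_coefficients(E, U0, mc2=MC2):
    """Return (k0, k1, B'/A', C'/A') for a potential step with E > U0 and D' = 0."""
    if E <= U0:
        raise ValueError("this sketch covers only E > U0")
    k0 = math.sqrt(2.0 * mc2 * E) / HBARC          # wave number for x < 0, nm^-1
    k1 = math.sqrt(2.0 * mc2 * (E - U0)) / HBARC   # wave number for x > 0, nm^-1
    # psi continuous at x = 0:   A' + B' = C'
    # psi' continuous at x = 0:  k0 (A' - B') = k1 C'
    r = (k0 - k1) / (k0 + k1)    # B'/A'
    t = 2.0 * k0 / (k0 + k1)     # C'/A'
    return k0, k1, r, t

def probability_density(x, omega_t, E, U0):
    """|Psi(x,t)|^2 from Eq. 5.56 with A' = 1; omega_t is the phase omega*t."""
    k0, k1, r, t = step_coefficients(E, U0)
    if x < 0:
        psi = cmath.exp(1j * (k0 * x - omega_t)) + r * cmath.exp(-1j * (k0 * x + omega_t))
    else:
        psi = t * cmath.exp(1j * (k1 * x - omega_t))
    return abs(psi) ** 2

if __name__ == "__main__":
    k0, k1, r, t = step_coefficients(E=10.0, U0=6.0)
    print("reflected fraction:  ", r ** 2)
    print("transmitted fraction:", (k1 / k0) * t ** 2)
```

Evaluating `probability_density` at a fixed x for two values of `omega_t` shows the behavior described above: the density is independent of time on both sides of the step and independent of x for x > 0.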


Potential Energy Step, $E < U_0$


If the energy of the particle is less than the height of the potential energy step,
then the solution in the region x > 0 is of the form of Eq. 5.18:

$$\psi_0(x) = A\sin k_0x + B\cos k_0x \qquad k_0 = \sqrt{\frac{2mE}{\hbar^2}} \qquad x < 0 \tag{5.57a}$$

$$\psi_1(x) = Ce^{k_1x} + De^{-k_1x} \qquad k_1 = \sqrt{\frac{2m}{\hbar^2}(U_0 - E)} \qquad x > 0 \tag{5.57b}$$

We set C = 0 to keep $\psi_1(x)$ from becoming infinite as $x \to \infty$, and we apply
the boundary conditions on $\psi(x)$ and $\psi'(x)$ at x = 0. The resulting solution is
shown in Figure 5.25. The probability density for x > 0 shows penetration into the
classically forbidden region. All particles are reflected from the barrier; that is, if
we write $\Psi_0(x,t)$ in the form of Equation 5.56a, then we must have $|A'| = |B'|$,
indicating that the waves moving to the right (the incident wave) and to the left
(the reflected wave) in the region x < 0 have equal amplitudes.

FIGURE 5.25 Wave function for electrons incident from the left on a potential energy step for
$E < U_0$. The probability density and the real and imaginary parts of the wave function are shown
for (a) t = 0 and (b) t = 1/4 period. The vertical line marks the location of the step.


Figure 5.25 shows the probability density at two different times, illustrating that 
the probability density does not change with time. In the region x < 0 we again 
have standing waves with fixed maxima and minima. Viewing the real and imaginary
parts at two different times (t = 0 and t = 1/4 period) shows that the wave
is propagating, even though the probability density does not change with time. 

Penetration into the forbidden region is associated with the wave nature of the
particle and also with the uncertainty in the particle’s energy or location. The
probability density in the x > 0 region is $|\psi_1|^2$, which according to Eq. 5.57b is
proportional to $e^{-2k_1x}$. If we define a representative penetration distance $\Delta x$ to be
the distance over which the probability drops to 1/e of its value at the step,
then $e^{-2k_1\Delta x} = e^{-1}$ and so

$$\Delta x = \frac{1}{2k_1} = \frac{1}{2}\,\frac{\hbar}{\sqrt{2m(U_0 - E)}} \tag{5.58}$$
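
To get a feeling for the size of the distance in Eq. 5.58, it can be evaluated for an electron. The short Python sketch below is illustrative only: the constants $\hbar c = 197.327$ eV·nm and $mc^2 = 0.511$ MeV are rounded values, the energy deficit of 1 eV is an assumed example, and `penetration_distance` is a hypothetical helper name.

```python
import math

HBARC = 197.327          # hbar*c in eV nm (rounded)
ELECTRON_MC2 = 0.511e6   # electron rest energy in eV (rounded)

def penetration_distance(deficit_eV, mc2_eV=ELECTRON_MC2):
    """Eq. 5.58 in nm: hbar / (2 sqrt(2 m (U0 - E))), with deficit_eV = U0 - E."""
    if deficit_eV <= 0:
        raise ValueError("Eq. 5.58 applies only for E < U0")
    # Multiply numerator and denominator by c so that only hbar*c and mc^2 appear
    return HBARC / (2.0 * math.sqrt(2.0 * mc2_eV * deficit_eV))

if __name__ == "__main__":
    print("electron, U0 - E = 1 eV:", penetration_distance(1.0), "nm")
```

For an electron with $U_0 - E = 1$ eV this gives roughly 0.1 nm, comparable to the size of an atom. Because $\Delta x$ varies as $1/\sqrt{m(U_0 - E)}$, a heavier particle or a larger energy deficit gives a shorter penetration distance.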





To be able to enter the region with x > 0, the particle must gain an energy 
of at least $U_0 - E$ in order to get over the potential energy step; it must in
addition gain some kinetic energy if it is to move in the region x > 0. Of course,
it is a violation of conservation of energy for the particle to spontaneously gain
any amount of energy, but according to the uncertainty relationship $\Delta E\,\Delta t \sim \hbar$,
conservation of energy does not apply at times smaller than $\Delta t$ except to within
an amount $\Delta E \sim \hbar/\Delta t$. That is, if the particle “borrows” an amount of energy
$\Delta E$ and “returns” the borrowed energy within a time $\Delta t \sim \hbar/\Delta E$, we observers
will still believe energy is conserved. Suppose the particle borrows an energy 
sufficient to give it a kinetic energy of K in the forbidden region. How far into the 
forbidden region does the particle penetrate? 

The “borrowed” energy is $(U_0 - E) + K$; the energy $(U_0 - E)$ gets the particle
to the top of the step, and the extra kinetic energy K gives it its motion. The
energy must be returned within a time

$$\Delta t = \frac{\hbar}{U_0 - E + K} \tag{5.59}$$

The particle moves with speed $v = \sqrt{2K/m}$, and so the distance it can travel is

$$\Delta x = \frac{1}{2}\,v\,\Delta t = \frac{1}{2}\sqrt{\frac{2K}{m}}\,\frac{\hbar}{U_0 - E + K} \tag{5.60}$$

(The factor of 1/2 is present because in the time $\Delta t$ the particle must penetrate the
distance $\Delta x$ into the forbidden region and return through that same distance to the
allowed region.)

FIGURE 5.26 A barrier of height $U_0$ and width L.

FIGURE 5.27 The real part of the wave function of a particle of energy
$E < U_0$ encountering a barrier (the particle is incident from the left in
the figure). The wavelength $\lambda$ is the same on both sides of the barrier, but
the amplitude beyond the barrier is much less than the original amplitude.

In the limit $K \to 0$, the penetration distance $\Delta x$ goes to 0 according to Eq. 5.60
because the particle has zero velocity; similarly, $\Delta x \to 0$ in the limit $K \to \infty$,
because it moves for a vanishing time interval $\Delta t$. In between those limits, there
must be a maximum value of $\Delta x$ for some particular K. Differentiating Eq. 5.60
with respect to K, we can find the maximum value

$$\Delta x_{\max} = \frac{1}{2}\,\frac{\hbar}{\sqrt{2m(U_0 - E)}} \tag{5.61}$$


This value of $\Delta x$ is identical with Eq. 5.58! This demonstrates that the penetration
into the forbidden region given by the solution to the Schrödinger equation is
entirely consistent with the uncertainty relationship. (The agreement between
Eqs. 5.58 and 5.61 is really somewhat accidental, because the factor 1/e used to
obtain Eq. 5.58 was chosen arbitrarily. What we have really demonstrated is that
the estimates of uncertainty given by the Heisenberg relationships are consistent
with the wave properties of the particle obtained from the Schrödinger equation.
This should not be surprising, because the uncertainty principle can be derived as
a consequence of the Schrödinger equation.)
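
The agreement between Eqs. 5.58 and 5.61 can also be confirmed numerically without differentiating. The Python sketch below is an illustration under assumed values: an electron with $U_0 - E = 1$ eV, rounded constants $\hbar c = 197.327$ eV·nm and $mc^2 = 0.511$ MeV, and a simple scan over K instead of calculus. The function names are hypothetical.

```python
import math

HBARC = 197.327   # hbar*c in eV nm (rounded)
MC2 = 0.511e6     # electron rest energy in eV (rounded)

def borrowed_energy_distance(K, deficit, mc2=MC2):
    """Eq. 5.60 in nm: (1/2) sqrt(2K/m) * hbar / (U0 - E + K), with deficit = U0 - E."""
    v_over_c = math.sqrt(2.0 * K / mc2)          # nonrelativistic speed in units of c
    return 0.5 * v_over_c * HBARC / (deficit + K)

def maximize_over_K(deficit, steps=20000):
    """Scan K from 0 to 10*(U0 - E) and return (K, Delta x) at the largest Delta x."""
    best_K, best_dx = 0.0, 0.0
    for i in range(1, steps + 1):
        K = 10.0 * deficit * i / steps
        dx = borrowed_energy_distance(K, deficit)
        if dx > best_dx:
            best_K, best_dx = K, dx
    return best_K, best_dx

if __name__ == "__main__":
    deficit = 1.0   # U0 - E in eV (assumed example value)
    K_best, dx_best = maximize_over_K(deficit)
    eq_5_58 = HBARC / (2.0 * math.sqrt(2.0 * MC2 * deficit))
    print("maximum of Eq. 5.60:", dx_best, "nm at K =", K_best, "eV")
    print("Eq. 5.58:           ", eq_5_58, "nm")
```

The scan only locates the maximum to within the grid spacing, so it is a check on the analytic result and not a substitute for it.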


Potential Energy Barrier 
Consider now the potential energy barrier shown in Figure 5.26:

$$U(x) = \begin{cases} 0 & x < 0 \\ U_0 & 0 \le x \le L \\ 0 & x > L \end{cases} \tag{5.62}$$


Particles with energy E less than $U_0$ are incident from the left. Our experience
then leads us to expect solutions of the form shown in Figure 5.27—sinusoidal 
oscillation in the region x < 0 (an incident wave and a reflected wave), exponen- 
tials in the region 0 < x < L, and sinusoidal oscillations in the region x > L (the 
transmitted wave). Note that the intensity of the transmitted wave (x > L) is much 
smaller than the intensity of the incident + reflected waves (x < 0), which means 
that most of the particles are reflected and few are transmitted through the barrier. 
Also note that the wavelengths are the same on either side of the barrier (because 
the kinetic energies are the same). 

The intensity of the transmitted wave, which can be found by application of the 
continuity conditions, depends on the energy of the particle and on the height and 
thickness of the barrier. Classically, the particles should never appear at x > L, 
because they do not have sufficient energy to overcome the barrier. This situation 
is an example of barrier penetration, sometimes called quantum mechanical 
tunneling. Particles cannot be observed while they are in the classically forbidden
region 0 < x < L, but can “tunnel” through that region and be observed at x > L. 

Every particle incident on the barrier of Figure 5.26 is either reflected or 
transmitted; the number of incident particles is equal to the number reflected 
back to x < 0 plus the number transmitted to x > L. None are “trapped” or ever 
seen in the forbidden region 0 < x < L. How can the incident particle get from
x < 0 to x > L? As a classical particle, it can’t! However, the wave representing
the particle can penetrate through the barrier, which allows the particle to be 
observed in the classically allowed region x > L. 
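
To see how strongly the transmitted intensity depends on the barrier, the following Python sketch evaluates the transmission probability for a rectangular barrier. It uses the standard closed-form result $T = [1 + U_0^2\sinh^2(k_1L)/4E(U_0 - E)]^{-1}$ with $k_1 = \sqrt{2m(U_0 - E)}/\hbar$, which follows from the continuity conditions but is quoted here without derivation and is not given in this section. The electron mass, the rounded constants, the sample energies and widths, and the function name are all assumptions made for illustration.

```python
import math

HBARC = 197.327   # hbar*c in eV nm (rounded)
MC2 = 0.511e6     # electron rest energy in eV (rounded)

def barrier_transmission(E, U0, L, mc2=MC2):
    """Transmission probability for 0 < E < U0 through a barrier of width L (in nm)."""
    if not 0.0 < E < U0:
        raise ValueError("this sketch requires 0 < E < U0")
    k1 = math.sqrt(2.0 * mc2 * (U0 - E)) / HBARC   # decay constant in the barrier, nm^-1
    s = math.sinh(k1 * L)
    return 1.0 / (1.0 + (U0 * U0 * s * s) / (4.0 * E * (U0 - E)))

if __name__ == "__main__":
    # Assumed example: 3-eV electrons incident on a 5-eV barrier of varying width
    for width in (0.2, 0.5, 1.0):
        print("L =", width, "nm   T =", barrier_transmission(3.0, 5.0, width))
```

For barriers that are thick compared with $1/k_1$, the result falls off approximately as $e^{-2k_1L}$, which is why the transmitted fraction decreases so rapidly with increasing thickness.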


FIGURE 5.28 (a) Total internal reflection of light waves at a glass-air
boundary. (b) Frustrated total internal reflection. The thicker the air
gap, the smaller the probability to penetrate. Note that the light beam
does not appear in the gap.


This phenomenon of penetration of a forbidden region is a well-known property 
of classical waves. Quantum physics provides a new aspect to this phenomenon by 
associating a particle with the wave, and thus allowing a particle to pass through 
a classically forbidden region. An example of the penetration effect for classical 
waves occurs for total internal reflection* of light waves. Figure 5.28a shows a 
light beam in glass incident on a boundary with air. The beam is totally reflected 
in the glass. However, if a second piece of glass is brought close to the first, as 
in Figure 5.28b, the beam can appear in the second piece of glass. This effect is
called frustrated total internal reflection. The intensity of the beam in the second
piece, represented by the widths of the arrows in Figure 5.28b, decreases rapidly
as the thickness of the gap increases. 

Just like our unobservable quantum wave, which penetrates a few wavelengths 
into the forbidden region, an unobservable light wave of exponentially decreasing 
amplitude, the evanescent wave, penetrates into the air even when the light wave 
undergoes total reflection in the glass. The evanescent wave carries no energy 
away from the interface, so it cannot be directly observed in the air, but it can be 
observed in another medium such as a second piece of glass placed close to the 
first. Evanescent waves have applications in microscopy, where they enable the 
production of images of individual molecules. 

Although the potential energy barrier of Figure 5.26 is rather artificial, there 
are many practical examples of quantum tunneling: 


1. Alpha Decay. An atomic nucleus consists of protons and neutrons in a
constant state of motion; occasionally these particles form themselves into an
aggregate of two protons and two neutrons, called an alpha particle. In one
form of radioactive decay, the nucleus can emit an alpha particle, which can
be detected in the laboratory. However, in order to escape from the nucleus
the alpha particle must penetrate a barrier of the form shown in Figure 5.29.
The probability for the alpha particle to penetrate the barrier, and be detected
in the laboratory, can be computed based on the energy of the alpha particle
and the height and thickness of the barrier. The decay probability can be
measured in the laboratory, and it is found to be in excellent agreement with
the value obtained from a quantum-mechanical calculation based on barrier
penetration.

*Total internal reflection occurs when a light beam is incident on a boundary between two substances,
such as glass and air, from the side with the higher index of refraction. If the angle of incidence inside
the glass exceeds a certain critical value, the light beam is totally reflected back into the glass.

FIGURE 5.29 (a) A nuclear potential energy barrier is penetrated by
an alpha particle of energy E. (b) A representation of the real part of the
wave function of the alpha particle. The probability to penetrate the barrier
depends strongly on the energy of the alpha particle.

2. Ammonia Inversion. Figure 5.30 is a representation of the ammonia molecule
$\mathrm{NH_3}$. If we were to try to move the nitrogen atom along the axis of the molecule,
toward the plane of the hydrogen atoms, we would find repulsion caused by the three
hydrogen atoms, which produces a potential energy of the form shown in
Figure 5.31. According to classical mechanics, unless we give the nitrogen
atom sufficient energy, it should not be able to surmount the barrier and appear
on the other side of the plane of hydrogens. According to quantum mechanics,
the nitrogen can tunnel through the barrier and appear on the other side of the
molecule. In fact, the nitrogen atom actually tunnels back and forth with a
frequency in excess of $10^{10}$ oscillations per second.

3. The Tunnel Diode. A tunnel diode is an electronic device that uses the
phenomenon of tunneling. Schematically, the potential energy of an electron
in a tunnel diode can be represented by Figure 5.32. The current that flows
through the device is produced by electrons tunneling through the barrier.
The rate of tunneling, and therefore the current, can be regulated merely by
changing the height of the barrier, which can be done with an applied voltage.
This can be done rapidly, so that switching frequencies in excess of $10^9$ Hz
can be obtained. Ordinary semiconductor diodes depend on the diffusion of
electrons across a junction, and therefore operate on much longer time scales
(that is, at lower frequencies).

FIGURE 5.30 A schematic diagram of the ammonia molecule.
The Coulomb repulsion of the three hydrogens establishes a barrier
against the nitrogen atom moving to a symmetric position
(shown in dashed lines) on the opposite side of the plane of
hydrogens.

FIGURE 5.31 The potential energy seen by the nitrogen atom
in an ammonia molecule. The nitrogen can penetrate the barrier
and move from one equilibrium position to another.

FIGURE 5.32 The potential energy barrier seen by an electron
in a tunnel diode. The conductivity of the device is
determined by the electron’s probability to penetrate the barrier,
which depends on the height of the barrier.

4. The Scanning Tunneling Microscope. Images of individual atoms on the
surface of materials (such as Figure 5.18) can be made with the scanning
tunneling microscope. Electrons are trapped in a surface by a potential energy
barrier (the work function of the material). When a needlelike probe is placed
within about 1 nm of the surface (Figure 5.33), electrons can tunnel through
the barrier between the surface and the probe and produce a current that can
be recorded in an external circuit. The current is very sensitive to the width of
the barrier (the distance from the probe to the surface). In practice, a feedback
mechanism keeps the current constant by moving the tip up and down. The
motion of the tip gives a map of the surface that reveals details smaller
than 0.01 nm, about 1/100 the diameter of an atom! For the development of
the scanning tunneling microscope, Gerd Binnig and Heinrich Rohrer were
awarded the 1986 Nobel Prize in physics.

FIGURE 5.33 In a scanning tunneling microscope, a needlelike probe is
scanned over a surface. The probe is moved vertically so that the distance
between the probe and the surface remains constant as the probe scans
laterally.
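
The sensitivity of the tunneling current to the probe distance can be estimated with a rough model. The Python sketch below assumes that the current is proportional to $e^{-2k_1d}$, where d is the gap width and $k_1 = \sqrt{2m\phi}/\hbar$ is set by the work function $\phi$. Both this exponential form (the thick-barrier approximation) and the value $\phi = 4$ eV are assumptions for illustration and are not taken from the text; the function name is hypothetical.

```python
import math

HBARC = 197.327   # hbar*c in eV nm (rounded)
MC2 = 0.511e6     # electron rest energy in eV (rounded)

def tunneling_current_ratio(gap_change_nm, work_function_eV=4.0):
    """Ratio I(d + gap_change) / I(d), assuming I is proportional to exp(-2 k1 d)."""
    k1 = math.sqrt(2.0 * MC2 * work_function_eV) / HBARC   # nm^-1
    return math.exp(-2.0 * k1 * gap_change_nm)

if __name__ == "__main__":
    for change in (0.01, 0.05, 0.1):
        print("gap increased by", change, "nm -> current ratio",
              tunneling_current_ratio(change))
```

In this model, moving the probe away by only 0.1 nm reduces the current to roughly one-eighth of its original value, which suggests why a feedback loop that holds the current constant can track height changes much smaller than an atomic diameter.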
Questions

1. Newton’s laws can be solved to give the future behavior of
a particle. In what sense does the Schrödinger equation also
do this? In what sense does it not?

2. Why is it important for a wave function to be normalized? Is
an unnormalized wave function a solution to the Schrödinger
equation?

3. What is the physical meaning of $\int_{-\infty}^{+\infty} |\psi|^2\,dx = 1$?

4. What are the dimensions of $\psi(x)$? Of $\psi(x, y)$?

5. None of the following are permitted as solutions of the
Schrödinger equation. Give the reasons in each case.
(a) $\psi(x) = A\cos kx$ for x < 0; $\psi(x) = B\sin kx$ for x > 0
(b) $\psi(x) = Ax^{-1}e^{-kx}$ for $-L \le x \le L$
(c) $\psi(x) = A\sin^{-1}kx$
(d) $\psi(x) = A\tan kx$ for x > 0

6. What happens to the probability density in the infinite well
when $n \to \infty$? Is this consistent with classical physics?

7. How would the solution to the infinite potential energy well
be different if the well extended from $x = x_0$ to $x = x_0 + L$,
where $x_0$ is a nonzero value of x? Would any of the measurable
properties be different?

8. How would the solution to the one-dimensional infinite
potential energy well be different if the potential energy
were not zero for 0 < x < L but instead had a constant value
$U_0$? What would be the energies of the excited states? What
would be the wavelengths of the standing de Broglie waves?
Sketch the behavior of the lowest two wave functions.

9. Assuming a pendulum to behave like a quantum oscillator,
what are the energy differences between the quantum
states of a pendulum of length 1 m? Are such differences
observable?

10. For the potential energy barrier (Figure 5.26), is the wavelength
for x > L the same as the wavelength for x < 0? Is
the amplitude the same?

11. Suppose particles were incident on the potential energy step
from the positive x direction. Which of the four coefficients
of Eq. 5.56 would be set to zero? Why?

12. The energies of the excited states of the systems we have discussed
in this chapter have been exact; there is no energy
uncertainty. What does this suggest about the lifetime of
particles in those excited states? Left on its own, will a
particle ever make transitions from one state to another?

13. Explain how the behavior of a particle in a one-dimensional
infinite well can be considered in terms of standing de
Broglie waves.

14. How would you design an experiment to observe barrier
penetration with sound waves? What range of thicknesses
would you choose for the barrier?

15. If $U_0$ were negative in Figure 5.26, how would the wave
functions appear for E > 0?

16. Does Eq. 5.2 imply that we know the momentum of the
particle exactly? If so, what does the uncertainty principle
indicate about our knowledge of its location? How can you
reconcile this with our knowledge that the particle must be
in the well?

17. Do sharp boundaries and discontinuous jumps of potential
energy occur in nature? If not, how would our analysis of
potential energy steps and barriers be different?


Problems


5.1 Behavior of a Wave at a Boundary

1. A ball falls from rest at a height H above a lake. Let y = 0
at the surface of the lake. As it falls, it experiences a gravitational
force $-mg$. When it enters the water, it experiences
a buoyant force B so the net force in the water is $B - mg$.
(a) Write expressions for v(t) and y(t) while the ball is falling
in air. (b) In the water, let $v_2(t) = at + b$ and $y_2(t) = \frac{1}{2}at^2 + bt + c$,
where $a = (B - mg)/m$. Use the continuity conditions
at the surface of the water to find the constants b and c.

2. A wave has the form $y = A\cos(2\pi x/\lambda + \pi/3)$ when x < 0.
For x > 0, the wavelength is $\lambda/2$. By applying continuity
conditions at x = 0, find the amplitude (in terms of A) and
phase of the wave in the region x > 0. Sketch the wave,
showing both x < 0 and x > 0.


5.2 Confining a Particle

3. The lowest energy of a particle in an infinite one-dimensional
well is 4.4 eV. If the width of the well is doubled, what is its
lowest energy?

4. An electron is trapped in an infinite well of width 0.120 nm.
What are the three longest wavelengths permitted for the
electron’s de Broglie waves?

5. An electron is trapped in a one-dimensional region of width
0.050 nm. Find the three smallest possible values allowed
for the energy of the electron.

6. What is the minimum energy of a neutron ($mc^2 = 940$ MeV)
confined to a region of space of nuclear dimensions
($1.0 \times 10^{-14}$ m)?


5.3 The Schrödinger Equation

7. In the region 0 < x < a, a particle is described by
the wave function $\psi_1(x) = -b(x^2 - a^2)$. In the region
a < x < w, its wave function is $\psi_2(x) = (x - d)^2 - c$. For
x > w, $\psi_3(x) = 0$. (a) By applying the continuity conditions
at x = a, find c and d in terms of a and b. (b) Find w in terms of
a and b.

8. A particle is described by the wave function
$\psi(x) = b(a^2 - x^2)$ for $-a < x < +a$ and $\psi(x) = 0$ for
$x < -a$ and $x > +a$, where a and b are positive real constants.
(a) Using the normalization condition, find b in terms
of a. (b) What is the probability to find the particle at
x = +a/2 in a small interval of width 0.010a? (c) What is
the probability for the particle to be found between x = +a/2
and x = +a?

9. In a certain region of space, a particle is described by the
wave function $\psi(x) = Cxe^{-bx}$, where C and b are real constants.
By substituting into the Schrödinger equation, find
the potential energy in this region and also find the energy of
the particle. (Hint: Your solution must give an energy that
is a constant everywhere in this region, independent of x.)

10. A particle is represented by the following wave function:

$$\psi(x) = \begin{cases} 0 & x < -L/2 \\ C(2x/L + 1) & -L/2 < x < 0 \\ C(-2x/L + 1) & 0 < x < +L/2 \\ 0 & x > +L/2 \end{cases}$$

(a) Use the normalization condition to find C. (b) Evaluate
the probability to find the particle in an interval of
width 0.010L at x = L/4 (that is, between x = 0.245L and
x = 0.255L). (No integral is necessary for this calculation.)
(c) Evaluate the probability to find the particle between
x = 0 and x = +L/4. (d) Find the average value of x and
the rms value of x: $x_{\rm rms} = \sqrt{(x^2)_{\rm av}}$.


5.4 Applications of the Schrödinger Equation

11. A particle in an infinite well is in the ground state with an
energy of 1.26 eV. How much energy must be added to the
particle to reach the second excited state (n = 3)? The third
excited state (n = 4)?

12. An electron is trapped in an infinitely deep one-dimensional
well of width 0.251 nm. Initially the electron occupies the
n = 4 state. (a) Suppose the electron jumps to the ground
state with the accompanying emission of a photon. What
is the energy of the photon? (b) Find the energies of other
photons that might be emitted if the electron takes other
paths between the n = 4 state and the ground state.

13. Show that Eq. 5.31 gives the value $A = \sqrt{2/L}$.

14. A particle is trapped in an infinite one-dimensional well
of width L. If the particle is in its ground state, evaluate
the probability to find the particle (a) between x = 0 and
x = L/3; (b) between x = L/3 and x = 2L/3; (c) between
x = 2L/3 and x = L.

15. A particle is confined between rigid walls separated by a
distance L = 0.189 nm. The particle is in the second excited
state (n = 3). Evaluate the probability to find the particle in
an interval of width 1.00 pm located at: (a) x = 0.188 nm;
(b) x = 0.031 nm; (c) x = 0.079 nm. (Hint: No integrations
are required for this problem; use Eq. 5.7 directly.)
What would be the corresponding results for a classical
particle?

16. What is the next level (above $E = 50E_0$) of the two-dimensional
particle in a box in which the degeneracy
is greater than 2?

17. A particle is confined to a two-dimensional box of length L
and width 2L. The energy values are $E = (\hbar^2\pi^2/2mL^2)(n_x^2 + n_y^2/4)$.
Find the two lowest degenerate levels.

18. Show by direct substitution that Eq. 5.39 gives a solution to
the two-dimensional Schrödinger equation, Eq. 5.37. Find
the relationship between $k_x$, $k_y$, and E.

19. A particle is confined to a three-dimensional region of
space of dimensions L by L by L. The energy levels
are $(\hbar^2\pi^2/2mL^2)(n_x^2 + n_y^2 + n_z^2)$, where $n_x$, $n_y$, and $n_z$ are
integers $\ge 1$. Sketch an energy level diagram, showing the
energies, quantum numbers, and degeneracies for the lowest
10 energy levels.


5.5 The Simple Harmonic Oscillator

20. Using the normalization condition, show that the constant A
has the value $(m\omega_0/\hbar\pi)^{1/4}$ for the one-dimensional simple
harmonic oscillator in its ground state.

21. (a) At the classical turning points $\pm x_0$ of the simple harmonic
oscillator, K = 0 and so E = U. From this relationship, show
that $x_0 = (\hbar\omega_0/k)^{1/2}$ for an oscillator in its ground state.
(b) Find the turning points in the first and second excited
states.

22. Use the ground-state wave function of the simple harmonic
oscillator to find $x_{\rm av}$, $(x^2)_{\rm av}$, and $\Delta x$. Use the normalization
constant $A = (m\omega_0/\hbar\pi)^{1/4}$.

23. (a) Using a symmetry argument rather than a calculation,
determine the value of $p_{\rm av}$ for a simple harmonic oscillator.
(b) Conservation of energy for the harmonic oscillator can
be used to relate $p^2$ to $x^2$. Use this relation, along with
the value of $(x^2)_{\rm av}$ from Problem 22, to find $(p^2)_{\rm av}$ for the
oscillator in its ground state. (c) Using the results of parts a
and b, show that $\Delta p = \sqrt{\hbar\omega_0 m/2}$.

24. The ground state energy of an oscillating electron is 1.24 eV.
How much energy must be added to the electron to move it
to the second excited state? The fourth excited state?

25. Compare the probabilities for an oscillating particle in its
ground state to be found in a small interval of width dx at
the center of the well and at the classical turning points.




5.6 Steps and Barriers

26. Find the value of K at which Eq. 5.60 has its maximum
value, and show that Eq. 5.61 is the maximum value of $\Delta x$.

27. For a particle with energy $E < U_0$ incident on the potential
energy step, use $\psi_0$ and $\psi_1$ from Eqs. 5.57, and evaluate the
constants B and D in terms of A by applying the boundary
conditions at x = 0.

28. Using the wave functions of Eq. 5.55 for the potential energy
step, apply the boundary conditions on $\psi$ and $d\psi/dx$ to find
$B'$ and $C'$ in terms of $A'$, for the potential step when particles
are incident from the negative x direction. Evaluate the ratios
$|B'|^2/|A'|^2$ and $|C'|^2/|A'|^2$ and interpret.

29. (a) Write down the wave functions for the three regions
of the potential energy barrier (Figure 5.26) for $E < U_0$.
You will need six coefficients in all. Use complex exponential
notation. (b) Use the boundary conditions at x = 0
and at x = L to find four relationships among the six coefficients.
(Do not try to solve these relationships.) (c) Suppose
particles are incident on the barrier from the left. Which
coefficient should be set to zero? Why?

30. Repeat Problem 29 for the potential energy barrier when
$E > U_0$, and sketch a representative probability density that
shows several cycles of the wave function. In your sketch,
make sure the amplitude and wavelength in each region
accurately describe the situation.


General Problems

31. An electron is trapped in a one-dimensional well of width
0.132 nm. The electron is in the n = 10 state. (a) What
is the energy of the electron? (b) What is the uncertainty
in its momentum? (Hint: Use Eq. 4.10.) (c) What is the
uncertainty in its position? How do these results change as
$n \to \infty$? Is this consistent with classical behavior?

32. Sketch the form of a possible solution to the Schrödinger
equation for each of the potential energies shown in
Figure 5.34. The potential energies go to infinity at the
boundaries. In each case show several cycles of the wave
function. In your sketches, pay attention to the continuity
conditions (where applicable) and to changes in the
wavelength and amplitude.

FIGURE 5.34 Problem 32.

33. Show that the average value of $x^2$ in the one-dimensional
infinite potential energy well is $L^2(1/3 - 1/2n^2\pi^2)$.

34. Use the result of Problem 33 to show that, for the infinite
one-dimensional well, defining $\Delta x = \sqrt{(x^2)_{\rm av} - (x_{\rm av})^2}$
gives $\Delta x = L\sqrt{1/12 - 1/2\pi^2n^2}$.

35. (a) In the infinite one-dimensional well, what is
$p_{\rm av}$? (Use a symmetry argument.) (b) What is $(p^2)_{\rm av}$? [Hint:
What is $(p^2/2m)_{\rm av}$?] (c) Defining $\Delta p = \sqrt{(p^2)_{\rm av} - (p_{\rm av})^2}$,
show that $\Delta p = hn/2L$.

36. The first excited state of the harmonic oscillator has a wave
function of the form $\psi(x) = Axe^{-ax^2}$. Follow the method
outlined in Section 5.5 to find a and the energy E. Find the
constant A from the normalization condition.

37. Using the normalization constant A from Problem 20 and
the value of a from Eq. 5.49, evaluate the probability to
find an oscillator in the ground state beyond the classical
turning points $\pm x_0$. This problem cannot be solved in closed,
analytic form. Develop an approximate, numerical method
using a graph, calculator, or computer. Assume an electron
bound to an atomic-sized region ($x_0 = 0.1$ nm) with an
effective force constant of 1.0 eV/nm$^2$.

38. A two-dimensional harmonic oscillator has energy
$E = \hbar\omega_0(n_x + n_y + 1)$, where $n_x$ and $n_y$ are integers beginning
with zero. (a) Justify this result based on the energy of
the one-dimensional oscillator. (b) Sketch an energy-level
diagram similar to Figure 5.21, showing the lowest 4 energy
levels. For each level, show the value of E (in units of
$\hbar\omega_0$), the quantum numbers $n_x$ and $n_y$, and the degeneracy.
(c) Show that the degeneracy of each level is equal to
$n_x + n_y + 1$.
Our goal in this chapter is to understand some of the details of atomic structure
that can be learned from experimental studies of atoms. In particular, we discuss 
two types of experiments that are important in the development of our theory 
of atomic structure: the scattering of charged particles by atoms, which tells us 
about the distribution of electric charge in atoms, and the emission or absorption 
of radiation by atoms, which tells us about their excited states. 

We use the information obtained from these experiments to develop an atomic 
model, which helps us understand and explain the properties of atoms. A model 
is usually an oversimplified picture of a more complex system, which provides 
some insight into its operation but may not be sufficiently detailed to explain all 
of its properties. 

In this chapter, we discuss the experiments that led to the Rutherford-Bohr 
model (also known simply as the Bohr model), which is based on the familiar 
“planetary” structure in which the electrons orbit about the nucleus like planets 
about the Sun. Even though this model is not strictly valid from the standpoint of 
wave mechanics, it does help us understand many atomic properties, especially 
the excited states of the simplest atom, hydrogen. In Chapter 7, we show how 
wave mechanics changes our picture of the hydrogen atom, and in Chapter 8 we 
consider the structure of more complicated atoms. 


6.1 BASIC PROPERTIES OF ATOMS 


Before we begin to construct a model of the atom, it is helpful to summarize some 
of the basic properties of atoms. 


1. Atoms are very small, about 0.1 nm ($0.1 \times 10^{-9}$ m) in radius. Thus any
effort to “see” an atom using visible light ($\lambda \approx 500$ nm) is hopeless owing
to diffraction effects. We can make a crude estimate of the maximum size
of an atom in the following way. Consider a cube of elemental matter, for
example, iron. Iron has a density of about 8 g/cm$^3$ and a molar mass of
56 g. One mole of iron (56 g) contains Avogadro’s number of atoms, about
$6 \times 10^{23}$. Thus $6 \times 10^{23}$ atoms occupy about 7 cm$^3$ and so 1 atom occupies
about $10^{-23}$ cm$^3$. If we assume the atoms of a solid are packed together in the
most efficient possible way, like hard spheres in contact, then the diameter of
one atom is about $\sqrt[3]{10^{-23}\ \text{cm}^3} \approx 2 \times 10^{-8}$ cm = 0.2 nm.

2. Atoms are stable—they do not spontaneously break apart into smaller pieces 
or collapse; therefore the internal forces that hold the atom together must be in 
equilibrium. This immediately tells us that the forces that pull the parts of an 
atom together must be opposed in some way; otherwise atoms would collapse. 

3. Atoms contain negatively charged electrons, but are electrically neutral. If we 
disturb an atom or collection of atoms with sufficient force, electrons are emit- 
ted. We learn this fact from studying the Compton effect and the photoelectric 
effect. We also learned in Chapter 4 that even though electrons are emitted from 
the nuclei of atoms in certain radioactive decay processes, they don’t “exist” 
in those nuclei but are manufactured there by some process. Electrons were 
excluded from the nucleus based on the uncertainty principle, which forbids 
emitted electrons of the energies observed in the laboratory from existing in the 
nucleus (see Example 4.7). The uncertainty principle places no such restriction 
on the existence of electrons in a volume as large as an atom (see Problem 1). 

We can also easily observe that bulk matter is electrically neutral, and we 
assume that this is likewise a property of the atoms. Experiments with beams
of individual atoms support this assumption. From these experimental facts
we deduce that an atom with Z negatively charged electrons must also contain 
a net positive charge of Ze. 

4. Atoms emit and absorb electromagnetic radiation. This radiation may take 
many forms: visible light ($\lambda \sim 500$ nm), X rays ($\lambda \sim 1$ nm), ultraviolet rays
($\lambda \sim 10$ nm), infrared rays ($\lambda \sim 0.1$ $\mu$m), and so forth. In fact it is from observation
of these emitted and absorbed radiations, which can be measured with great
precision, that we learn most of what we know about atoms. In a typical emis- 
sion measurement, an electric current is passed through a glass tube containing 
a small sample of the gas phase of the element under study, and radiation is 
emitted when an excited atom returns to its ground state. The absorption wave- 
lengths can be measured by passing a beam of white light through a sample of 
the gas and noting which colors are removed from the white light by absorption 
in the gas. One particularly curious feature of the atomic radiations is that atoms 
don’t always emit and absorb radiations at the same wavelengths; some wavelengths
present in the emission experiment do not also appear in the absorption
experiment. Any successful theory of atomic structure must be able to account 
for these emission and absorption wavelengths. 
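
The size estimate in item 1 of the list above can be checked numerically. The following is a minimal sketch that uses the rounded values quoted there for iron (density about 8 g/cm^3, molar mass 56 g); the function name `atomic_diameter_nm` is an illustrative choice and not part of the text.

```python
# Rough upper limit on atomic size from bulk density (iron values from item 1).
AVOGADRO = 6.0e23          # atoms per mole, rounded as in the text

def atomic_diameter_nm(density_g_cm3, molar_mass_g):
    """Edge of the cube occupied by one atom, in nm (hypothetical helper)."""
    molar_volume_cm3 = molar_mass_g / density_g_cm3      # about 7 cm^3 for iron
    volume_per_atom_cm3 = molar_volume_cm3 / AVOGADRO    # about 1e-23 cm^3
    edge_cm = volume_per_atom_cm3 ** (1.0 / 3.0)
    return edge_cm * 1.0e7                               # 1 cm = 1e7 nm

diameter_iron_nm = atomic_diameter_nm(8.0, 56.0)
print(f"estimated diameter: {diameter_iron_nm:.2f} nm")
```

The result is close to the 0.2 nm quoted in the text; it is an upper limit because it assigns all of the volume of the solid to the atoms.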


6.2 SCATTERING EXPERIMENTS AND THE 
THOMSON MODEL 





An early model of the structure of the atom was proposed (in 1904) by J. J. 
Thomson, who was known for his previous identification of the electron and 
measurement of its charge-to-mass ratio e/m. The Thomson model incorporates 
many of the known properties of atoms: size, mass, number of electrons, and 
electrical neutrality. In this model, an atom contains Z electrons that are embedded 
in a uniform sphere of positive charge (Figure 6.1). The total positive charge of the
sphere is Ze, the mass of the sphere is essentially the mass of the atom (the electrons 
don’t contribute significantly to the total mass), and the radius R of the sphere is the 
radius of the atom. (This model is sometimes known as the “plum-pudding” model, 
because the electrons are distributed throughout the atom like raisins in a plum 
pudding.) As we will see, the Thomson model gives predictions that disagree with 
experiment, and so it is not the correct way of understanding the structure of atoms. 

One way of studying atoms is by probing the distribution of electric charge in 
their interior, which we can do by bombarding the atom with charged particles and 
observing the angle by which particles are deflected from their original direction. 
This type of experiment is called a scattering experiment. Ideally we would like 
to do this experiment with a single atom, such as is represented in Figure 6.2. 
The scattering angle $\theta$ depends on the impact parameter b, which measures the
distance from the center of the atom that a projectile would pass if it were not 
deflected. Each different value of the impact parameter results in a different value 
of the scattering angle. 

The particle is deflected from its original trajectory by the electrical forces
exerted on the particle by the atom. For a positively charged particle, these forces
are: (1) a repulsive force due to the positive charge of the atom, and (2) an attractive
force due to the negatively charged electrons. We assume that the mass of the
deflected particle is much greater than the mass of an electron but also much less
than the mass of the atom. In the encounter between the projectile and an electron,
the forces exerted on each by the other are equal and opposite (by Newton’s
third law), and so the principal victim of the encounter is the much less massive
electron; the effect on the projectile is negligible. (Imagine rolling a bowling ball
through a field of Ping-Pong balls!) We thus need consider only the positively
charged atom as a cause of the deflection of the particle. By the same argument,
we neglect any possible motion of the more massive atom caused by the passage
of the projectile. The basic experiment, then, is the scattering of a positively
charged projectile by the stationary positively charged massive part of the atom.


FIGURE 6.1 The Thomson model of the atom. Z electrons are embedded in
a uniform sphere of positive charge Ze and radius R. An imaginary spherical
surface of radius r contains a fraction $r^3/R^3$ of the positive charge.

FIGURE 6.2 A positively charged particle is deflected by an angle $\theta$ as it
passes through a positively charged sphere, representing a Thomson model
atom. The scattering angle depends on the value of the impact parameter b,
which varies from 0 to R.

FIGURE 6.3 Scattering by a thin foil. Some individual scatterings tend to
increase $\theta$, while others tend to decrease $\theta$.

Ernest Rutherford (1871-1937, England). Founder of nuclear physics, he
is known for his pioneering work on alpha-particle scattering and radioactive
decays. His inspiring leadership influenced a generation of British nuclear
and atomic scientists.

In practice we cannot do the experiment with one atom. Instead, we bombard a 
thin foil, as in Figure 6.3. The scattering angle $\theta$ that we observe in the laboratory
is the result of scattering by many atoms, with impact parameters that we do
not know and cannot control. Let’s assume that for a single atom the average
scattering angle is $\theta_{av}$, which represents an average over all possible impact
parameters from zero up to the atomic radius R. For a typical foil thickness of
1 $\mu$m ($10^{-6}$ m), the projectile is scattered by about $10^4$ atoms.

The total scattering angle $\theta$ is determined by statistical considerations, because
some of the individual scatterings move the projectile toward larger scattering
angles and some toward smaller angles, as represented in Figure 6.3. This is an
example of a “random walk” problem: for N scatterings, the most likely observed
net scattering angle $\theta$ is related to the average individual scattering angle by


$$\theta \approx \sqrt{N}\,\theta_{av} \qquad (6.1)$$


According to the Thomson model, the average scattering angle for a single atom
is on the order of 0.01°, and for a foil that is $10^4$ atoms thick the net scattering
angle should be about 1°. This is consistent with experimental observations.
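
The random-walk scaling of Eq. 6.1 can be illustrated with a short simulation. This is only a sketch under a simplifying assumption that is not in the text: every atom gives a kick of exactly $\theta_{av}$, toward larger or smaller angles with equal probability, in a single plane. The helper name `rms_net_steps` is hypothetical.

```python
import math
import random

THETA_AV_DEG = 0.01      # average single-atom deflection quoted in the text
N_ATOMS = 10_000         # about 10^4 atoms along the path

# Eq. 6.1: net angle for a random walk of N steps
theta_net_deg = math.sqrt(N_ATOMS) * THETA_AV_DEG

def rms_net_steps(n_steps, n_trials, seed=0):
    """RMS of the sum of n_steps random +1/-1 kicks (illustrative helper)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_trials):
        # Each random bit is one kick: 1 means +1, 0 means -1.
        ups = bin(rng.getrandbits(n_steps)).count("1")
        net = 2 * ups - n_steps
        total += net * net
    return math.sqrt(total / n_trials)

rms_steps = rms_net_steps(N_ATOMS, 2000)
print(f"Eq. 6.1 estimate: {theta_net_deg:.2f} deg")
print(f"simulated rms:    {rms_steps * THETA_AV_DEG:.2f} deg")
```

Both numbers come out near 1°, which is the net scattering angle the Thomson model predicts for this foil.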

The most critical test of the Thomson model, which it fails completely, occurs
when we examine the probability for scattering at large angles. If each individual
scattering deflects the projectile through an angle of around 0.01°, then to observe
projectiles scattered through a total angle greater than 90°, we must have about
$10^4$ successive scatterings, all of which push the projectile toward larger angles.
Because the probabilities of individual scatterings toward either larger or smaller
angles are equal, the probability of having $10^4$ successive scatterings toward
larger angles, like the probability of finding $10^4$ successive heads in tossing a
coin, is about $(1/2)^{10000} \approx 10^{-3000}$.
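
A probability this small cannot be represented as an ordinary floating-point number, so a quick check is best done with logarithms. The sketch below does that; the function name is an illustrative choice.

```python
import math

def log10_prob_all_same_direction(n_scatterings):
    """log10 of (1/2)**n, computed with logs to avoid floating-point underflow."""
    return n_scatterings * math.log10(0.5)

exponent = log10_prob_all_same_direction(10_000)
print(f"(1/2)^10000 is about 10^{exponent:.0f}")
```

The exponent is close to -3010, which the text rounds to $10^{-3000}$.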

An experiment to observe this scattering was performed by Hans Geiger and 
Ernest Marsden in the laboratory of Ernest Rutherford at Manchester University 
in 1910. For projectiles they used alpha particles, which are nuclei of helium (of 
charge +2e) emitted in radioactive decay. Their results showed that the probability 
of an alpha particle scattering at angles greater than 90° was about $10^{-4}$. This
remarkable discrepancy between the expected value based on the Thomson model
($10^{-3000}$) and the observed value ($10^{-4}$) was described by Rutherford in this way:


It was quite the most incredible event that ever happened to me in my life. 
It was as incredible as if you fired a 15-inch shell at a piece of tissue paper 
and it came back and hit you. 


The analysis of the results of such scattering experiments led Rutherford to propose 
that the mass and positive charge of the atom are not distributed uniformly over 
the volume of the atom, but instead are concentrated in an extremely small region, 
about $10^{-14}$ m in diameter, at the center of the atom. In Section 6.3 we will see
how this proposal is consistent with the large-angle scattering results.


Scattering in the Thomson Model (Optional)


Let’s assume that a projectile of positive charge ze is incident on an atom of 
radius R that we represent according to the Thomson model as a uniform sphere 
of positive charge Ze. The force on the projectile when it is a distance r from the
center of the atom can be computed using Gauss’s law (see Problem 2):


$$F = \frac{zZe^2}{4\pi\varepsilon_0 R^3}\,r \qquad (6.2)$$


Before discussing the scattering, we should note that this equation can also describe 
(if we put z = 1) the force on an electron embedded in the Thomson atom at a 
distance r from its center. This force can be written $F = kr$ with $k = Ze^2/4\pi\varepsilon_0 R^3$.
This linear restoring force permits the electrons to oscillate about their equilibrium
positions just like a mass on a spring subject to the linear restoring force $F = kx$. We
therefore expect the electrons in the Thomson atom to oscillate about their equilibrium
positions with a frequency $f = (2\pi)^{-1}\sqrt{k/m}$, where k is the force constant.
Because an oscillating electric charge radiates electromagnetic waves whose fre- 
quency is identical to the oscillation frequency, we might expect, based on the 
Thomson model, that the radiation emitted by atoms would show this characteristic 
frequency. This turns out not to be true (see Problem 3); the calculated frequencies 
do not correspond to the frequencies observed for radiation emitted by atoms. 
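
To get a feeling for the size of this characteristic frequency, the expression $f = (2\pi)^{-1}\sqrt{k/m}$ can be evaluated for a sample case. The sketch below assumes Z = 1 and R = 0.1 nm (the typical atomic radius from Section 6.1); both values, the electron rest energy of 0.511 MeV, and the function name are choices made here for illustration rather than values given in this passage.

```python
import math

E2_OVER_4PI_EPS0 = 1.44        # eV*nm, value used in the text
ELECTRON_MC2 = 0.511e6         # eV, electron rest energy
C_NM_PER_S = 2.998e17          # speed of light in nm/s

def thomson_frequency_hz(Z, radius_nm):
    """f = (1/2pi) sqrt(k/m) with k = Z e^2 / (4 pi eps0 R^3)."""
    k = Z * E2_OVER_4PI_EPS0 / radius_nm**3            # eV / nm^2
    omega = C_NM_PER_S * math.sqrt(k / ELECTRON_MC2)   # rad/s, using m = mc^2 / c^2
    return omega / (2.0 * math.pi)

f_hz = thomson_frequency_hz(Z=1, radius_nm=0.1)
wavelength_nm = C_NM_PER_S / f_hz
print(f"f = {f_hz:.2e} Hz, wavelength = {wavelength_nm:.0f} nm")
```

Under these assumptions the model gives a single frequency, and hence a single wavelength, for a given atom.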

The exact calculation of the scattering angle for different values of the impact 
parameter in the Thomson model of the atom is fairly complicated, but for our 
purposes we want only an estimate of the average value of the angle. As we will 
find out later, it’s not very important if our estimate is off by a small factor. 

Initially the projectile moves in the x direction in the geometry of Figure 6.2, 
but the atom exerts a force in the y direction that produces a small component 
of momentum $p_y$ in that direction. Using Newton’s second law we can find the
momentum from the impulse received by the projectile due to the electrostatic
force:


$$\Delta p_y = \int F_y\,dt \qquad (6.3)$$


Rather than carry out this complicated integral for a force that is changing in mag- 
nitude and direction as the projectile travels, we’ll estimate the average scattering 
angle by choosing an average value for the impact parameter, namely b = R/2 (rep- 
resenting the middle trajectory of Figure 6.2), and we’ll assume the force acts in 
the y direction for a time $\Delta t$ determined by the projectile’s flight along a line of
length roughly equal to R. This underestimates the amount of time during which the 
force acts but overestimates the effect of the force (which doesn’t act purely in the 
y direction along the entire trajectory), so to some extent these two effects should 
cancel one another. 
Making these approximations, we obtain


$$\Delta p_y \approx F_y\,\Delta t = \frac{zZe^2 (R/2)}{4\pi\varepsilon_0 R^3}\,\frac{R}{v} = \frac{zZe^2}{8\pi\varepsilon_0 R v} \qquad (6.4)$$


The angle $\theta$ is small, so we can make the approximation $\tan\theta \approx \theta$, and we can
assume that $p_x$ changes very little from its initial value $mv$, and so the average
scattering angle is


$$\theta_{av} \approx \tan\theta_{av} = \frac{\Delta p_y}{p_x} = \frac{zZe^2}{8\pi\varepsilon_0 R v\,mv} = \frac{zZe^2}{16\pi\varepsilon_0 R K} \qquad (6.5)$$


using the nonrelativistic kinetic energy $K = \frac{1}{2}mv^2$. This gives an estimate of the
scattering angle when the impact parameter $b$ is equal to half the radius R. Smaller
values of $b$ will give smaller deflection angles, and larger values of $b$ will give
larger angles, so this is a reasonable estimate for the average scattering angle for
a Thomson model atom.


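Example 6.1


Using the Thomson model, estimate the average scattering
angle when alpha particles (z = 2) with kinetic energy
3 MeV are scattered from gold (Z = 79). The atomic radius
of gold is 0.179 nm.

Solution
Using $e^2/4\pi\varepsilon_0 = 1.44$ eV·nm, we have

$$\theta_{av} = \frac{zZe^2}{16\pi\varepsilon_0 RK} = \frac{1}{4}\,\frac{e^2}{4\pi\varepsilon_0}\,\frac{zZ}{RK} = \frac{0.25\,(1.44\ \text{eV}\cdot\text{nm})(2)(79)}{(0.179\ \text{nm})(3 \times 10^{6}\ \text{eV})} \approx 1 \times 10^{-4}\ \text{rad} \approx 0.006^\circ$$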




Even though this result represents a rough estimate of the average scattering 
angle in the Thomson model of the atom, its accuracy does not affect our 
conclusions about the failure of the model. Even if our estimate were too small by 
as much as a factor of 10 (which is highly unlikely), we would be comparing an 
expected probability of $10^{-300}$ (instead of $10^{-3000}$) with the observed $10^{-4}$, still
a spectacular disagreement. Any reasonable estimate shows the complete failure 
of the Thomson model to account for these scattering experiments. 


6.3 THE RUTHERFORD NUCLEAR ATOM 


In analyzing the scattering of alpha particles, Rutherford concluded that the most 
likely way an alpha particle (m = 4u) can be deflected through large angles is 
by a single collision with a more massive object. Rutherford therefore proposed 
that the charge and mass of the atom were concentrated at its center, in a region 
called the nucleus. Figure 6.4 illustrates the scattering geometry in this case. The 
projectile, of charge ze, experiences a repulsive force due to the positively charged
nucleus:


$$F = \frac{1}{4\pi\varepsilon_0}\frac{|q_1 q_2|}{r^2} = \frac{(ze)(Ze)}{4\pi\varepsilon_0 r^2} \qquad (6.6)$$


(Compare this with Eq. 6.2, which describes a projectile that is inside the sphere
of charge Ze and so feels only a portion of the positive charge. We assume now
that the projectile is always outside the nucleus, so it feels the full nuclear charge
Ze.) The atomic electrons, with their small mass, do not appreciably affect the
path of the projectile and we neglect their effect on the scattering. We also assume
that the nucleus is so much more massive than the projectile that it does not move
during the scattering process; because no recoil motion is given to the nucleus,
the initial and final kinetic energies K of the projectile are equal.

FIGURE 6.4 Scattering by a nuclear atom. The path of the scattered particle
is a hyperbola. Smaller impact parameters give larger scattering angles.

As Figure 6.4 shows, for each impact parameter $b$, there is a certain scattering
angle $\theta$, and we need the relationship between $b$ and $\theta$. The projectile can be
shown* to follow a hyperbolic path; in polar coordinates $r$ and $\phi$, the equation of
the hyperbola is


$$\frac{1}{r} = \frac{1}{b}\sin\phi + \frac{zZe^2}{8\pi\varepsilon_0 b^2 K}(\cos\phi - 1) \qquad (6.7)$$


As shown in Figure 6.5, the initial position of the particle is $\phi = 0$, $r \to \infty$, and
the final position is $\phi = \pi - \theta$, $r \to \infty$. Using the coordinates at the final position,
Eq. 6.7 reduces to


$$b = \frac{zZe^2}{8\pi\varepsilon_0 K}\cot\frac{\theta}{2} = \frac{zZ}{2K}\,\frac{e^2}{4\pi\varepsilon_0}\cot\frac{\theta}{2} \qquad (6.8)$$








(This result is written in this form so that $e^2/4\pi\varepsilon_0 = 1.44$ eV·nm or MeV·fm can
be easily inserted.) A projectile that approaches the nucleus with impact parameter
$b$ will be scattered at an angle $\theta$; projectiles approaching with smaller values of $b$
will be scattered through larger angles, as shown in Figure 6.4.
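
The relationship between impact parameter and scattering angle in Eq. 6.8 is easy to tabulate. The sketch below does so for alpha particles (z = 2) of 8.0 MeV on gold (Z = 79); these sample values and the function name are chosen here only for illustration.

```python
import math

E2_OVER_4PI_EPS0 = 1.44   # MeV*fm

def impact_parameter_fm(z, Z, kinetic_energy_MeV, theta_deg):
    """Eq. 6.8: b = (zZ / 2K)(e^2 / 4 pi eps0) cot(theta / 2), in fm."""
    half_angle = math.radians(theta_deg) / 2.0
    return z * Z * E2_OVER_4PI_EPS0 / (2.0 * kinetic_energy_MeV) / math.tan(half_angle)

for theta in (10, 45, 90, 150):
    b = impact_parameter_fm(2, 79, 8.0, theta)
    print(f"theta = {theta:3d} deg  ->  b = {b:6.1f} fm")
```

The table shows the trend stated in the text: the smaller the impact parameter, the larger the scattering angle.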

We divide our study of the scattering of charged projectiles by nuclei (which 
is commonly called Rutherford scattering) into three parts: (1) calculation of the 
fraction of projectiles scattered at angles greater than some value of $\theta$, (2) the
Rutherford scattering formula and its experimental verification, and (3) the closest 
approach of a projectile to the nucleus. 


1. The Fraction of Projectiles Scattered at Angles Greater than $\theta$. From
Figure 6.4 we see immediately that every projectile with impact parameters less
than a given value of $b$ will be scattered at angles greater than its corresponding
$\theta$. What is the chance of a projectile having an impact parameter less than a given
value of $b$? Suppose the foil were one atom thick: a single layer of atoms packed
tightly together, as in Figure 6.6. Each atom is represented by a circular disc, of
area $\pi R^2$. If the foil contains N atoms, its total area is $N\pi R^2$. For scattering at
angles greater than $\theta$, the impact parameter must fall between zero and $b$; that
is, the projectile must approach the atom within a circular disc of area $\pi b^2$. If
the projectiles are spread uniformly over the area of the disc, then the fraction of
projectiles that fall within that area is just $\pi b^2/\pi R^2$.

A real scattering foil may be thousands or tens of thousands of atoms thick.
Let $t$ be the thickness of the foil and $A$ its area, and let $\rho$ and $M$ be the density and
molar mass of the material of which the foil is made. The volume of the foil is
then $At$, its mass is $\rho At$, the number of moles is $\rho At/M$, and the number of atoms
or nuclei per unit volume is


$$n = \frac{N_A \rho A t}{M}\,\frac{1}{At} = \frac{N_A \rho}{M} \qquad (6.9)$$


where $N_A$ is Avogadro’s number (the number of atoms per mole). As seen by an
incident projectile, the number of nuclei per unit area is $nt = N_A \rho t/M$; that is, on
the average, each nucleus contributes an area $(N_A \rho t/M)^{-1}$ to the field of view
of the projectile. For scattering at angles greater than $\theta$, it must once again be
true that the projectile must fall within an area $\pi b^2$ of the center of an atom; the
fraction scattered at angles greater than $\theta$ is just the fraction that approaches an
atom within the area $\pi b^2$:


$$f_{<b} = f_{>\theta} = nt\pi b^2 \qquad (6.10)$$


assuming that the incident particles are spread uniformly over the area of the foil.


*See, for example, R. M. Eisberg and R. Resnick, Quantum Physics of Atoms, Molecules, Solids,
Nuclei, and Particles, 2nd ed. (New York, Wiley, 1985).

FIGURE 6.5 The hyperbolic trajectory of a scattered particle.

FIGURE 6.6 Scattering geometry for many atoms. For impact parameter $b$,
the scattering angle is $\theta$. If the particle enters the atom within the disc of area
$\pi b^2$, its scattering angle will be larger than $\theta$.


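Example 6.2


A gold foil ($\rho = 19.3$ g/cm$^3$, $M = 197$ g/mole) has a thickness
of $2.0 \times 10^{-4}$ cm. It is used to scatter alpha particles
of kinetic energy 8.0 MeV. (a) What fraction of the alpha
particles is scattered at angles greater than 90°? (b) What
fraction of the alpha particles is scattered at angles between
90° and 45°?

Solution
(a) For this case the number of nuclei per unit volume can
be computed as

$$n = \frac{N_A \rho}{M} = \frac{(6.02 \times 10^{23}\ \text{atoms/mole})(19.3\ \text{g/cm}^3)}{(197\ \text{g/mole})(1\ \text{m}/10^{2}\ \text{cm})^3} = 5.9 \times 10^{28}\ \text{m}^{-3}$$

For scattering at 90°, the impact parameter $b$ can be found
from Eq. 6.8:

$$b = \frac{zZ}{2K}\,\frac{e^2}{4\pi\varepsilon_0}\cot\tfrac{1}{2}\theta = \frac{(2)(79)}{2(8.0\ \text{MeV})}(1.44\ \text{MeV}\cdot\text{fm})\cot 45^\circ = 14\ \text{fm} = 1.4 \times 10^{-14}\ \text{m}$$

and using Eq. 6.10 we then have

$$f_{>90^\circ} = nt\pi b^2 = (5.9 \times 10^{28}\ \text{m}^{-3})(2.0 \times 10^{-6}\ \text{m})\,\pi\,(1.4 \times 10^{-14}\ \text{m})^2 = 7.5 \times 10^{-5}$$

(b) Repeating the calculation for $\theta = 45^\circ$, we find

$$b = \frac{zZ}{2K}\,\frac{e^2}{4\pi\varepsilon_0}\cot\tfrac{1}{2}\theta = \frac{(2)(79)}{2(8.0\ \text{MeV})}(1.44\ \text{MeV}\cdot\text{fm})\cot 22.5^\circ = 34\ \text{fm} = 3.4 \times 10^{-14}\ \text{m}$$

$$f_{>45^\circ} = nt\pi b^2 = (5.9 \times 10^{28}\ \text{m}^{-3})(2.0 \times 10^{-6}\ \text{m})\,\pi\,(3.4 \times 10^{-14}\ \text{m})^2 = 4.4 \times 10^{-4}$$

If a total fraction of $4.4 \times 10^{-4}$ is scattered at angles
greater than 45°, and of that, $7.5 \times 10^{-5}$ is scattered at
angles greater than 90°, the fraction scattered between 45°
and 90° must be

$$4.4 \times 10^{-4} - 7.5 \times 10^{-5} = 3.6 \times 10^{-4}$$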
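
The numbers in Example 6.2 can be reproduced with a short script that combines Eqs. 6.8, 6.9, and 6.10. This is a sketch using the values given in the example; the function names are illustrative, and the unrounded intermediate values lead to small differences in the last digit.

```python
import math

N_A = 6.02e23             # atoms per mole
E2_OVER_4PI_EPS0 = 1.44   # MeV*fm

def number_density_m3(density_g_cm3, molar_mass_g):
    """Eq. 6.9, converted from cm^-3 to m^-3."""
    return N_A * density_g_cm3 / molar_mass_g * 1.0e6

def fraction_beyond(theta_deg, z, Z, K_MeV, n_m3, thickness_m):
    """Eq. 6.10 with b from Eq. 6.8: fraction scattered at angles > theta."""
    half_angle = math.radians(theta_deg) / 2.0
    b_fm = z * Z * E2_OVER_4PI_EPS0 / (2.0 * K_MeV) / math.tan(half_angle)
    b_m = b_fm * 1.0e-15
    return n_m3 * thickness_m * math.pi * b_m**2

n_gold = number_density_m3(19.3, 197.0)
t_foil = 2.0e-6           # 2.0e-4 cm expressed in m
f90 = fraction_beyond(90.0, 2, 79, 8.0, n_gold, t_foil)
f45 = fraction_beyond(45.0, 2, 79, 8.0, n_gold, t_foil)
print(f"n = {n_gold:.2e} m^-3")
print(f"f(>90) = {f90:.2e}, f(>45) = {f45:.2e}, between = {f45 - f90:.2e}")
```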





2. The Rutherford Scattering Formula and Its Experimental Verification.
In order to find the probability that a projectile will be scattered into a small
angular range at $\theta$ (between $\theta$ and $\theta + d\theta$), we require that the impact parameter
lie within a small range of values $db$ at $b$ (see Figure 6.7). The fraction $df$ is then


$$df = nt(2\pi b\,db) \qquad (6.11)$$


from Eq. 6.10. Differentiating Eq. 6.8 we find $db$ in terms of $d\theta$:


$$db = \frac{zZ}{2K}\,\frac{e^2}{4\pi\varepsilon_0}\left(-\csc^2\tfrac{1}{2}\theta\right)\left(\tfrac{1}{2}\,d\theta\right) \qquad (6.12)$$


and so


$$|df| = \pi nt\left(\frac{zZ}{2K}\right)^2\left(\frac{e^2}{4\pi\varepsilon_0}\right)^2 \csc^2\tfrac{1}{2}\theta\,\cot\tfrac{1}{2}\theta\;d\theta \qquad (6.13)$$


(The minus sign in Eq. 6.12 is not important; it just tells us that $\theta$ increases
as $b$ decreases.) Suppose we place a detector for the scattered projectiles at the
angle $\theta$ a distance $r$ from the nucleus. The probability for a projectile to be
scattered into the detector depends on $df$, which gives the probability for scattered
particles to pass through the ring of radius $r\sin\theta$ and width $r\,d\theta$. The area of the
ring is $dA = (2\pi r\sin\theta)\,r\,d\theta$. In order to calculate the rate at which projectiles
are scattered into the detector we must know the probability per unit area for
scattering into the ring. This is $|df|/dA$, which we call $N(\theta)$, and, after some
manipulation, we find:


$$N(\theta) = \frac{nt}{4r^2}\left(\frac{zZ}{2K}\right)^2\left(\frac{e^2}{4\pi\varepsilon_0}\right)^2 \frac{1}{\sin^4\tfrac{1}{2}\theta} \qquad (6.14)$$


This is the Rutherford scattering formula.

FIGURE 6.7 Particles entering the ring between $b$ and $b + db$ are distributed
uniformly along a ring of angular width $d\theta$. A detector is at a distance $r$ from
the scattering foil.

In Rutherford’s laboratory, Hans Geiger and Ernest Marsden tested the predic- 
tions of this formula in a remarkable series of experiments involving the scattering 
of alpha particles (z = 2) from a variety of thin metal foils. In those days before 
electronic recording and processing equipment was available, Geiger and Marsden 
observed and recorded the alpha particles by counting the scintillations (flashes of 
light) produced when the alpha particles struck a zinc sulfide screen. A schematic 
view of their apparatus is shown in Figure 6.8. In all, four predictions of the 
Rutherford scattering formula were tested: 

(a) $N(\theta) \propto t$. With a source of 8-MeV alpha particles from radioactive
decay, Geiger and Marsden used scattering foils of varying thicknesses $t$ while
keeping the scattering angle $\theta$ fixed at about 25°. Their results are summarized
in Figure 6.9, and the linear dependence of $N(\theta)$ on $t$ is apparent. This is
also evidence that, even at this moderate scattering angle, single scattering is
much more important than multiple scattering. (In a random statistical theory
of multiple scattering, the probability for scattering at a large angle would
be proportional to the square root of the number of single scatterings, and
we would expect $N(\theta) \propto t^{1/2}$. Figure 6.9 shows clearly that this is not true.)
This result emphasizes a significant difference between scattering by a Thomson
model atom and a Rutherford nuclear atom: In the Thomson model, the projectile
is scattered by every atom along its path as it passes through the foil (see
Figure 6.3), while in the Rutherford nuclear model the nucleus is so tiny that
the chance of even a single significant encounter is small and the chance of
encountering more than one nucleus is negligible.

FIGURE 6.8 Schematic diagram of alpha-particle scattering experiment. A radioactive
source of alpha particles is in a shield with a small hole. Alpha particles strike the foil
F and are scattered into the angular range $d\theta$. Each time a scattered particle strikes the
screen S a flash of light is emitted and observed with the movable microscope M.

FIGURE 6.9 The dependence of scattering rate on foil thickness for
three different scattering foils.

FIGURE 6.11 The dependence of scattering rate on the kinetic energy
of the incident alpha particles for scattering by a single foil. The slope
of −2 on the log-log scale shows that $N \propto K^{-2}$, as expected from the
Rutherford formula.

(b) $N(\theta) \propto Z^2$. In this experiment, Geiger and Marsden used a variety
of different scattering materials, of approximately (but not exactly) the same
thickness. This proportionality is therefore much more difficult to test than the
previous one, since it involves the comparison of different thicknesses of different
materials. However, as shown in Figure 6.10, the results are consistent with the
proportionality of $N(\theta)$ to $Z^2$.

(c) $N(\theta) \propto K^{-2}$. In order to test this prediction of the Rutherford scattering
formula, Geiger and Marsden kept the thickness of the scattering foil constant and 
varied the speed of the alpha particles. They accomplished this by slowing down 
the alpha particles emitted from the radioactive source by passing them through 
thin sheets of mica. From independent measurements they knew the effect of 
different thicknesses of mica on the velocity of the alpha particles. The results of 
the experiment are shown in Figure 6.11; once again we see excellent agreement 
with the expected relationship. 

(d) $N(\theta) \propto \sin^{-4}\tfrac{1}{2}\theta$. This dependence of $N$ on $\theta$ is perhaps the most important
and distinctive feature of the Rutherford scattering formula. It also produces the
largest variation in $N$ over the range accessible by experiment. In the tests
discussed so far, $N$ varied by perhaps an order of magnitude; in this case $N$ varies
by about five orders of magnitude from the smaller to the larger angles. Geiger and
Marsden used a gold foil and varied $\theta$ from 5° to 150°, to obtain the relationship
between $N$ and $\theta$ plotted in Figure 6.12. The agreement with the Rutherford
formula is again very good.

Thus all predictions of the Rutherford scattering formula were confirmed by 
experiment, and the “nuclear atom” was verified. 
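
The four proportionalities tested by Geiger and Marsden follow directly from Eq. 6.14, and they can be checked numerically. The sketch below evaluates the formula for illustrative gold-foil values; the detector distance of 0.10 m and the function name are assumptions made here, and doubling Z is a purely formal scaling test rather than a real target material.

```python
import math

E2_OVER_4PI_EPS0 = 1.44e-15   # MeV*m (that is, 1.44 MeV*fm)

def rutherford_N(theta_deg, n_m3, t_m, z, Z, K_MeV, r_m):
    """Eq. 6.14: scattering probability per unit detector area, in m^-2."""
    coulomb = (z * Z / (2.0 * K_MeV)) * E2_OVER_4PI_EPS0      # length in metres
    s = math.sin(math.radians(theta_deg) / 2.0)
    return n_m3 * t_m / (4.0 * r_m**2) * coulomb**2 / s**4

# Illustrative values; the detector distance r_m is an assumed number.
base = dict(theta_deg=25.0, n_m3=5.9e28, t_m=2.0e-6, z=2, Z=79, K_MeV=8.0, r_m=0.10)
N0 = rutherford_N(**base)

ratio_t = rutherford_N(**{**base, "t_m": 4.0e-6}) / N0      # doubling t
ratio_Z = rutherford_N(**{**base, "Z": 158}) / N0           # doubling Z
ratio_K = rutherford_N(**{**base, "K_MeV": 16.0}) / N0      # doubling K
ratio_angle = (rutherford_N(**{**base, "theta_deg": 5.0})
               / rutherford_N(**{**base, "theta_deg": 150.0}))

print(f"doubling t: x{ratio_t:.2f}, doubling Z: x{ratio_Z:.2f}, doubling K: x{ratio_K:.2f}")
print(f"N(5 deg) / N(150 deg) = {ratio_angle:.2e}")
```

The angular ratio is a few times $10^5$, which is the variation of about five orders of magnitude mentioned in test (d).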


3. The Closest Approach of a Projectile to the Nucleus. A positively charged 
projectile slows down as it approaches a nucleus, exchanging part of its initial 
kinetic energy for the electrostatic potential energy due to the nuclear repulsion. 
The closer the projectile gets to the nucleus, the more potential energy it gains,
because


$$U = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r} = \frac{1}{4\pi\varepsilon_0}\frac{zZe^2}{r} \qquad (6.15)$$


The maximum potential energy, and thus the minimum kinetic energy, occurs at
the minimum value of $r$. We assume that $U = 0$ when the projectile is far from the
nucleus, where it has total energy $E = K = \frac{1}{2}mv^2$. As the projectile approaches
the nucleus, K decreases and U increases, but U + K remains constant. At the
distance $r_{min}$, the speed is $v_{min}$ and:


$$E = \frac{1}{2}mv_{min}^2 + \frac{1}{4\pi\varepsilon_0}\frac{zZe^2}{r_{min}} = \frac{1}{2}mv^2 \qquad (6.16)$$


(See Figure 6.13.)

FIGURE 6.12 The dependence of scattering rate on the scattering angle $\theta$,
using a gold foil. The $\sin^{-4}(\theta/2)$ dependence is exactly as predicted by the
Rutherford formula.

FIGURE 6.13 Closest approach of the projectile to the nucleus.


Angular momentum is also conserved. Far from the nucleus, the angular
momentum $L$ is $mvb$, and at $r_{min}$, the angular momentum is $mv_{min}r_{min}$, so


$$mvb = mv_{min}r_{min} \qquad (6.17)$$


which gives $v_{min} = bv/r_{min}$. Substituting this result into Eq. 6.16, we find


$$\frac{1}{2}mv^2 = \frac{1}{2}m\left(\frac{b^2v^2}{r_{min}^2}\right) + \frac{1}{4\pi\varepsilon_0}\frac{zZe^2}{r_{min}} \qquad (6.18)$$


This expression can be solved for the value of $r_{min}$.
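
One way to carry out that solution is sketched below. Writing $K = \frac{1}{2}mv^2$ and $d = zZe^2/4\pi\varepsilon_0 K$, Eq. 6.18 becomes the quadratic $r_{min}^2 - d\,r_{min} - b^2 = 0$, whose positive root is $r_{min} = d/2 + \sqrt{d^2/4 + b^2}$. The sample values (8.0-MeV alpha particles on gold) and the function name are chosen here for illustration.

```python
import math

E2_OVER_4PI_EPS0 = 1.44   # MeV*fm, value used in the text

def closest_distance_fm(z, Z, K_MeV, b_fm):
    """Solve Eq. 6.18 for r_min (in fm) at impact parameter b.

    With K = (1/2) m v^2 and d = zZ e^2 / (4 pi eps0 K), Eq. 6.18 becomes
    r^2 - d*r - b^2 = 0; the positive root is returned.
    """
    d = z * Z * E2_OVER_4PI_EPS0 / K_MeV
    return 0.5 * d + math.sqrt(0.25 * d * d + b_fm * b_fm)

for b in (0.0, 14.2, 34.3, 100.0):
    r = closest_distance_fm(2, 79, 8.0, b)
    print(f"b = {b:6.1f} fm  ->  r_min = {r:6.1f} fm")
```

For large impact parameters $r_{min}$ approaches $b$, as expected for a projectile that is barely deflected.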

Notice that the kinetic energy of the projectile is not zero at $r_{min}$ unless $b = 0$.
(See Figure 6.13.) When $b = 0$, the projectile would lose all of its kinetic energy,
and thus get closest to the nucleus. At this point its distance from the nucleus is
$d$, the distance of closest approach. We find this distance by solving Eq. 6.18 for
$r_{min}$ when $b = 0$, and obtain


$$d = \frac{1}{4\pi\varepsilon_0}\frac{zZe^2}{K} \qquad (6.19)$$


Example 6.3


Find the distance of closest approach of an 8.0-MeV alpha
particle incident on a gold foil.

Solution

$$d = \frac{e^2}{4\pi\varepsilon_0}\,\frac{zZ}{K} = \frac{(2)(79)(1.44\ \text{MeV}\cdot\text{fm})}{8.0\ \text{MeV}} = 28\ \text{fm}$$


Although a distance of 28 fm is very small (much less than an atomic radius,
for example) it is larger than the nuclear radius of gold (about 7 fm). Thus the 
projectile is always outside of the nuclear charge distribution, and the Rutherford 
scattering law, which was derived assuming the projectile to remain outside the 
nucleus, correctly describes the scattering. If we increase the kinetic energy of 
the projectile, or decrease the electrostatic repulsion by using a target nucleus 
with low Z, this may not be the case. Under certain circumstances, the distance 
of closest approach can be less than the nuclear radius. When this happens, the 
projectile no longer feels the full nuclear charge, and the Rutherford scattering law 
no longer holds. In fact, as we discuss in Chapter 12, this gives us a convenient 
way of measuring the size of the nucleus. 
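
The energy at which this breakdown would begin can be estimated from Eq. 6.19 by setting $d$ equal to the nuclear radius. The sketch below uses the 7 fm radius of gold quoted above; it ignores the size of the alpha particle itself, so it should be read as a rough illustration rather than a measured threshold. The function names are hypothetical.

```python
E2_OVER_4PI_EPS0 = 1.44   # MeV*fm

def closest_approach_fm(z, Z, K_MeV):
    """Eq. 6.19: head-on distance of closest approach d, in fm."""
    return z * Z * E2_OVER_4PI_EPS0 / K_MeV

def threshold_energy_MeV(z, Z, nuclear_radius_fm):
    """Kinetic energy at which d equals the nuclear radius (Eq. 6.19 solved for K)."""
    return z * Z * E2_OVER_4PI_EPS0 / nuclear_radius_fm

d_gold = closest_approach_fm(2, 79, 8.0)
K_touch = threshold_energy_MeV(2, 79, 7.0)
print(f"d = {d_gold:.1f} fm, K needed to reach 7 fm = {K_touch:.1f} MeV")
```

For gold the required energy is several times larger than the 8.0 MeV of the alpha particles in Example 6.3, which is why the Rutherford formula describes that case well.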


6.4 LINE SPECTRA 


The radiation from atoms can be classified into continuous spectra and discrete 
or line spectra. In a continuous spectrum, all wavelengths from some minimum, 
perhaps 0, to some maximum, perhaps approaching $\infty$, are emitted. The radiation
from a hot, glowing object is an example of this category. White light is a mixture 
of all of the different colors of visible light; an object that glows white hot is 
emitting light at all wavelengths of the visible spectrum. If, on the other hand, 
we force an electric discharge in a tube containing a small amount of the gas or 
vapor of a certain element, such as mercury, sodium, or neon, light is emitted at a 
few discrete wavelengths and not at any others. Examples of such emission “line” 
spectra are shown in Figure 6.14. The strong 436 nm (blue) and 546 nm (green) 
lines in the mercury emission spectrum give mercury-vapor street lights their 
blue-green tint; the strong yellow line at 590 nm in the sodium spectrum (which 
is actually a doublet—two very closely spaced lines) gives sodium-vapor street 
lights a softer, yellowish color. The intense red lines of neon are responsible for 
the red color of “neon signs.” 

Another possible experiment is to pass a beam of white light, containing all 
wavelengths, through a sample of a gas. When we do so, we find that certain 
wavelengths have been absorbed from the light, and again a line spectrum results. 
In this case there are dark lines, superimposed on the bright continuous spectrum, 
at the wavelengths where the absorption occurred. These wavelengths correspond 
to many (but not all) of the wavelengths seen in the emission spectrum. Examples 
of absorption spectra are shown in Figure 6.15. 

FIGURE 6.14 Apparatus for observing emission spectra. Light is emitted when an electric discharge is created in a tube 
containing a vapor of an element. The light passes through a dispersive medium, such as a prism or a diffraction grating, 
which displays the individual component wavelengths at different positions. Sample line spectra are shown for mercury 
and sodium in the visible and near ultraviolet. 

FIGURE 6.15 Apparatus for observing absorption spectra. A light source produces a continuous range of wavelengths, 
some of which are absorbed by a gaseous element. The light is dispersed, as in Figure 6.14. The result is a continuous 
“rainbow” spectrum, with dark lines at wavelengths where the light was absorbed by the gas. 

In general, the interpretation of line spectra is very difficult in complex atoms, 
and so we will deal for now with the line spectra of the simplest atom, hydrogen. 
Regularities appear in both the emission and absorption spectra, as shown in 
Figure 6.16. Notice that, as with the mercury and sodium spectra, some lines 
present in the emission spectrum are missing from the absorption spectrum. 

FIGURE 6.16 Emission and absorption spectral series of hydrogen. Note the 
regularities in the spacing of the spectral lines. The lines get closer together as the 
limit of each series (dashed line) is approached. Only the Lyman series appears 
in the absorption spectrum; all series are present in the emission spectrum. 


In 1885 Johannes Balmer, a Swiss schoolteacher, noticed (mostly by trial and 
error) that the wavelengths of the group of emission lines of hydrogen in the 
visible region could be calculated very accurately from the formula 

$$\lambda = (364.5\ \text{nm})\,\frac{n^2}{n^2 - 4} \qquad (n = 3, 4, 5, \ldots) \tag{6.20}$$

For example, for n = 3, the formula gives $\lambda$ = 656.1 nm, which corresponds 
exactly to the longest wavelength of the series of hydrogen lines in the visible 
region (see Figure 6.16). This formula is now known as the Balmer formula and 
the series of lines that it fits is called the Balmer series. The wavelength 364.5 nm, 
corresponding to n → ∞, is called the series limit (which is shown as the dashed 
line at the left end of the Balmer series in Figure 6.16). 
It was soon discovered that all of the groupings of lines in the hydrogen 
spectrum could be fit with a similar formula of the form 

$$\lambda = \lambda_{\text{limit}}\,\frac{n^2}{n^2 - n_0^2} \qquad (n = n_0 + 1,\ n_0 + 2,\ n_0 + 3, \ldots) \tag{6.21}$$

where $\lambda_{\text{limit}}$ is the wavelength of the appropriate series limit. For the Balmer series, 
$n_0 = 2$. The other series are today known as Lyman ($n_0 = 1$), Paschen ($n_0 = 3$), 
Brackett ($n_0 = 4$), and Pfund ($n_0 = 5$). These series of hydrogen spectral lines are 
shown in Figure 6.16. 
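
Equation 6.21 is easy to evaluate numerically. The following is a minimal sketch (the function name is hypothetical) that takes the series limits quoted in this section as inputs: 364.5 nm for the Balmer series and 820.1 nm for the Paschen series.

```python
def series_wavelength_nm(n, n0, limit_nm):
    """Wavelength from Eq. 6.21: lambda = lambda_limit * n^2 / (n^2 - n0^2)."""
    if n <= n0:
        raise ValueError("n must be larger than n0")
    return limit_nm * n**2 / (n**2 - n0**2)


if __name__ == "__main__":
    # Balmer series (n0 = 2), series limit 364.5 nm
    for n in (3, 4, 5):
        print(f"Balmer  n={n}: {series_wavelength_nm(n, 2, 364.5):.1f} nm")
    # Paschen series (n0 = 3), series limit 820.1 nm
    for n in (4, 5, 6):
        print(f"Paschen n={n}: {series_wavelength_nm(n, 3, 820.1):.1f} nm")
```

As n grows, the returned wavelength approaches `limit_nm`, which is the crowding of lines toward the series limit seen in Figure 6.16.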

Another interesting property of the hydrogen wavelengths is summarized in 
the Ritz combination principle. If we convert the hydrogen emission wavelengths 
to frequencies, we find the curious property that certain pairs of frequencies added 
together give other frequencies that appear in the spectrum. 

Any successful model of the hydrogen atom must be able to explain the 
occurrence of these interesting arithmetic regularities in the emission spectra. 





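Example 6.4 

The series limit of the Paschen series ($n_0 = 3$) is 820.1 nm. 
What are the three longest wavelengths of the Paschen series? 

Solution 

From Eq. 6.21, 

$$\lambda = (820.1\ \text{nm})\,\frac{n^2}{n^2 - 3^2} \qquad (n = 4, 5, 6, \ldots)$$

The three longest wavelengths are: 

$$n = 4:\quad \lambda = (820.1\ \text{nm})\,\frac{4^2}{4^2 - 3^2} = 1875\ \text{nm}$$

$$n = 5:\quad \lambda = (820.1\ \text{nm})\,\frac{5^2}{5^2 - 3^2} = 1281\ \text{nm}$$

$$n = 6:\quad \lambda = (820.1\ \text{nm})\,\frac{6^2}{6^2 - 3^2} = 1093\ \text{nm}$$

These transitions are in the infrared region of the electromagnetic spectrum. 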
Example 6.5 

Show that the longest wavelength of the Balmer series and the longest two 
wavelengths of the Lyman series satisfy the Ritz combination principle. For the 
Lyman series, $\lambda_{\text{limit}}$ = 91.13 nm. 

Solution 

Using Eq. 6.20 with n = 3, we find the longest wavelength of the Balmer series to be 
656.1 nm. Converting this to a frequency, we obtain 

$$f = \frac{c}{\lambda} = \frac{2.998 \times 10^{8}\ \text{m/s}}{(656.1\ \text{nm})(10^{-9}\ \text{m/nm})} = 4.57 \times 10^{14}\ \text{Hz}$$

Using Eq. 6.21 for n = 2 and 3 with $n_0 = 1$, we find the longest two wavelengths of 
the Lyman series and their corresponding frequencies to be 

$$n = 2:\quad \lambda = (91.13\ \text{nm})\,\frac{2^2}{2^2 - 1^2} = 121.5\ \text{nm}, \qquad f = \frac{c}{\lambda} = \frac{2.998 \times 10^{8}\ \text{m/s}}{(121.5\ \text{nm})(10^{-9}\ \text{m/nm})} = 24.67 \times 10^{14}\ \text{Hz}$$

$$n = 3:\quad \lambda = (91.13\ \text{nm})\,\frac{3^2}{3^2 - 1^2} = 102.5\ \text{nm}, \qquad f = \frac{c}{\lambda} = \frac{2.998 \times 10^{8}\ \text{m/s}}{(102.5\ \text{nm})(10^{-9}\ \text{m/nm})} = 29.24 \times 10^{14}\ \text{Hz}$$

Adding the smallest frequency of the Lyman series to the smallest frequency of the 
Balmer series gives the next smallest Lyman frequency: 

$$24.67 \times 10^{14}\ \text{Hz} + 4.57 \times 10^{14}\ \text{Hz} = 29.24 \times 10^{14}\ \text{Hz}$$

demonstrating the Ritz combination principle. 








6.5 THE BOHR MODEL 


Following Rutherford’s proposal that the mass and positive charge are concentrated 
in a very small region at the center of the atom, the Danish physicist Niels 
Bohr in 1913 (while working in Rutherford’s laboratory) suggested that the atom 
resembled a miniature planetary system, with the electrons circulating about the 
nucleus like planets circulating about the Sun. The atom thus doesn’t collapse 
under the influence of the electrostatic Coulomb force of the nucleus on the electrons 
for the same reason that the solar system doesn’t collapse under the influence 
of the gravitational force of the Sun on the planets. In both cases, the attractive force 
provides the centripetal acceleration necessary to maintain the orbital motion. 

As we discuss later, the Bohr model does not give a correct view of the 
actual structure and properties of atoms, but it represents an important first step 
in achieving an understanding of atoms. The correct view requires methods of 
quantum mechanics, which we discuss in Chapter 7. 

We consider for simplicity the hydrogen atom, with a single electron 
circulating about a nucleus that has a single positive charge, as in Figure 6.17. 
The radius of the circular orbit is r, and the electron (of mass m) moves with 
constant tangential speed v. The attractive Coulomb force provides the centripetal 
acceleration $v^2/r$, so 

$$F = \frac{1}{4\pi\varepsilon_0}\frac{|q_1||q_2|}{r^2} = \frac{1}{4\pi\varepsilon_0}\frac{e^2}{r^2} = \frac{mv^2}{r} \tag{6.22}$$

FIGURE 6.17 The Bohr model of the atom (Z = 1 for hydrogen). 

Niels Bohr (1885–1962, Denmark). 
He developed a successful theory 
of the radiation spectrum of atomic 
hydrogen and also contributed the 
concepts of stationary states and 
complementarity to quantum mechanics. 
Later he developed a successful theory 
of nuclear fission. The institute of 
theoretical physics he founded in 
Copenhagen attracts scholars from around 
the world. 


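Manipulating this equation, we can find the kinetic energy of the electron (we are 
assuming the more massive nucleus to remain at rest; more about this later): 

$$K = \frac{1}{2}mv^2 = \frac{1}{8\pi\varepsilon_0}\frac{e^2}{r} \tag{6.23}$$

The potential energy of the electron-nucleus system is the Coulomb potential 
energy: 

$$U = \frac{1}{4\pi\varepsilon_0}\frac{q_1 q_2}{r} = -\frac{1}{4\pi\varepsilon_0}\frac{e^2}{r} \tag{6.24}$$

The total energy E = K + U is obtained by adding Eqs. 6.23 and 6.24: 

$$E = K + U = \frac{1}{8\pi\varepsilon_0}\frac{e^2}{r} + \left(-\frac{1}{4\pi\varepsilon_0}\frac{e^2}{r}\right) = -\frac{1}{8\pi\varepsilon_0}\frac{e^2}{r} \tag{6.25}$$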


We have ignored one serious difficulty with this model thus far. Classical 
physics requires that an accelerated electric charge, such as our orbiting electron, 
must continuously radiate electromagnetic energy. As it radiates this energy, its 
total energy would decrease, the electron would spiral in toward the nucleus, 
and the atom would collapse. To overcome this difficulty, Bohr made a bold 
and daring hypothesis—he proposed that there are certain special states of 
motion, called stationary states, in which the electron may exist without radiating 
electromagnetic energy. In these states, according to Bohr, the angular momentum 
L of the electron takes values that are integer multiples of $\hbar$. In stationary states, 
the angular momentum of the electron may have magnitude $\hbar$, $2\hbar$, $3\hbar$, ..., but 
never such values as $2.5\hbar$ or $3.1\hbar$. This is called the quantization of angular 
momentum. 

In a circular orbit, the position vector r that locates the electron relative 
to the nucleus is always perpendicular to its linear momentum p. The angular 
momentum, which is defined as L = r × p, has magnitude L = rp = mvr when 
r is perpendicular to p. Thus Bohr’s postulate is 

$$mvr = n\hbar \tag{6.26}$$

where n is an integer (n = 1, 2, 3, ...). We can use this expression with Eq. 6.23 
for the kinetic energy 

$$\frac{1}{2}mv^2 = \frac{1}{2}m\left(\frac{n\hbar}{mr}\right)^2 = \frac{1}{8\pi\varepsilon_0}\frac{e^2}{r} \tag{6.27}$$

to find a series of allowed values of the radius r: 

$$r_n = \frac{4\pi\varepsilon_0\hbar^2}{me^2}\,n^2 = a_0 n^2 \qquad (n = 1, 2, 3, \ldots) \tag{6.28}$$

where the Bohr radius $a_0$ is defined as 

$$a_0 = \frac{4\pi\varepsilon_0\hbar^2}{me^2} = 0.0529\ \text{nm} \tag{6.29}$$

This important result is very different from what we expect from classical 
physics. A satellite may be placed into Earth orbit at any desired radius by 
boosting it to the appropriate altitude and then supplying the proper tangential 
speed. This is not true for an electron’s orbit—only certain radii are allowed by 


the Bohr model. The radius of the electron’s orbit may be $a_0$, $4a_0$, $9a_0$, $16a_0$, and 
so forth, but never $3a_0$ or $5.3a_0$. 
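
As a numerical check on Eqs. 6.28 and 6.29, the sketch below evaluates the Bohr radius from the fundamental constants. The constant values are CODATA figures typed in by hand (an assumption of this sketch, not taken from the text), and the function names are hypothetical.

```python
import math

# SI values of the constants (CODATA 2018), entered by hand for this sketch
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
HBAR = 1.054571817e-34       # reduced Planck constant, J s
M_E = 9.1093837015e-31       # electron mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C


def bohr_radius_m():
    """Bohr radius a0 = 4 pi eps0 hbar^2 / (m e^2), Eq. 6.29, in meters."""
    return 4 * math.pi * EPS0 * HBAR**2 / (M_E * E_CHARGE**2)


def orbit_radius_nm(n):
    """Allowed orbit radius r_n = a0 n^2, Eq. 6.28, in nanometers."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return bohr_radius_m() * 1e9 * n**2


if __name__ == "__main__":
    for n in (1, 2, 3, 4):
        print(f"n={n}: r = {orbit_radius_nm(n):.4f} nm")
```

The radii come out as 1, 4, 9, 16 times the first value, which is the pattern of allowed orbits described above.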
Substituting Eq. 6.28 for r into Eq. 6.25 gives the energy: 


$$E_n = -\frac{me^4}{32\pi^2\varepsilon_0^2\hbar^2}\,\frac{1}{n^2} = \frac{-13.60\ \text{eV}}{n^2} \qquad (n = 1, 2, 3, \ldots) \tag{6.30}$$





The energy levels calculated from Eq. 6.30 are shown in Figure 6.18. The 
electron’s energy is quantized: only certain energy values are possible. In its 
lowest level, with n = 1, the electron has energy $E_1$ = −13.60 eV and orbits with 
a radius of $r_1$ = 0.0529 nm. This state is the ground state. The higher states (n = 2 
with $E_2$ = −3.40 eV, n = 3 with $E_3$ = −1.51 eV, etc.) are the excited states. 

The excitation energy of an excited state n is the energy above the ground 
state, $E_n - E_1$. Thus the first excited state (n = 2) has excitation energy 

$$\Delta E = E_2 - E_1 = -3.40\ \text{eV} - (-13.60\ \text{eV}) = 10.20\ \text{eV}$$

the second excited state has excitation energy 

$$\Delta E = E_3 - E_1 = -1.51\ \text{eV} - (-13.60\ \text{eV}) = 12.09\ \text{eV}$$


and so forth. The excitation energy can also be regarded as the amount of energy 
that the atom must absorb for the electron to make an upward jump. For example, 
if the atom absorbs an energy of 10.20 eV when the electron is in the ground state 
(n = 1), the electron will jump upward to the first excited state (n = 2). 

The magnitude of an electron’s energy $|E_n|$ is sometimes called its binding 
energy; for example, the binding energy of an electron in the n = 2 state is 
3.40 eV. If the atom absorbs an amount of energy equal to the binding energy 
of the electron, the electron will be removed from the atom and become a 
free electron. The atom, minus its electron, is called an ion. The amount of 
energy needed to remove an electron from an atom is also called the ionization 
energy. Usually the ionization energy of an atom indicates the energy to remove 
an electron from the ground state. If the atom absorbs more energy than the 
minimum necessary to remove the electron, the excess energy appears as the 
kinetic energy of the now free electron. 

The binding energy can also be regarded as the energy that is released when the 
atom is assembled from an electron and a nucleus that are initially separated by a 
large distance. If we bring an electron from a large distance away (where E = 0) 
and place it in orbit in the state n where its energy has the negative value $E_n$, 
energy amounting to $|E_n|$ is released, usually in the form of one or more photons. 
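
The level energies, excitation energies, and binding energies quoted above all follow from Eq. 6.30. A minimal sketch, assuming the rounded ground-state value of −13.60 eV used in the text (function names are hypothetical):

```python
GROUND_STATE_EV = -13.60  # E_1 for hydrogen, the rounded value of Eq. 6.30


def level_ev(n):
    """Energy of level n from Eq. 6.30, in eV."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    return GROUND_STATE_EV / n**2


def excitation_ev(n):
    """Energy of level n above the ground state, E_n - E_1, in eV."""
    return level_ev(n) - level_ev(1)


def binding_ev(n):
    """Binding energy |E_n| of an electron in level n, in eV."""
    return abs(level_ev(n))


if __name__ == "__main__":
    for n in (1, 2, 3):
        print(f"n={n}: E = {level_ev(n):7.2f} eV, "
              f"excitation = {excitation_ev(n):5.2f} eV, "
              f"binding = {binding_ev(n):5.2f} eV")
```

For any level, the excitation energy and the binding energy add up to 13.60 eV, the ionization energy from the ground state.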


The Hydrogen Wavelengths in the 
Bohr Model 


We previously discussed the emission and absorption spectra of atomic hydrogen, 
and our discussion of the Bohr model is not complete without an understanding of 
the origin of these spectra. Bohr postulated that, even though the electron doesn’t 
radiate when it remains in any particular stationary state, it can emit radiation 
when it moves to a lower energy level. In the lower level, the electron has less 
energy than in the original level, and the energy difference appears as a quantum 
of radiation whose energy hf is equal to the energy difference between the levels. 
That is, if the electron jumps from $n = n_1$ to $n = n_2$, as in Figure 6.19, a photon 
appears with energy 

$$hf = E_{n_1} - E_{n_2} \tag{6.31}$$

or, using Eq. 6.30 for the energies, 

$$f = \frac{me^4}{64\pi^3\varepsilon_0^2\hbar^3}\left(\frac{1}{n_2^2} - \frac{1}{n_1^2}\right) \tag{6.32}$$

The wavelength of the emitted radiation is 

$$\lambda = \frac{c}{f} = \frac{64\pi^3\varepsilon_0^2\hbar^3 c}{me^4}\left(\frac{n_1^2 n_2^2}{n_1^2 - n_2^2}\right) = \frac{1}{R_\infty}\left(\frac{n_1^2 n_2^2}{n_1^2 - n_2^2}\right) \tag{6.33}$$

where $R_\infty$ is called the Rydberg constant 

$$R_\infty = \frac{me^4}{64\pi^3\varepsilon_0^2\hbar^3 c} \tag{6.34}$$

The presently accepted numerical value is 

$$R_\infty = 1.097373 \times 10^{7}\ \text{m}^{-1}$$

FIGURE 6.18 The energy levels of atomic hydrogen, showing the excitation energy 
of the electron from n = 1 to n = 2 and the binding energy of the n = 2 electron. 

FIGURE 6.19 An electron jumps from the state $n_1$ to the state $n_2$ as a 
photon is emitted. 

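Example 6.6 

Find the wavelengths of the transitions from $n_1 = 3$ to $n_2 = 2$ and from 
$n_1 = 4$ to $n_2 = 2$ in atomic hydrogen. 

Solution 

For $n_1 = 3$ and $n_2 = 2$, Eq. 6.33 gives 

$$\lambda = \frac{1}{R_\infty}\left(\frac{n_1^2 n_2^2}{n_1^2 - n_2^2}\right) = \frac{1}{1.097373 \times 10^{7}\ \text{m}^{-1}}\left(\frac{3^2\, 2^2}{3^2 - 2^2}\right) = 656.1\ \text{nm}$$

and for $n_1 = 4$ and $n_2 = 2$, 

$$\lambda = \frac{1}{R_\infty}\left(\frac{n_1^2 n_2^2}{n_1^2 - n_2^2}\right) = \frac{1}{1.097373 \times 10^{7}\ \text{m}^{-1}}\left(\frac{4^2\, 2^2}{4^2 - 2^2}\right) = 486.0\ \text{nm}$$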





These wavelengths are remarkably close to the values of the two longest 
wavelengths of the Balmer series (Figure 6.16). In fact, Eq. 6.33 gives 

$$\lambda = (364.5\ \text{nm})\,\frac{n_1^2}{n_1^2 - 4}$$

for the wavelength of a transition from any state $n_1$ to $n_2 = 2$. This is identical 
with Eq. 6.21 for the Balmer series. Thus we see that the radiations identified as 
the Balmer series correspond to transitions from higher levels to the n = 2 level. 
Similar identifications can be made for other series of radiations, as shown in 
Figure 6.20. This association between the transitions expected according to the 
Bohr model and the observed wavelengths (as in Figure 6.16) represents a huge 
triumph for the model. 
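
Equation 6.33 can be evaluated directly for any pair of levels. The sketch below (hypothetical function names) uses the value of the Rydberg constant quoted above and reproduces both the first two Balmer lines and the Balmer series limit.

```python
R_INF = 1.097373e7  # Rydberg constant in m^-1, as quoted in the text


def transition_wavelength_nm(n1, n2):
    """Wavelength for a jump from level n1 down to level n2 (Eq. 6.33), in nm."""
    if n1 <= n2:
        raise ValueError("emission requires n1 > n2")
    wavelength_m = (n1**2 * n2**2) / (R_INF * (n1**2 - n2**2))
    return wavelength_m * 1e9


def series_limit_nm(n2):
    """Limit of Eq. 6.33 as n1 grows without bound: n2^2 / R_inf, in nm."""
    return n2**2 / R_INF * 1e9


if __name__ == "__main__":
    print(f"3 -> 2: {transition_wavelength_nm(3, 2):.1f} nm")
    print(f"4 -> 2: {transition_wavelength_nm(4, 2):.1f} nm")
    print(f"Balmer limit: {series_limit_nm(2):.1f} nm")
    print(f"Lyman limit:  {series_limit_nm(1):.2f} nm")
```

The two series limits agree with the empirical values 364.5 nm and 91.13 nm used earlier in the chapter, which is one way to see that Eq. 6.33 contains the Balmer-type formula of Eq. 6.21.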

The Bohr formulas also explain the Ritz combination principle, according to 
which certain frequencies in the emission spectrum can be summed to give other 
frequencies. Let us consider a transition from a state $n_3$ to a state $n_2$ that is followed 
by a transition from $n_2$ to $n_1$. Equation 6.32 can be used for this case to give 

$$f_{n_3 \to n_2} = cR_\infty\left(\frac{1}{n_2^2} - \frac{1}{n_3^2}\right)$$

$$f_{n_2 \to n_1} = cR_\infty\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right)$$

Thus 

$$f_{n_3 \to n_2} + f_{n_2 \to n_1} = cR_\infty\left(\frac{1}{n_2^2} - \frac{1}{n_3^2}\right) + cR_\infty\left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right) = cR_\infty\left(\frac{1}{n_1^2} - \frac{1}{n_3^2}\right)$$

which is equal to the frequency of the single photon emitted in a direct transition 
from $n_3$ to $n_1$, so 

$$f_{n_3 \to n_2} + f_{n_2 \to n_1} = f_{n_3 \to n_1} \tag{6.35}$$

FIGURE 6.20 The transitions of the Lyman and Balmer series in hydrogen. The 
series limit is shown at the right of each group. 





The Bohr model is thus entirely consistent with the Ritz combination principle. 
The frequency of an emitted photon is related to its energy by E = hf, so the 
summing of frequencies is equivalent to the summing of energies. We may thus 
restate the Ritz combination principle in terms of energy: The energy of a photon 
emitted in a transition that skips or crosses over one or more states is equal to the 
step-by-step sum of the energies of the transitions connecting all of the individual 
states. (See Problem 25.) 
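
The energy form of the Ritz combination principle can be verified numerically for any chain of levels. This is a small sketch under the assumption of the Bohr energies of Eq. 6.30 with the rounded value 13.60 eV; the function name is hypothetical.

```python
def photon_energy_ev(n_upper, n_lower):
    """Photon energy E_upper - E_lower for hydrogen, from Eq. 6.30, in eV."""
    if n_upper <= n_lower:
        raise ValueError("emission requires n_upper > n_lower")
    return -13.60 * (1.0 / n_upper**2 - 1.0 / n_lower**2)


if __name__ == "__main__":
    direct = photon_energy_ev(3, 1)
    stepwise = photon_energy_ev(3, 2) + photon_energy_ev(2, 1)
    print(f"3 -> 1 directly:     {direct:.2f} eV")
    print(f"3 -> 2 then 2 -> 1:  {stepwise:.2f} eV")
```

The two printed values agree because the intermediate level energy cancels in the sum, which is the same cancellation that leads to Eq. 6.35.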

The Bohr model also helps us understand why the atom doesn’t absorb and 
emit radiation at all the same wavelengths. Isolated atoms are normally found 
only in the ground state; the excited states live for a very short time (less than 
10~° s) before decaying to the ground state. The absorption spectrum therefore 
contains only transitions from the ground state. From Figure 6.20, we see that 
only the radiations of the Lyman series can be found in the absorption spectrum of 
hydrogen. A hydrogen atom in its ground state can absorb radiation of 10.20 eV 
and reach the first excited state, or of 12.09 eV and reach the second excited state, 
and so forth. A hydrogen atom cannot absorb a photon of energy 1.89 eV (the first 
line of the Balmer series), because the atom is originally not in the n = 2 level. 
The Balmer series is therefore not found in the absorption spectrum. 


Atoms with Z > 1 


The Bohr theory for hydrogen can be used for any atom with a single electron, 
even if the nuclear charge Z is greater than 1. For example, we can calculate 
the energy levels of singly ionized helium (helium with one electron removed), 
doubly ionized lithium, and so on. The nuclear electric charge enters the Bohr 
theory in only one place—in the expression for the electrostatic force between 
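nucleus and electron, Eq. 6.22. For a nucleus of charge Ze, the Coulomb force 
acting on the electron is 

$$F = \frac{1}{4\pi\varepsilon_0}\frac{|q_1||q_2|}{r^2} = \frac{1}{4\pi\varepsilon_0}\frac{Ze^2}{r^2} \tag{6.36}$$

That is, where we had $e^2$ previously, we now have $Ze^2$. Making the same 
substitution in the final results, we can find the allowed radii: 

$$r_n = \frac{4\pi\varepsilon_0\hbar^2}{Ze^2 m}\,n^2 = \frac{a_0 n^2}{Z} \tag{6.37}$$

and the energies become 

$$E_n = -\frac{m(Ze^2)^2}{32\pi^2\varepsilon_0^2\hbar^2}\,\frac{1}{n^2} = -(13.60\ \text{eV})\,\frac{Z^2}{n^2} \tag{6.38}$$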


The orbits in the higher-Z atoms are closer to the nucleus and have larger 
(negative) energies; that is, the electron is more tightly bound to the nucleus. 
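
The Z scaling of Eqs. 6.37 and 6.38 can be sketched in a few lines. The function names are hypothetical, the Bohr radius and ground-state energy are the rounded values from Eqs. 6.29 and 6.30, and the conversion from photon energy to wavelength assumes the standard rounded value hc = 1240 eV nm.

```python
A0_NM = 0.0529       # Bohr radius in nm, Eq. 6.29
RYDBERG_EV = 13.60   # magnitude of the hydrogen ground-state energy, Eq. 6.30
HC_EV_NM = 1240.0    # hc in eV nm (rounded; an assumption of this sketch)


def radius_nm(n, Z=1):
    """Orbit radius r_n = a0 n^2 / Z, Eq. 6.37, in nm."""
    return A0_NM * n**2 / Z


def level_ev(n, Z=1):
    """Level energy E_n = -(13.60 eV) Z^2 / n^2, Eq. 6.38."""
    return -RYDBERG_EV * Z**2 / n**2


def wavelength_nm(n_upper, n_lower, Z=1):
    """Wavelength of the photon emitted in the jump n_upper -> n_lower, in nm."""
    if n_upper <= n_lower:
        raise ValueError("emission requires n_upper > n_lower")
    return HC_EV_NM / (level_ev(n_upper, Z) - level_ev(n_lower, Z))


if __name__ == "__main__":
    # Singly ionized helium (Z = 2): smaller orbit, more tightly bound
    print(f"He+ ground state: r = {radius_nm(1, 2):.4f} nm, "
          f"E = {level_ev(1, 2):.1f} eV")
    # Triply ionized beryllium (Z = 4), transitions ending on n = 2
    print(f"Be3+ 3 -> 2: {wavelength_nm(3, 2, 4):.1f} nm")
    print(f"Be3+ 4 -> 2: {wavelength_nm(4, 2, 4):.1f} nm")
```

Because the energies scale as Z^2, every wavelength of a one-electron ion is shorter than the corresponding hydrogen wavelength by a factor of Z^2.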


Example 6.7 

Calculate the two longest wavelengths of the Balmer series of triply ionized 
beryllium (Z = 4). 

Solution 

The radiations of the Balmer series end with the n = 2 level, and so the two longest 
wavelengths are the radiations corresponding to n = 3 → n = 2 and n = 4 → n = 2. 
The energies of the radiations and their corresponding wavelengths are 

$$E_3 - E_2 = -(13.60\ \text{eV})(4^2)\left(\frac{1}{3^2} - \frac{1}{2^2}\right) = 30.2\ \text{eV}, \qquad \lambda = \frac{hc}{E} = \frac{1240\ \text{eV}\cdot\text{nm}}{30.2\ \text{eV}} = 41.0\ \text{nm}$$

$$E_4 - E_2 = -(13.60\ \text{eV})(4^2)\left(\frac{1}{4^2} - \frac{1}{2^2}\right) = 40.8\ \text{eV}, \qquad \lambda = \frac{hc}{E} = \frac{1240\ \text{eV}\cdot\text{nm}}{40.8\ \text{eV}} = 30.4\ \text{nm}$$

These radiations are in the ultraviolet region. 
Note that we cannot use Eq. 6.33 to find the wavelengths, 
because that equation applies only to hydrogen (Z = 1). 





6.6 THE FRANCK-HERTZ EXPERIMENT 


Let us imagine the following experiment, performed with the apparatus shown 
schematically in Figure 6.21. A filament heats the cathode, which then emits 
electrons. These electrons are accelerated toward the grid by the potential 
difference V, which we control. Electrons pass through the grid and reach the 
plate if V exceeds $V_0$, a small retarding voltage between the grid and the plate. 
The current of electrons reaching the plate is measured using the ammeter A. 
Now suppose the tube is filled with atomic hydrogen gas at a low pressure. As 
the voltage is increased from zero, more and more electrons reach the plate, and 
the current rises accordingly. The electrons inside the tube may make collisions 
with atoms of hydrogen, but lose no energy in these collisions —the collisions are 
perfectly elastic. The only way the electron can give up energy in a collision is if 
the electron has enough energy to cause the hydrogen atom to make a transition to 
an excited state. Thus, when the energy of the electrons reaches and barely exceeds 
10.2 eV (or when the voltage reaches 10.2 V), the electrons can make inelastic 
collisions, leaving 10.2 eV of energy with the atom (now in the n = 2 level), and 
the original electron moves off with very little energy. If it should pass through the 
grid, the electron might not have sufficient energy to overcome the small retarding 
potential and reach the plate. Thus when V = 10.2 V, a drop in the current 
is observed. As V is increased further, we begin to see the effects of multiple 
collisions. That is, when V = 20.4 V, an electron can make an inelastic collision, 
leaving the atom in the n = 2 state. The electron loses 10.2 eV of energy in this 
process, and so it moves off after the collision with a remaining 10.2 eV of energy, 
which is sufficient to excite a second hydrogen atom in an inelastic collision. Thus, 
if a drop in the current is observed at V, similar drops are observed at 2V,3V,.... 
This experiment should thus give rather direct evidence for the existence of 
atomic excited states. Unfortunately, it is not easy to do this experiment with 
hydrogen, because hydrogen occurs naturally in the molecular form H,, rather 
than in atomic form. The molecules can absorb energy in a variety of ways, which 
would confuse the interpretation of the experiment. A similar experiment was 
done in 1914 by James Franck and Gustav Hertz, using a tube filled with mercury 
vapor. Their results are shown in Figure 6.22, which gives clear evidence for an 
excited state at 4.9eV; whenever the voltage is a multiple of 4.9 V, a drop in 
the current appears. Coincidentally, the emission spectrum of mercury shows an 
intense ultraviolet line of wavelength 254nm, which corresponds to an energy 
of 4.9 eV; this results from a transition between the same 4.9-eV excited state 
and the ground state. The Franck-Hertz experiment showed that an electron must 
have a certain minimum energy to make an inelastic collision with an atom; we 
now interpret that minimum energy as the energy of an excited state of the atom. 
Franck and Hertz were awarded the 1925 Nobel Prize in physics for this work. 
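
The link between the 254-nm mercury line and the 4.9-V spacing of the current drops can be checked with a short calculation. This sketch uses hypothetical function names and assumes the rounded value hc = 1240 eV nm; the list of dip voltages is the idealized pattern described above (integer multiples of the excitation energy), not a model of the measured curve.

```python
HC_EV_NM = 1240.0  # hc in eV nm (rounded; an assumption of this sketch)


def photon_energy_ev(wavelength_nm):
    """Photon energy E = hc / lambda, in eV."""
    return HC_EV_NM / wavelength_nm


def dip_voltages(excitation_ev, v_max):
    """Idealized voltages (in V) at which the current drops, up to v_max."""
    voltages = []
    k = 1
    while k * excitation_ev <= v_max:
        voltages.append(k * excitation_ev)
        k += 1
    return voltages


if __name__ == "__main__":
    print(f"254 nm photon: {photon_energy_ev(254.0):.2f} eV")
    print("Mercury dips up to 15 V:",
          [round(v, 1) for v in dip_voltages(4.9, 15.0)])
    print("Hydrogen dips up to 25 V:",
          [round(v, 1) for v in dip_voltages(10.2, 25.0)])
```

The photon energy comes out within about 0.02 eV of 4.9 eV, consistent with the statement that the ultraviolet line and the current drops involve the same excited state.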

























FIGURE 6.21 Franck-Hertz apparatus. Electrons leave the cathode C, are 
accelerated by the voltage V toward the grid G, and reach the plate P where 
they are recorded on the ammeter A. 

FIGURE 6.22 Result of Franck-Hertz experiment using mercury vapor. The 
current drops at voltages of 4.9 V, 9.8 V (= 2 × 4.9 V), 14.7 V (= 3 × 4.9 V). 

*6.7 THE CORRESPONDENCE PRINCIPLE 


We have seen how Bohr’s model permits calculations of transition wavelengths in 
atomic hydrogen that are in excellent agreement with the wavelengths observed in 
the emission and absorption spectra. However, in order to obtain this agreement, 
Bohr had to introduce postulates that were radical departures from classical 
physics. In particular, according to classical physics an accelerated charged 
particle radiates electromagnetic energy, but an electron in Bohr’s atomic model, 
accelerated as it moves in a circular orbit, does not radiate (unless it jumps to 
another orbit). Here we have a very different case than we did in our study 
of special relativity. You will recall, for example, that relativity gives us one 
expression for the kinetic energy, $K = E - E_0$, and classical physics gives us 
another, $K = \frac{1}{2}mv^2$; however, we showed that $E - E_0$ reduces to $\frac{1}{2}mv^2$ when 
$v \ll c$. Thus these two expressions are really not very different; one is merely 
a special case of the other. The dilemma associated with the accelerated electron 
is not simply a matter of atomic physics (as an example of quantum physics) 
being a special case of classical physics. Either the accelerated charge radiates, 
or it doesn’t! Bohr’s solution to this serious dilemma was to propose the 
correspondence principle, which states that 


Quantum theory must agree with classical theory in the limit in which 
classical theory is known to agree with experiment, 


or equivalently, 


Quantum theory must agree with classical theory in the limit of large 
quantum numbers. 


Let us see how we can apply this principle to the Bohr atom. According to 
classical physics, an electric charge moving in a circle radiates at a frequency 
equal to its frequency of rotation. For an atomic orbit, the period of revolution is 
the distance traveled in one orbit, $2\pi r$, divided by the orbital speed $v = \sqrt{2K/m}$, 
where K is the kinetic energy: 

$$T = \frac{2\pi r}{\sqrt{2K/m}} = \frac{\sqrt{16\pi^3\varepsilon_0 m r^3}}{e} \tag{6.39}$$

where we use Eq. 6.23 for the kinetic energy. The frequency f is the inverse of 
the period: 

$$f = \frac{1}{T} = \frac{e}{\sqrt{16\pi^3\varepsilon_0 m r^3}} \tag{6.40}$$

Using Eq. 6.28 for the radii of the allowed orbits, we find 

$$f_n = \frac{me^4}{32\pi^3\varepsilon_0^2\hbar^3}\,\frac{1}{n^3} \tag{6.41}$$

A “classical” electron moving in an orbit of radius $r_n$ would radiate at this 
frequency $f_n$. 


*This is an optional section that may be skipped without loss of continuity. 

If we made the radius of the Bohr atom so large that it went from a quantum-sized 
object ($10^{-10}$ m) to a laboratory-sized object ($10^{-3}$ m), the atom should 
behave classically. The radius increases with increasing n like $n^2$, so this classical 
behavior should occur for n in the range $10^3$ to $10^4$. Let us then calculate the 
frequency of the radiation emitted by such an atom when the electron drops from 
the orbit n to the orbit n − 1. According to Eq. 6.32, the frequency is 

$$f = \frac{me^4}{64\pi^3\varepsilon_0^2\hbar^3}\left(\frac{1}{(n-1)^2} - \frac{1}{n^2}\right) = \frac{me^4}{64\pi^3\varepsilon_0^2\hbar^3}\,\frac{2n - 1}{n^2 (n-1)^2} \tag{6.42}$$

If n is very large, then we can approximate n − 1 by n and 2n − 1 by 2n, which 
gives 

$$f \cong \frac{me^4}{64\pi^3\varepsilon_0^2\hbar^3}\,\frac{2n}{n^4} = \frac{me^4}{32\pi^3\varepsilon_0^2\hbar^3}\,\frac{1}{n^3} \tag{6.43}$$

This is identical with Eq. 6.41 for the “classical” frequency. The “classical” 
electron spirals slowly in toward the nucleus, radiating at the frequency given by 
Eq. 6.41, while the “quantum” electron jumps from the orbit n to the orbit n − 1 
and then to the orbit n − 2, and so forth, radiating at the frequency given by the 
identical Eq. 6.43. (When the circular orbits are very large, this jumping from 
one circular orbit to the next smaller one looks very much like a spiral, as in 
Figure 6.23.) 
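
How quickly the quantum frequency approaches the classical one can be seen numerically. The sketch below compares the classical orbital frequency, proportional to 2/n^3, with the frequency of the n to n − 1 jump from Eq. 6.32. The constants are CODATA values typed in by hand and the function names are hypothetical; both are assumptions of this sketch.

```python
import math

# SI values of the constants (CODATA 2018), entered by hand for this sketch
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
HBAR = 1.054571817e-34       # reduced Planck constant, J s
M_E = 9.1093837015e-31       # electron mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C

# Common prefactor m e^4 / (64 pi^3 eps0^2 hbar^3) of Eq. 6.32, in Hz
PREFACTOR_HZ = M_E * E_CHARGE**4 / (64 * math.pi**3 * EPS0**2 * HBAR**3)


def classical_frequency_hz(n):
    """Orbital (classical radiation) frequency for the orbit of radius a0 n^2."""
    return 2 * PREFACTOR_HZ / n**3


def quantum_frequency_hz(n):
    """Frequency of the photon emitted in the jump n -> n - 1 (Eq. 6.32).

    Written as (2n - 1) / (n^2 (n - 1)^2) to avoid subtracting two
    nearly equal numbers at large n.
    """
    if n < 2:
        raise ValueError("the jump n -> n - 1 needs n >= 2")
    return PREFACTOR_HZ * (2 * n - 1) / (n**2 * (n - 1) ** 2)


if __name__ == "__main__":
    for n in (2, 10, 100, 1000, 10000):
        ratio = quantum_frequency_hz(n) / classical_frequency_hz(n)
        print(f"n = {n:5d}: quantum / classical = {ratio:.5f}")
```

The ratio equals n(2n − 1)/(2(n − 1)^2), which is 3 for n = 2 and differs from 1 by less than 0.2% once n reaches 1000.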

In the region of large n, where classical and quantum physics overlap, the 
classical and quantum expressions for the radiation frequencies are identical. 
This is an example of an application of Bohr’s correspondence principle. The 
applications of the correspondence principle go far beyond the Bohr atom, and 
this principle is important in understanding how we get from the domain in which 
the laws of classical physics are valid to the domain in which the laws of quantum 
physics are valid. 

FIGURE 6.23 (Top) A large quantum atom. Photons are emitted in discrete 
transitions as the electron jumps to lower states. (Bottom) A classical 
atom. Photons are emitted continuously by the accelerated electron. 


6.8 DEFICIENCIES OF THE BOHR MODEL 


The Bohr model gives us a picture of how electrons move about the nucleus, 
and many of our attempts to explain the behavior of atoms refer to this picture, 
even though it is not strictly correct. Our presentation ignored two effects that 
must be included to improve the accuracy of the model. Other deficiencies in the 
model cannot be so easily fixed, because they are inconsistent with the correct 
quantum-mechanical picture, which is presented in the next chapter. 


1. Motion of the Proton. Our model was based on an electron orbiting around 
a fixed proton, but actually the electron and proton both orbit about their center 
of mass (just as the Earth and Sun orbit around their center of mass). The kinetic 
energy should thus include a term describing the motion of the proton. We can 
account for this effect if the mass that appears in the equation for the energy 
levels (Eq. 6.30) is not the electron mass but instead is the reduced mass of the 
proton-electron system, calculated from the electron mass $m_e$ and proton mass $m_p$ 
according to 

$$m = \frac{m_e m_p}{m_e + m_p} \tag{6.44}$$

The reduced mass is just slightly smaller than the electron mass and has the 
effect of decreasing the energy and frequency or increasing the wavelength by 
about 0.05%. Equivalently, in Eq. 6.33 for the wavelengths we can replace the 
Rydberg constant $R_\infty$ (so called because it would be correct if the proton mass 
were infinite) with the value $R = R_\infty/(1 + m_e/m_p)$. 

2. Wavelengths in Air. Another small but easily fixable error occurs when we 
convert the frequencies (Eq. 6.32) calculated directly from the Bohr energy levels 
to wavelengths (Eq. 6.33). The wavelength measurements are normally done in 
air, so we should calculate the wavelength as $\lambda = v_{\text{air}}/f$, where $v_{\text{air}}$ is the speed of 
light in air. This has the effect of decreasing the calculated wavelengths by about 
0.03% (to some extent offsetting the error we made by ignoring the motion of the 
proton). 

3. Angular Momentum. A serious failure of the Bohr model is that it gives 
incorrect predictions for the angular momentum of the electron. In Bohr’s theory, 
the orbital angular momentum is quantized in integer multiples of $\hbar$, which is 
correct. However, for the ground state of hydrogen (n = 1), the Bohr theory gives 
$L = \hbar$, while experiment clearly shows L = 0. 

4. Uncertainty. Another deficiency of the model is that it violates the uncer- 
tainty relationship. (In Bohr’s defense, remember that the model was developed a 
decade before the introduction of wave mechanics, with its accompanying ideas 
of uncertainty.) Suppose the electron orbits in the xy plane. In this case we know 
exactly its z coordinate (in the xy plane, z = 0 and so $\Delta z = 0$) and the z component 
of its momentum (also precisely zero, so $\Delta p_z = 0$). Such an atom would therefore 
violate the uncertainty relationship $\Delta z\,\Delta p_z \geq \hbar$. In fact, as we discuss in the next 
chapter, quantum mechanics introduces a degree of “fuzziness” to the behavior 
of electrons in atoms that is not consistent with any orbit in a single plane. 
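
The sizes of the two small corrections in items 1 and 2 can be estimated for the first Balmer line. The sketch below is illustrative only: the mass ratio (1/1836.15) and the index of refraction of air (1.0003) are assumed round values not given in the text, and the function names are hypothetical.

```python
R_INF = 1.097373e7             # Rydberg constant for infinite nuclear mass, m^-1
MASS_RATIO = 1.0 / 1836.15     # m_e / m_p (assumed value for this sketch)
N_AIR = 1.0003                 # index of refraction of air (assumed value)


def rydberg_hydrogen():
    """Rydberg constant corrected for the finite proton mass, in m^-1."""
    return R_INF / (1.0 + MASS_RATIO)


def balmer_alpha_nm(rydberg, n_index=1.0):
    """Wavelength of the 3 -> 2 line (Eq. 6.33) in a medium of index n_index."""
    vacuum_nm = 1e9 * (3**2 * 2**2) / (rydberg * (3**2 - 2**2))
    return vacuum_nm / n_index


if __name__ == "__main__":
    fixed_proton = balmer_alpha_nm(R_INF)
    reduced_mass = balmer_alpha_nm(rydberg_hydrogen())
    in_air = balmer_alpha_nm(rydberg_hydrogen(), N_AIR)
    print(f"fixed proton, vacuum: {fixed_proton:.2f} nm")
    print(f"reduced mass, vacuum: {reduced_mass:.2f} nm "
          f"(+{100 * (reduced_mass / fixed_proton - 1):.3f}%)")
    print(f"reduced mass, in air: {in_air:.2f} nm "
          f"({100 * (in_air / reduced_mass - 1):.3f}%)")
```

The reduced-mass correction lengthens the wavelength by about 0.05%, and the conversion to air shortens it by about 0.03%, so the two effects partly cancel, as noted in item 2.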


In spite of its successes, the Bohr model is at best an incomplete model. It is 
useful only for atoms that contain one electron (hydrogen, singly ionized helium, 
doubly ionized lithium, and so forth), but not for atoms with two or more electrons, 
because we have considered only the force between electron and nucleus, and 
not the force between the electrons themselves. Furthermore, if we look very 
carefully at the emission spectrum, we find that many lines are in fact not single 
lines, but very closely spaced combinations of two or more lines; the Bohr model 
is unable to account for these doublets of spectral lines. The model is also limited 
in its usefulness as a basis from which to calculate other properties of the atom; 
although we can accurately calculate the energies of the spectral lines, we cannot 
calculate their intensities. For example, how often will an electron in the n = 3 
state jump directly to the n = 1 state, emitting the corresponding photon, and how 
often will it jump first to the n = 2 state and then to the n = 1 state, emitting two 
photons? A complete theory should provide a way to calculate this property. 

We do not wish, however, to discard the model completely. The Bohr model 
provides a useful starting point in our study of atoms, and Bohr introduced several 
ideas (stationary states, quantization of angular momentum, correspondence 
principle) that carry over into the correct quantum-mechanical calculation. There 
are many atomic properties, especially those associated with magnetism, that can 
be simply modeled on the basis of Bohr orbits. Most remarkably, when we treat 
the hydrogen atom correctly in the next chapter using quantum mechanics, we 
find that the energy levels calculated by solving the Schrödinger equation are in
fact identical with those of the Bohr model. 

1. Does the Thomson model fail at large scattering angles or at
   small scattering angles? Why?

2. What principles of physics would be violated if we scattered
   a beam of alpha particles with a single impact parameter
   from a single target atom at rest?

3. Could we use the Rutherford scattering formula to analyze
   the scattering of (a) protons incident on iron? (b) Alpha
   particles incident on lithium (Z = 3)? (c) Silver nuclei incident
   on gold? (d) Hydrogen atoms incident on gold? (e) Electrons
   incident on gold?

4. What determines the angular range $d\theta$ in the alpha-particle
   scattering experiment (Figure 6.8)?

5. Why didn’t Bohr use the concept of de Broglie waves in his
   theory?

6. In which Bohr orbit does the electron have the largest
   velocity? Are we justified in treating the electron
   nonrelativistically in that case?

7. How does an electron in hydrogen get from $r = 4a_0$ to
   $r = a_0$ without being anywhere in between?

8. How is the quantization of the energy in the hydrogen
   atom similar to the quantization of the systems discussed
   in Chapter 5? How is it different? Do the quantizations
   originate from similar causes?

9. In a Bohr atom, an electron jumps from state $n_1$, with angular
   momentum $n_1\hbar$, to state $n_2$, with angular momentum $n_2\hbar$.
   How can an isolated system change its angular momentum?
   (In classical physics, a change in angular momentum requires
   an external torque.) Can the photon carry away the
   difference in angular momentum? Estimate the maximum angular
   momentum, relative to the center of the atom, that the
   photon can have. Does this suggest another failure of the Bohr
   model?

10. The product $E_n r_n$ for the hydrogen atom is (1) independent
    of Planck’s constant and (2) independent of the quantum
    number n. Does this observation have any significance? Is
    this a classical or a quantum effect?

11. (a) How does a Bohr atom violate the position-momentum
    uncertainty relationship? (b) How does a Bohr atom
    violate the energy-time uncertainty relationship? (What is $\Delta E$?
    What does this imply about $\Delta t$? What do you conclude
    about transitions between levels?)

12. List the assumptions made in deriving the Bohr theory.
    Which of these are a result of neglecting small quantities?
    Which of these violate basic principles of relativity or
    quantum physics?

13. List the assumptions made in deriving the Rutherford
    scattering formula. Which of these are a result of neglecting
    small quantities? Which of these violate basic principles of
    relativity or quantum physics?

14. In both the Rutherford theory and the Bohr theory, we used
    the classical expression for the kinetic energy. Estimate the
    velocity of an electron in the Bohr atom and of an alpha
    particle in a typical scattering experiment, and decide whether
    the use of the classical formula is justified.

15. In both the Rutherford theory and the Bohr theory, we
    neglected any wave properties of the particles. Estimate the
    de Broglie wavelength of an electron in a Bohr atom and
    compare it with the size of the atom. Estimate the de Broglie
    wavelength of an alpha particle and compare it with the
    size of the nucleus. Is the wave behavior expected to be
    important in either case?

16. What is the distinction between binding energy and
    ionization energy? Between binding energy and excitation energy?
    If you were given the value of the binding energy of a level
    in hydrogen, could you find its excitation energy without
    knowing which level it is?

17. Why are the decreases in current in the Franck-Hertz
    experiment not sharp?

18. As indicated by the Franck-Hertz experiment, the first
    excited state of mercury is at an energy of 4.9 eV. Do
    you expect mercury to show absorption lines in the visible
    spectrum?

19. Is the correspondence principle a necessary part of
    quantum physics or is it merely an accidental agreement of two
    formulas? Where do we draw the line between the world
    of quantum physics and the world of classical, nonquantum
    physics?


6.1 Basic Properties of Atoms

1. Electrons in atoms are known to have kinetic energies in
   the range of a few eV. Show that the uncertainty principle
   allows electrons of this energy to be confined in a region the
   size of an atom (0.1 nm).


6.2 Scattering Experiments and the Thomson Model

2. Consider an electron in Figure 6.1 embedded in a sphere of
   positive charge Ze at a distance r from its center. (a) Using
   Gauss’s law, show that the electric field on the electron due
   to the positive charge is

   $$E = \frac{1}{4\pi\varepsilon_0}\,\frac{Ze}{R^3}\,r$$

   (b) For this electric field, show that the force on the electron
   is given by Eq. 6.2.

3. (a) Compute the oscillation frequency of the electron and the
   expected absorption or emission wavelength in a Thomson-model
   hydrogen atom. Use R = 0.053 nm. Compare with
   the observed wavelength of the strongest emission and
   absorption line in hydrogen, 122 nm. (b) Repeat for sodium
   (Z = 11). Use R = 0.18 nm. Compare with the observed
   wavelength, 590 nm.

4. Consider the Thomson model for an atom with 2 electrons.
   Let the electrons be located along a diameter on opposite
   sides of the center of the sphere, each a distance x from the
   center. (a) Show that the configuration is stable if x = R/2.
   (b) Try to construct similar stable configurations for atoms
   with 3, 4, 5, and 6 electrons.


6.3 The Rutherford Nuclear Atom

5. Alpha particles of kinetic energy 5.00 MeV are scattered
   at 90° by a gold foil. (a) What is the impact parameter?
   (b) What is the minimum distance between alpha particles
   and gold nucleus? (c) Find the kinetic and potential energies
   at that minimum distance.

6. How much kinetic energy must an alpha particle have before
   its distance of closest approach to a gold nucleus is equal to
   the nuclear radius ($7.0 \times 10^{-15}$ m)?

7. What is the distance of closest approach when alpha particles
   of kinetic energy 6.0 MeV are scattered by a thin copper foil?

8. Protons of energy 5.0 MeV are incident on a silver foil of
   thickness $4.0 \times 10^{-6}$ m. What fraction of the incident
   protons is scattered at angles: (a) Greater than 90°? (b) Greater
   than 10°? (c) Between 5° and 10°? (d) Less than 5°?

9. Protons are incident on a copper foil 12 μm thick. (a) What
   should the proton kinetic energy be in order that the
   distance of closest approach equal the nuclear radius (5.0 fm)?
   (b) If the proton energy were 7.5 MeV, what is the impact
   parameter for scattering at 120°? (c) What is the minimum
   distance between proton and nucleus for this case? (d) What
   fraction of the protons is scattered beyond 120°?

10. Alpha particles of kinetic energy K are scattered either from
    a gold foil or a silver foil of identical thickness. What is the
    ratio of the number of particles scattered at angles greater
    than 90° by the gold foil to the same number for the silver
    foil?

11. The maximum kinetic energy given to the target nucleus will
    occur in a head-on collision with b = 0. (Why?) Estimate
    the maximum kinetic energy given to the target nucleus
    when 8.0 MeV alpha particles are incident on a gold foil.
    Are we justified in neglecting this energy?

12. The maximum kinetic energy that an alpha particle can
    transmit to an electron occurs during a head-on collision.
    Compute the kinetic energy lost by an alpha particle of
    kinetic energy 8.0 MeV in a head-on collision with an
    electron at rest. Are we justified in neglecting this energy in the
    Rutherford theory?

13. Alpha particles of energy 9.6 MeV are incident on a silver
    foil of thickness 7.0 μm. For a certain value of the impact
    parameter, the alpha particles lose exactly half their incident
    kinetic energy when they reach their minimum separation
    from the nucleus. Find the minimum separation, the impact
    parameter, and the scattering angle.

14. Alpha particles of kinetic energy 6.0 MeV are incident at
    a rate of $3.0 \times 10^{7}$ per second on a gold foil of thickness
    $3.0 \times 10^{-6}$ m. A circular detector of diameter 1.0 cm is
    placed 12 cm from the foil at an angle of 30° with the
    direction of the incident alpha particles. At what rate does
    the detector measure scattered alpha particles?


6.4 Line Spectra

15. The shortest wavelength of the hydrogen Lyman series is
    91.13 nm. Find the three longest wavelengths in this series.

16. One of the lines in the Brackett series (series limit = 1458 nm)
    has a wavelength of 1944 nm. Find the next higher and next
    lower wavelengths in this series.

17. The longest wavelength in the Pfund series is 7459 nm. Find
    the series limit.


6.5 The Bohr Model

18. In the n = 3 state of hydrogen, find the electron’s velocity,
    kinetic energy, and potential energy.

19. Use the Bohr theory to find the series wavelength limits of
    the Lyman and Paschen series of hydrogen.

20. (a) Show that the speed of an electron in the nth Bohr
    orbit of hydrogen is $\alpha c/n$, where $\alpha$ is the fine structure
    constant, equal to $e^2/4\pi\varepsilon_0\hbar c$. (b) What would be the speed
    in a hydrogenlike atom with a nuclear charge of Ze?

21. An electron is in the n = 5 state of hydrogen. To what states
    can the electron make transitions, and what are the energies
    of the emitted radiations?

22. Continue Figure 6.20, showing the transitions of the Paschen
    series and computing their energies and wavelengths.

23. A collection of hydrogen atoms in the ground state is
    illuminated with ultraviolet light of wavelength 59.0 nm. Find
    the kinetic energy of the emitted electrons.

24. Find the ionization energy of: (a) the n = 3 level of
    hydrogen; (b) the n = 2 level of $\mathrm{He}^{+}$ (singly ionized helium); (c)
    the n = 4 level of $\mathrm{Li}^{2+}$ (doubly ionized lithium).

25. Use the Bohr formula to find the energy differences
    $E(n_1 \to n_2) = E_{n_1} - E_{n_2}$ and show that
    (a) $E(4 \to 2) = E(4 \to 3) + E(3 \to 2)$;
    (b) $E(4 \to 1) = E(4 \to 2) + E(2 \to 1)$.
    (c) Interpret these results based on the Ritz
    combination principle.

26. Find the shortest and the longest wavelengths of the Lyman
    series of singly ionized helium.

27. Draw an energy-level diagram showing the lowest four
    levels of singly ionized helium. Show all possible transitions
    from the levels and label each transition with its wavelength.

28. A long time ago, in a galaxy far, far away, electric charge
    had not yet been invented, and atoms were held together
    by gravitational forces. Compute the Bohr radius and the
    n = 2 to n = 1 transition energy in a gravitationally bound
    hydrogen atom.

29. An alternative development of the Bohr theory begins by
    assuming that the stationary states are those for which the
    circumference of the orbit is an integral number of de Broglie
    wavelengths. (a) Show that this condition leads to standing
    de Broglie waves around the orbit. (b) Show that this
    condition gives the angular momentum condition, Eq. 6.26, used
    in the Bohr theory.


6.6 The Franck-Hertz Experiment

30. A hypothetical atom has only two excited states, at 4.0 and
    7.0 eV, and has a ground-state ionization energy of 9.0 eV.
    If we used a vapor of such atoms for the Franck-Hertz
    experiment, for what voltages would we expect to see decreases
    in the current? List all voltages up to 20 V.

31. The first excited state of sodium decays to the ground state
    by emitting a photon of wavelength 590 nm. If sodium vapor
    is used for the Franck-Hertz experiment, at what voltage will
    the first current drop be recorded?


6.7 The Correspondence Principle

32. Suppose all of the excited levels of hydrogen had lifetimes
    of $10^{-8}$ s. As we go to higher and higher excited states, they
    get closer and closer together, and soon they are so close
    in energy that the energy uncertainty of each state becomes
    as large as the energy spacing between states, and we can
    no longer resolve individual states. Find the value of n for
    which this occurs. What is the radius of such an atom?

33. Compare the frequency of revolution of an electron with
    the frequency of the photons emitted in transitions from
    n to n − 1 for (a) n = 10; (b) n = 100; (c) n = 1000;
    (d) n = 10,000.


6.8 Deficiencies of the Bohr Model

34. What is the difference in wavelength between the first line of
    the Balmer series in ordinary hydrogen (M = 1.007825 u)
    and in “heavy” hydrogen (M = 2.014102 u)?
General Problems

35. A hydrogen atom is in the n = 6 state. (a) Counting all
    possible paths, how many different photon energies can be
    emitted if the atom ends up in the ground state? (b) Suppose
    only $\Delta n = 1$ transitions were allowed. How many different
    photon energies would be emitted? (c) How many different
    photon energies would occur in a Thomson-model hydrogen
    atom?

36. An electron is in the n = 8 level of ionized helium.
    (a) Find the three longest wavelengths that are emitted
    when the electron makes a transition from the n = 8 level to
    a lower level. (b) Find the shortest wavelength that can be
    emitted. (c) Find the three longest wavelengths at which the
    electron in the n = 8 level will absorb a photon and move
    to a higher state, if we could somehow keep it in that level
    long enough to absorb. (d) Find the shortest wavelength that
    can be absorbed.

37. The lifetimes of the levels in a hydrogen atom are of the
    order of $10^{-8}$ s. Find the energy uncertainty of the first
    excited state and compare it with the energy of the state.

38. The following wavelengths are found among the many
    radiations emitted by singly ionized helium: 24.30 nm, 25.63 nm,
    102.5 nm, 320.4 nm. If we group the transitions in helium
    as we did in hydrogen by identifying the final state $n_0$ and
    initial state n, to which series does each transition belong?

39. Adjacent wavelengths 72.90 nm and 54.00 nm are found in
    one series of transitions among the radiations emitted by
    doubly-ionized lithium. Find the value of $n_0$ for this series
    and find the next wavelength in the series.

40. When an atom emits a photon in a transition from a
    state of energy $E_1$ to a state of energy $E_2$, the photon
    energy is not precisely equal to $E_1 - E_2$. Conservation of
    momentum requires that the atom must recoil, and so some
    energy must go into recoil kinetic energy $K_R$. Show that
    $K_R \cong (E_1 - E_2)^2/2Mc^2$ where M is the mass of the atom.
    Evaluate this recoil energy for the n = 2 to n = 1 transition
    of hydrogen.

41. In a muonic atom, the electron is replaced by a negatively
    charged particle called the muon. The muon mass is 207
    times the electron mass. (a) Ignoring the correction for finite
    nuclear mass, what is the shortest wavelength of the Lyman
    series in a muonic hydrogen atom? In what region of the
    electromagnetic spectrum does this belong? (b) How large
    is the correction for the finite nuclear mass in this case? (See
    the discussion at the beginning of Section 6.8.)

42. Consider an atom in which the single electron is replaced
    by a negatively charged muon ($m_\mu = 207 m_e$). What is the
    radius of the first Bohr orbit of a muonic lead atom (Z = 82)?
    Compare with the nuclear radius of about 7 fm.





THE HYDROGEN ATOM IN 
WAVE MECHANICS 








These computer-generated distributions represent the probability to locate the electron in 
the n = 8 state of hydrogen for angular momentum quantum number l = 2 (top) and l = 6

In this chapter we study the solutions of the Schrödinger equation for the
hydrogen atom. We will see that these solutions, which lead to the same energy 
levels calculated in the Bohr model, differ from the Bohr model by allowing for 
the uncertainty in localizing the electron. 

Other deficiencies of the Bohr model are not so easily eliminated by solving 
the Schrödinger equation. First, the so-called “fine structure” of the spectral lines
(the splitting of the lines into close-lying doublets) cannot be explained by our 
solutions; the proper explanation of this effect requires the introduction of a new 
property of the electron, the intrinsic spin. Second, the mathematical difficulties 
of solving the Schrödinger equation for atoms containing two or more electrons
are formidable, so we restrict our discussion in this chapter to one-electron atoms, 
in order to see how wave mechanics enables us to understand some basic atomic 
properties. In the next chapter we discuss the structure of many-electron atoms. 


7.1 A ONE-DIMENSIONAL ATOM 


Quantum mechanics gives us a view of the structure of the hydrogen atom that 
is very different from the Bohr model. In the Bohr model, the electron moves 
about the proton in a circular orbit. Quantum mechanics, on the other hand, does 
not allow a fixed radius or a fixed orbital plane but instead describes the electron 
in terms of a probability density, which leads to an uncertainty in locating the 
electron. 

To analyze the hydrogen atom according to quantum mechanics, we must
solve the Schrödinger equation for the Coulomb potential energy of the proton
and the electron:

$$U(r) = -\frac{e^2}{4\pi\varepsilon_0 r} \qquad (7.1)$$

Eventually we will discuss the solutions to this three-dimensional problem for 
the hydrogen atom using spherical polar coordinates, but for now let’s look at the 
simpler one-dimensional problem, in which a proton is fixed at the origin (x = 0) 
and an electron moves along the positive x axis. (This doesn’t represent a real 
atom, but it does show how some properties of electron wave functions in atoms 
emerge from solving the Schrödinger equation.)

In one dimension, the Schrödinger equation for an electron with potential
energy $U(x) = -e^2/4\pi\varepsilon_0 x$ would then be

$$-\frac{\hbar^2}{2m}\frac{d^2\psi}{dx^2} - \frac{e^2}{4\pi\varepsilon_0 x}\,\psi(x) = E\,\psi(x) \qquad (7.2)$$


For a bound state, the wave function must fall to zero as $x \to \infty$. Moreover, in order
for the second term on the left side to remain finite at $x = 0$, the wave function must
be zero at $x = 0$. The simplest function that satisfies both of these requirements is
$\psi(x) = Axe^{-bx}$, where A is the normalization constant. By substituting this trial
wave function into Eq. 7.2, we find a solution when $b = me^2/4\pi\varepsilon_0\hbar^2 = 1/a_0$
(where $a_0$ is the Bohr radius defined in Eq. 6.29). The energy corresponding
to this wave function is $E = -\hbar^2 b^2/2m = -me^4/32\pi^2\varepsilon_0^2\hbar^2$, which happens by
chance to be identical to the energy of the ground state in the Bohr model
(Eq. 6.30 for n = 1).
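
The substitution described above can be checked numerically. The following is a minimal sketch, not part of the original text: the constant values are rounded SI values supplied here as assumptions, and the function names (`decay_constant`, `ground_state_energy_ev`, `residual`) are illustrative. It evaluates b and E from the stated formulas and confirms that the unnormalized trial function x exp(-bx) leaves essentially no residual in Eq. 7.2.

```python
import math

# Rounded SI constants (assumed values, adequate for a consistency check)
HBAR = 1.054571817e-34      # J s
M_E = 9.1093837015e-31      # kg
E_CHARGE = 1.602176634e-19  # C
EPS0 = 8.8541878128e-12     # F/m

def decay_constant():
    """b = m e^2 / (4 pi eps0 hbar^2), the inverse of the Bohr radius."""
    return M_E * E_CHARGE**2 / (4 * math.pi * EPS0 * HBAR**2)

def ground_state_energy_ev():
    """E = -hbar^2 b^2 / 2m, converted from joules to eV."""
    b = decay_constant()
    return -(HBAR * b) ** 2 / (2 * M_E) / E_CHARGE

def residual(x, b, energy_j):
    """Left side minus right side of Eq. 7.2 for psi = x exp(-b x)."""
    psi = x * math.exp(-b * x)
    d2psi = (b * b * x - 2 * b) * math.exp(-b * x)   # analytic second derivative
    coulomb = E_CHARGE**2 / (4 * math.pi * EPS0 * x)
    return -HBAR**2 / (2 * M_E) * d2psi - coulomb * psi - energy_j * psi

b = decay_constant()
print("a0 =", 1 / b, "m")
print("E  =", ground_state_energy_ev(), "eV")
print("residual at x = 1e-10 m:", residual(1e-10, b, ground_state_energy_ev() * E_CHARGE))
```

The residual is not exactly zero only because of floating-point rounding; it should be many orders of magnitude smaller than the individual terms of the equation.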
FIGURE 7.1 Wave functions and probability densities (shaded areas) for an electron bound in a one-dimensional
Coulomb potential energy. The horizontal axis represents the distance between the proton and electron in units of $a_0$.
(a) Ground state. (b) First excited state. (c) Second excited state.


Figure 7.1a shows this wave function and its corresponding probability density
$|\psi(x)|^2$. There is clearly an uncertainty in specifying the location of the electron.
The most probable region to find the electron is near $x = a_0$, but there is a nonzero
probability for the electron to be anywhere in the range $0 < x < \infty$. This is very
different from the Bohr model, in which the distance between the proton and
electron is fixed at the value $a_0$.

Also shown in the figure are wave functions and probability densities cor- 
responding to the first and second excited states. The wave functions have the 
oscillatory or wavelike property that we expect for quantum wave functions. As 
we go to higher excited states, there are more peaks in the probability density and 
the region of maximum probability moves to larger distances. These same features 
emerge from the solution to the three-dimensional problem. From this simple 
one-dimensional calculation (which does not in any way physically represent 
the real three-dimensional hydrogen atom) you can already see how quantum 
mechanics will resolve some of the difficulties associated with the Bohr model. 


Example 7.1

Find the normalization constant of the ground state wave
function for a particle trapped in the one-dimensional
Coulomb potential energy.

Solution

The normalization integral (with $b = 1/a_0$) is

$$\int_0^\infty |\psi(x)|^2\,dx = A^2\int_0^\infty x^2 e^{-2x/a_0}\,dx = 1$$

The integration is in a standard form that is found in integral
tables and that we will have occasion to use frequently in
analyzing the hydrogen wave functions:

$$\int_0^\infty x^n e^{-cx}\,dx = \frac{n!}{c^{n+1}} \qquad (7.3)$$

Using this standard form with $n = 2$ and $c = 2/a_0$, the
normalization integral becomes

$$A^2\,\frac{2!}{(2/a_0)^3} = 1 \quad\text{or}\quad A = 2a_0^{-3/2}$$
Example 7.2

In the ground state of an electron bound in a one-dimensional
Coulomb potential energy, what is the probability
to find the electron located between $x = 0$ and $x = a_0$?

Solution

The probability can be found using Eq. 5.10:

$$P(0{:}a_0) = \int_0^{a_0} |\psi(x)|^2\,dx = \frac{4}{a_0^3}\int_0^{a_0} x^2 e^{-2x/a_0}\,dx$$

with the normalization constant from Example 7.1. The
integral is a standard form that we will later find useful for
analyzing the hydrogen wave functions:

$$\int x^n e^{-cx}\,dx = -\frac{e^{-cx}}{c}\left[x^n + \frac{n x^{n-1}}{c} + \frac{n(n-1)x^{n-2}}{c^2} + \cdots + \frac{n!}{c^n}\right] \qquad (7.4)$$

The probability is then

$$P(0{:}a_0) = \frac{4}{a_0^3}\left[-\frac{e^{-2x/a_0}}{2/a_0}\left(x^2 + \frac{2x}{2/a_0} + \frac{2}{(2/a_0)^2}\right)\right]_0^{a_0} = 1 - 5e^{-2} = 0.323$$
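
As a cross-check on Examples 7.1 and 7.2, the two integrals can also be evaluated numerically. This is a sketch under stated assumptions, not the text's own method: lengths are measured in units of $a_0$ (so that A = 2), the upper limit of 40 stands in for infinity, and the helper names `density` and `simpson` are illustrative.

```python
import math

def density(x):
    """|psi(x)|^2 with x in units of a0, using A = 2 from Example 7.1."""
    return 4.0 * x * x * math.exp(-2.0 * x)

def simpson(f, a, b, n=2000):
    """Composite Simpson's rule with an even number n of subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

norm = simpson(density, 0.0, 40.0)      # approximates the integral to infinity
p_inside = simpson(density, 0.0, 1.0)   # probability for 0 < x < a0
p_exact = 1.0 - 5.0 * math.exp(-2.0)    # closed form obtained from Eq. 7.4

print("normalization:", norm)
print("P(0:a0) numeric:", p_inside, " closed form:", p_exact)
```

The numerical probability agrees with the closed form $1 - 5e^{-2}$, which rounds to the value 0.323 quoted in the example.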





7.2 ANGULAR MOMENTUM IN THE HYDROGEN ATOM





Angular momentum played a significant role in Bohr’s analysis of the structure of 
the hydrogen atom. Bohr was able to obtain the correct energy levels by assuming 
that in the orbit with quantum number n, the angular momentum of the electron 
is equal to $n\hbar$. Bohr’s idea about the “quantization of angular momentum” turned
out to have some correct features, but his analysis is not consistent with the actual 
quantum mechanical nature of angular momentum. 


Angular Momentum of Classical Orbits 


Before considering the angular momentum of an orbiting electron, it is helpful to
review how angular momentum affects classical orbits, such as those of planets
or comets about the Sun. Classically, the angular momentum of a particle is
represented by the vector $\vec{L} = \vec{r} \times \vec{p}$, where $\vec{r}$ is the position vector that locates
the particle and $\vec{p}$ is its linear momentum. The direction of $\vec{L}$ is perpendicular
to the plane of the orbit. Along with the energy, the angular momentum remains
constant as the planet orbits.

The total energy of the orbital motion determines the average distance of the
planet from the Sun. For a given total energy, many different orbits are possible,
from the nearly circular orbit of the Earth to the highly elongated elliptical orbits
of the comets. These orbits differ in their angular momentum L, which is largest
for the circular orbit and smallest for the elongated ellipse. Figure 7.2 shows
a variety of planetary orbits having the same total energy but different angular
momentum. The complete specification of the orbit requires that we give not
only the magnitude of the angular momentum vector but also its direction; this
direction identifies the plane of the orbit. To completely describe the angular
momentum vector requires three numbers; for example, we might give the three
components of $\vec{L}$ ($L_x$, $L_y$, $L_z$). Equivalently, we might give the magnitude L of the
vector and two angular coordinates that give its direction (similar to latitude and
longitude on a sphere).

FIGURE 7.2 Planetary orbits of the same energy but different angular
momentum L. As L decreases, the elliptical orbits become longer and thinner.
Angular Momentum in Quantum Mechanics


Quantum mechanics gives us a very different view of angular momentum. The
angular momentum properties of a three-dimensional wave function are described
by two quantum numbers. The first is the angular momentum quantum number l.
This quantum number determines the length of the angular momentum vector:

$$|\vec{L}| = \sqrt{l(l+1)}\,\hbar \qquad (l = 0, 1, 2, \ldots) \qquad (7.5)$$

Note that this is very different from the Bohr condition $|\vec{L}| = n\hbar$. In particular, it
is possible for the quantum vector to have a length of zero, but in the Bohr model
the minimum length is $\hbar$.

The second number that we use to describe angular momentum in quantum
mechanics is the magnetic quantum number $m_l$. This quantum number tells
us about one component of the angular momentum vector, which we usually
choose to be the z component. The relationship between the z component of
$\vec{L}$ and the magnetic quantum number is

$$L_z = m_l\hbar \qquad (m_l = 0, \pm 1, \pm 2, \ldots, \pm l) \qquad (7.6)$$

Note that for each value of l there are $2l + 1$ possible values of $m_l$.

Unlike the classical angular momentum vector, for which we provide an 
exact specification by giving three numbers, the quantum angular momentum is 
described by only two numbers. Clearly two numbers cannot completely identify 
a vector in three-dimensional space, so something is missing from our description 
of the quantum angular momentum. As we discuss later, this missing part of the 
description of the quantum angular momentum vector is directly related to the 
application of the uncertainty principle to angular momentum. 


Example 7.3

Compute the length of the angular momentum vectors that
represent the orbital motion of an electron in a quantum
state with l = 1 and in another state with l = 2.

Solution

Equation 7.5 gives the relationship between the length of
the vector and the angular momentum quantum number l.
For l = 1,

$$|\vec{L}| = \sqrt{1(1+1)}\,\hbar = \sqrt{2}\,\hbar$$

and for l = 2,

$$|\vec{L}| = \sqrt{2(2+1)}\,\hbar = \sqrt{6}\,\hbar$$


Example 7.4

What are the possible z components of the vector $\vec{L}$ that
represents the orbital angular momentum of a state with
l = 2?

Solution

The possible $m_l$ values for l = 2 are +2, +1, 0, −1,
−2, and so the $\vec{L}$ vector can have any of five possible
z components: $L_z = 2\hbar, \hbar, 0, -\hbar$, or $-2\hbar$. The length of the
vector $\vec{L}$, as we found previously, is $\sqrt{6}\,\hbar$.
FIGURE 7.3 The orientations in space and z components of a vector with
l = 2. There are five different possible orientations.

FIGURE 7.4 The vector $\vec{L}$ precesses rapidly about the z axis, so that $L_z$
stays constant, but $L_x$ and $L_y$ are indeterminate.


The components of the vector $\vec{L}$ for l = 2 are illustrated in Figure 7.3. Each
orientation in space of the vector $\vec{L}$ corresponds to a different $m_l$ value. The polar
angle $\theta$ that the vector $\vec{L}$ makes with the z axis can be found by referring to the
figure. With $L_z = |\vec{L}|\cos\theta$, we have

$$\cos\theta = \frac{L_z}{|\vec{L}|} = \frac{m_l}{\sqrt{l(l+1)}} \qquad (7.7)$$

using Eq. 7.6 for $L_z$ and Eq. 7.5 for $|\vec{L}|$.

This behavior represents a curious aspect of quantum mechanics called spatial
quantization: only certain orientations of angular momentum vectors are
allowed. The number of these orientations is equal to $2l + 1$ (the number of
different possible $m_l$ values) and the magnitudes of their successive z components
always differ by $\hbar$. For example, an angular momentum state with l = 1 can have
$m_l$ values of +1, 0, or −1 (corresponding to z components $L_z = +\hbar, 0, -\hbar$) and
thus $\cos\theta = +1/\sqrt{2}$, 0, or $-1/\sqrt{2}$. The $\vec{L}$ vector in this case can have one of
only three possible orientations relative to the z axis, corresponding to angles of
45°, 90°, or 135°. This is in contrast to a classical angular momentum vector, which
can have any possible orientation in space; that is, the angle between a classical
angular momentum vector and the z axis can take any value between 0 and 180°.
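
The allowed orientations follow directly from Eqs. 7.5 through 7.7 and can be tabulated for any l. The short sketch below is illustrative only; the function name `orientations` is an assumption of this example, and angular momentum is expressed in units of $\hbar$.

```python
import math

def orientations(l):
    """Return (m_l, angle in degrees between L and the z axis) for each allowed m_l.

    Valid for l >= 1; for l = 0 the vector has zero length and no direction.
    """
    length = math.sqrt(l * (l + 1))   # |L| in units of hbar (Eq. 7.5)
    return [(m, math.degrees(math.acos(m / length)))   # Eq. 7.7
            for m in range(l, -l - 1, -1)]

for l in (1, 2):
    print("l =", l)
    for m, angle in orientations(l):
        print("  m_l = %+d   theta = %.1f deg" % (m, angle))
```

For l = 1 this reproduces the three angles quoted above, and for l = 2 it gives the five orientations of Figure 7.3.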


The Angular Momentum Uncertainty Relationship


In quantum mechanics, the maximum amount of permitted information about the
angular momentum vector is its length (given by Eq. 7.5) and its z component
(given by Eq. 7.6). Because the complete description of a vector requires three
numbers, we are always missing some information about the angular momentum
of a quantum state. If we specify $|\vec{L}|$ and $L_z$ exactly, then we have no information
about the other components of $\vec{L}$ ($L_x$ and $L_y$). Any possible outcome of a
measurement of $L_x$ or $L_y$ can therefore occur (as long as $|\vec{L}|^2 = L_x^2 + L_y^2 + L_z^2$).
In graphic terms, we can imagine that the tip of the $\vec{L}$ vector rotates or precesses
about the z axis so that $L_z$ remains fixed but $L_x$ and $L_y$ are undetermined, as in
Figure 7.4. This rotation cannot be directly measured; all we can observe is the
“smeared out” distribution of values of $L_x$ and $L_y$.

There is thus an uncertainty or indeterminacy in specifying $\vec{L}$ that is summarized
by another form of the uncertainty principle:

$$\Delta L_z\,\Delta\phi \geq \hbar \qquad (7.8)$$

where $\phi$ is the azimuthal angle shown in Figure 7.4. If we know $L_z$ exactly
($\Delta L_z = 0$), then we have no knowledge at all of the angle $\phi$; all values are
equally probable. This is equivalent to saying that we know nothing at all about
$L_x$ and $L_y$; whenever one component of $\vec{L}$ is determined, the other components
are completely undetermined.

On the other hand, if we try to construct an angular momentum state in which a
different component (for example, $L_x$) is completely specified (so that $\phi$ would
be known), the state becomes a mixture or superposition of different $L_z$ values. In
effect, we can reduce the uncertainty in $\phi$ only at the expense of increasing the
uncertainty in $L_z$. This is exactly the same type of behavior that was described by
the other forms of the uncertainty principle; for example, reducing the uncertainty
in x is always accompanied by an increase in the uncertainty in $p_x$.

From this discussion you can see why the length of the angular momentum is
defined according to Eq. 7.5 and why, for example, we could not have simply
defined the length as $|\vec{L}| = l\hbar$. If this were possible, then when $m_l$ had its
maximum value ($m_l = +l$), we would have $L_z = m_l\hbar = l\hbar$; the length of the
vector would then be equal to its z component, and so it must lie along the
z axis with $L_x = L_y = 0$. However, this simultaneous exact knowledge of all
three components of $\vec{L}$ violates the angular momentum form of the uncertainty
principle, and therefore this situation is not permitted to occur. It is therefore
necessary for the length of $\vec{L}$ to be greater than $l\hbar$.
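
To see how much longer than $l\hbar$ the vector is, one can compute the smallest angle it can make with the z axis, which occurs for $m_l = +l$. The sketch below is an illustration added here, not part of the original argument; the function name `min_tilt_degrees` is an assumption. It shows that the angle is never zero, although it shrinks as l grows.

```python
import math

def min_tilt_degrees(l):
    """Smallest angle between L and the z axis, reached when m_l = +l (l >= 1)."""
    return math.degrees(math.acos(l / math.sqrt(l * (l + 1))))

for l in (1, 2, 10, 100, 1000):
    # sqrt(l(l+1)) always exceeds l, so the argument of acos stays below 1
    print("l = %4d   |L|/hbar = %.4f   minimum tilt = %.3f deg"
          % (l, math.sqrt(l * (l + 1)), min_tilt_degrees(l)))
```

Because the tilt never vanishes, $L_x$ and $L_y$ are never both forced to zero, consistent with the uncertainty argument above.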


7.3 THE HYDROGEN ATOM WAVE FUNCTIONS 


To find the complete spatial description of the electron in a hydrogen atom,
we must obtain three-dimensional wave functions. The Schrödinger equation in
three-dimensional Cartesian coordinates has the following form:

$$-\frac{\hbar^2}{2m}\left(\frac{\partial^2\psi}{\partial x^2} + \frac{\partial^2\psi}{\partial y^2} + \frac{\partial^2\psi}{\partial z^2}\right) + U(x,y,z)\,\psi(x,y,z) = E\,\psi(x,y,z) \qquad (7.9)$$

where $\psi$ is a function of x, y, and z. The usual procedure for solving a partial
differential equation of this type is to separate the variables by replacing a function
of three variables with the product of three functions of one variable; for example,
$\psi(x,y,z) = X(x)Y(y)Z(z)$. However, the Coulomb potential energy (Eq. 7.1)
written in Cartesian coordinates, $U(x,y,z) = -e^2/4\pi\varepsilon_0\sqrt{x^2 + y^2 + z^2}$, does not
lead to a separable solution.

For this calculation, it is more convenient to work in spherical polar coordinates 
$(r, \theta, \phi)$ instead of Cartesian coordinates $(x, y, z)$. The variables of spherical 
polar coordinates are illustrated in Figure 7.5. This simplification in the solution 
is at the expense of an increased complexity of the Schrödinger equation, which 
becomes: 


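$$-\frac{\hbar^2}{2m}\left[\frac{\partial^2\psi}{\partial r^2} + \frac{2}{r}\frac{\partial\psi}{\partial r} + \frac{1}{r^2\sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\,\frac{\partial\psi}{\partial\theta}\right) + \frac{1}{r^2\sin^2\theta}\frac{\partial^2\psi}{\partial\phi^2}\right] + U(r)\,\psi(r,\theta,\phi) = E\,\psi(r,\theta,\phi) \tag{7.10}$$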


where now $\psi$ is a function of the spherical polar coordinates $r$, $\theta$, and $\phi$. When 
the potential energy depends only on $r$ (and not on $\theta$ or $\phi$), as is the case for the 
Coulomb potential energy, we can find solutions that are separable and can be 
factored as 





$$\psi(r,\theta,\phi) = R(r)\,\Theta(\theta)\,\Phi(\phi) \tag{7.11}$$

where the radial function $R(r)$, the polar function $\Theta(\theta)$, and the azimuthal 
function $\Phi(\phi)$ are each functions of a single variable. This procedure gives three 
differential equations, each of a single variable ($r$, $\theta$, $\phi$). 

FIGURE 7.5 Spherical polar coordinates for the hydrogen atom. The proton is at the 
origin and the electron is at a radius $r$, in a direction determined by the polar angle 
$\theta$ and the azimuthal angle $\phi$. 

The quantum state of a particle that moves in a potential energy that depends 
only on $r$ can be described by angular momentum quantum numbers $l$ and $m_l$. 
The polar and azimuthal solutions are given by combinations of standard 
trigonometric functions. The remaining radial function is then obtained from solving 
the radial equation: 


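$$-\frac{\hbar^2}{2m}\left(\frac{d^2R}{dr^2} + \frac{2}{r}\frac{dR}{dr}\right) + \left(-\frac{e^2}{4\pi\varepsilon_0 r} + \frac{l(l+1)\hbar^2}{2mr^2}\right)R(r) = E\,R(r) \tag{7.12}$$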


The mass that appears in this equation is the reduced mass of the proton-electron 
system defined in Eq. 6.44. 
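
To make the radial equation more concrete, here is a minimal Python sketch that checks numerically that the ground-state radial function satisfies Eq. 7.12. It assumes the standard normalized form $R_{1,0}(r) = 2a_0^{-3/2}e^{-r/a_0}$ and works in units where lengths are measured in $a_0$ and energies in $\hbar^2/ma_0^2$ (about 27.2 eV); in those units $e^2/4\pi\varepsilon_0 r$ becomes $1/r$. The helper names are illustrative only.

```python
import math

def R10(r):
    """Ground-state radial function (n = 1, l = 0), with r in units of a0."""
    return 2.0 * math.exp(-r)

def local_energy(R, r, l, h=1e-4):
    """Left side of Eq. 7.12 divided by R(r), using central differences.

    Units: lengths in a0, energies in hbar^2/(m a0^2).
    """
    d1 = (R(r + h) - R(r - h)) / (2 * h)            # dR/dr
    d2 = (R(r + h) - 2 * R(r) + R(r - h)) / h**2    # d^2R/dr^2
    kinetic = -0.5 * (d2 + 2.0 * d1 / r)
    potential = (-1.0 / r + l * (l + 1) / (2.0 * r**2)) * R(r)
    return (kinetic + potential) / R(r)

# If R10 solves the equation, the ratio is the same constant E at every r.
for r in (0.5, 1.0, 3.0):
    print(r, round(local_energy(R10, r, 0), 5))
```

The ratio comes out the same at every radius, close to $-0.5$ in these units, which corresponds to about $-13.6$ eV.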


Quantum Numbers and Wave Functions 


When we solve a three-dimensional equation such as the Schrödinger equation, 
three parameters emerge in a natural way as indices or labels for the solutions, just 
as the single index $n$ emerged from our solution of the one-dimensional infinite 
well in Section 5.4. These indices are the three quantum numbers that label the 
solutions. The three quantum numbers that emerge from the solutions and their 
allowed values are: 


$n$      principal quantum number            $1, 2, 3, \ldots$ 
$l$      angular momentum quantum number     $0, 1, 2, \ldots, n-1$ 
$m_l$    magnetic quantum number             $0, \pm 1, \pm 2, \ldots, \pm l$ 





The principal quantum number $n$ is identical to the quantum number $n$ that we 
obtained in the Bohr model. It determines the quantized energy levels: 


$$E_n = -\frac{me^4}{32\pi^2\varepsilon_0^2\hbar^2}\,\frac{1}{n^2} \tag{7.13}$$


which is identical to Eq. 6.30. Note that the energy depends only on $n$ and not 
on the other quantum numbers $l$ or $m_l$. The permitted values of the angular 
momentum quantum number $l$ are limited by $n$ ($l$ ranges from 0 to $n - 1$) and 
those of the magnetic quantum number $m_l$ are limited by $l$. 
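
The energy formula can be evaluated directly. The sketch below is only an illustration: the constants are rounded CODATA values, the function name is hypothetical, and the default mass is the electron mass rather than the reduced mass, which changes the result by about 0.05%.

```python
import math

# Physical constants in SI units (rounded CODATA values).
M_E = 9.1093837e-31          # electron mass, kg
E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
HBAR = 1.054571817e-34       # reduced Planck constant, J s

def energy_eV(n, mass=M_E):
    """Hydrogen energy level E_n from Eq. 7.13, converted to electron-volts."""
    e_joule = -mass * E_CHARGE**4 / (32 * math.pi**2 * EPS0**2 * HBAR**2) / n**2
    return e_joule / E_CHARGE

for n in (1, 2, 3):
    print(n, round(energy_eV(n), 3))
```

The $n = 1$ value is close to $-13.6$ eV, and each higher level is smaller by the factor $1/n^2$, independent of $l$ and $m_l$.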

Complete with quantum numbers, the separated solutions of Eq. 7.10 can be 
written 


$$\psi_{n,l,m_l}(r,\theta,\phi) = R_{n,l}(r)\,\Theta_{l,m_l}(\theta)\,\Phi_{m_l}(\phi) \tag{7.14}$$


The indices $(n, l, m_l)$ are the three quantum numbers that are necessary to describe 
the solutions. Wave functions corresponding to some values of the quantum 
numbers are shown in Table 7.1. The wave functions are written in terms of the 
Bohr radius $a_0$ defined in Eq. 6.29. 

For the ground state ($n = 1$), only $l = 0$ and $m_l = 0$ are allowed. The complete 
set of quantum numbers for the ground state is then $(n, l, m_l) = (1, 0, 0)$, and the 
wave function for this state is given in the first line of Table 7.1. The first excited 
state ($n = 2$) can have $l = 0$ or $l = 1$. For $l = 0$, only $m_l = 0$ is allowed. This state 
has quantum numbers (2, 0, 0), and its wave function is given in the second line 
of Table 7.1. For $l = 1$, we can have $m_l = 0$ or $\pm 1$. There are thus three possible 
sets of quantum numbers: (2, 1, 0) and (2, 1, $\pm 1$). The wave functions for these 
states are given in the third and fourth lines of Table 7.1. The second excited state 
($n = 3$) can have $l = 0$ ($m_l = 0$), $l = 1$ ($m_l = 0, \pm 1$), or $l = 2$ ($m_l = 0, \pm 1, \pm 2$). 

TABLE 7.1 Some Hydrogen Atom Wave Functions 

For the $n = 2$ level, there are four different possible sets of quantum numbers 
and correspondingly four different wave functions. All of these wave functions 
correspond to the same energy, so the $n = 2$ level is degenerate. (Degeneracy was 
introduced in Section 5.4.) The $n = 3$ level is degenerate with nine possible sets 
of quantum numbers. In general, the level with principal quantum number $n$ has a 
degeneracy equal to $n^2$. Figure 7.6 illustrates the labeling of the first three levels. 
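
The counting rule can be checked by enumerating the allowed labels. This short Python sketch (the function name is hypothetical) lists every $(n, l, m_l)$ permitted by the rules above and confirms that the count equals $n^2$.

```python
def states(n):
    """All (n, l, m_l) sets allowed for principal quantum number n."""
    return [(n, l, m_l) for l in range(n) for m_l in range(-l, l + 1)]

for n in (1, 2, 3):
    print(n, len(states(n)), states(n))
```

For $n = 2$ the list contains the four sets (2, 0, 0), (2, 1, $-1$), (2, 1, 0), and (2, 1, 1); for $n = 3$ it contains nine.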

If different combinations of quantum numbers have exactly the same energy, 
what is the purpose of listing them separately? First, as we discuss in the last 
section of this chapter, the levels are not precisely degenerate, but are separated 
by a very small energy (about $10^{-5}$ eV). Second, in the study of the transitions 
between the levels, we find that the intensities of the individual transitions 
depend on the quantum numbers of the particular level from which the transition 
originates. Third, and perhaps most important, each of these sets of quantum 
numbers corresponds to a very different wave function, and therefore represents 
a very different state of motion of the electron. These states have different spatial 
probability distributions for locating the electron, and thus can affect many atomic 
properties, for example, the way two atoms can form molecular bonds. 

FIGURE 7.6 The lower energy levels of hydrogen, labeled with the quantum numbers 
$(n, l, m_l)$. The first excited state is four-fold degenerate and the second excited state 
is nine-fold degenerate. [Level diagram: $n = 1$ with state (1, 0, 0) at $-13.6$ eV; $n = 2$ 
with states (2, 0, 0), (2, 1, 1), (2, 1, 0), (2, 1, $-1$); $n = 3$ with states (3, 0, 0), (3, 1, 1), 
(3, 1, 0), (3, 1, $-1$), (3, 2, 2), (3, 2, 1), (3, 2, 0), (3, 2, $-1$), (3, 2, $-2$).] 

FIGURE 7.7 The radial wave functions of the $n = 1$, $n = 2$, and $n = 3$ states of hydrogen. 
The radius coordinate is measured in units of $a_0$. 

FIGURE 7.8 The volume element in spherical polar coordinates. 

The radial wave functions for the states listed in Table 7.1 are plotted in 
Figure 7.7. You can readily see the differences in the motion of the electron for 
the different states. For example, in the $n = 2$ level, the $l = 0$ and $l = 1$ wave 
functions have the same energy but their behavior is very different: the $l = 1$ wave 
function falls to zero at $r = 0$, but the $l = 0$ wave function remains nonzero at 
$r = 0$. The $l = 0$ electron thus has a much greater probability of being found close 
to (or even inside) the nucleus, which turns out to play a large role in determining 
the rates for certain radioactive decay processes. 


Probability Densities 


As we learned in Chapter 5, the probability of finding the electron in any spatial 
interval is determined by the square of the wave function. For the hydrogen atom, 
$|\psi(r,\theta,\phi)|^2$ gives the volume probability density (probability per unit volume) at 
the location $(r, \theta, \phi)$. To compute the actual probability of finding the electron, 
we multiply the probability per unit volume by the volume element $dV$ located at 
$(r, \theta, \phi)$. In spherical polar coordinates (see Figure 7.8) the volume element is 


$$dV = r^2 \sin\theta\,dr\,d\theta\,d\phi \tag{7.15}$$


and therefore the probability to find the electron in the volume element at that 
location is 


$$|\psi_{n,l,m_l}(r,\theta,\phi)|^2\,dV = |R_{n,l}(r)|^2\,|\Theta_{l,m_l}(\theta)|^2\,|\Phi_{m_l}(\phi)|^2\,r^2 \sin\theta\,dr\,d\theta\,d\phi \tag{7.16}$$

FIGURE 7.9 Representations of $|\psi|^2$ for different sets of quantum numbers. The z axis 
is the vertical direction. The diagrams represent surfaces on which the probability has 
the same value. [Panels include $l = 0, m_l = 0$; $l = 2, m_l = 0$; $l = 2, m_l = 1$; $l = 2, m_l = 2$.] 


Some representations of the probability density $|\psi(r,\theta,\phi)|^2$ are shown in 
Figure 7.9. We can regard these illustrations as representing the “smeared out” 
distribution of electronic charge in the atom, which results from the uncertainty 
in the electron’s location. They also represent the statistical outcomes of a large 
number of measurements of the location of the electron in the atom. These spatial 
distributions have important consequences for the structure of atoms with many 
electrons, which is discussed in Chapter 8, and also for the joining of atoms into 
molecules, which is discussed in Chapter 9. 

In the next two sections, we will separately examine how the probability 
density depends on the radial coordinate and on the angular coordinates. 


7.4 RADIAL PROBABILITY DENSITIES 





Instead of asking about the complete probability density to locate the electron, 
we might want to know the probability to find the electron at a particular distance 
from the nucleus, no matter what the values of $\theta$ and $\phi$ might be. That is, imagine 
a thin spherical shell of radius $r$ and thickness $dr$. What is the probability to find 
the electron in the shell between spheres of radius $r$ and $r + dr$? We define the 
radial probability density $P(r)$ so that the probability to find the electron within 
that shell is $P(r)\,dr$. We can determine the radial probability from the complete 
probability (Eq. 7.16) by integrating over the $\theta$ and $\phi$ coordinates. In effect, this 
adds up the probabilities for the volume elements at a given $r$ for all $\theta$ and $\phi$. 


$$P(r)\,dr = |R_{n,l}(r)|^2\,r^2\,dr \int_0^{\pi} |\Theta_{l,m_l}(\theta)|^2 \sin\theta\,d\theta \int_0^{2\pi} |\Phi_{m_l}(\phi)|^2\,d\phi \tag{7.17}$$


The $\theta$ and $\phi$ integrals are each equal to unity, because each of the functions $R$, $\Theta$, 
and $\Phi$ is individually normalized. Thus the radial probability density is 

$$P(r) = r^2\,|R_{n,l}(r)|^2 \tag{7.18}$$


Figure 7.10 shows this function for several of the lowest levels of hydrogen. 

Note that, because of the $r^2$ factor, $P(r)$ must be zero at $r = 0$ even though 
$R(r)$ might not. That is, the probability to locate the electron in a spherical shell 
always goes to zero as $r \to 0$ because the volume of the shell goes to zero, but 
the probability density $|\psi|^2$ may be nonzero at $r = 0$. Moreover, $P(r)$ and $|R(r)|^2$ 
convey different information about the electron’s behavior, as you can see by 
comparing Figures 7.7 and 7.10. For example, the radial wave function $R(r)$ for 
$n = 1$, $l = 0$ has its maximum at $r = 0$, but the radial probability density for that 
state has its maximum at $r = a_0$. 
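
The contrast between $R(r)$ and $P(r)$ is easy to see numerically. The following sketch assumes the standard ground-state radial function $R_{1,0}(r) = 2a_0^{-3/2}e^{-r/a_0}$, measures $r$ in units of $a_0$, and simply searches a fine grid for the maxima; the variable names are illustrative.

```python
import math

def R10(r):
    """Radial wave function for n = 1, l = 0 (r in units of a0)."""
    return 2.0 * math.exp(-r)

def P10(r):
    """Radial probability density P(r) = r^2 |R(r)|^2 from Eq. 7.18."""
    return r**2 * R10(r)**2

grid = [i * 0.001 for i in range(10001)]   # r from 0 to 10 a0
r_peak_R = max(grid, key=R10)              # where the wave function is largest
r_peak_P = max(grid, key=P10)              # where the radial probability is largest
print(r_peak_R, r_peak_P, P10(0.0))
```

The wave function is largest at the origin, whereas the radial probability density vanishes there and peaks at one Bohr radius.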

FIGURE 7.10 The radial probability density $P(r)$ for the $n = 1$, $n = 2$, and $n = 3$ states 
of hydrogen. The radius coordinate is measured in units of $a_0$. The markers on the 
horizontal axis show the values of the average radius $r_{\rm av}$, labeled with the value of $l$. 

Using the radial probability densities, it is possible to find the average value 
of the radial coordinate, that is, the average distance between the proton and 
the electron (see Problems 30 and 31). These values are indicated by markers 
in Figure 7.10. Notice that the average radial coordinate is about $1.5a_0$ for the 
$n = 1$ wave function and is much greater, about $5a_0$, for both of the $n = 2$ wave 
functions. The average radius is greater still, about $12a_0$, for the $n = 3$ states. 
It appears from these graphs that the average radius depends mostly on $n$ and 
not very much on $l$. The principal quantum number $n$ thus determines not only 
the energy level of the electron, it also determines to a great extent the average 
distance of the electron from the nucleus. As in the Bohr model, this average 
radius varies roughly as $n^2$, so that an $n = 2$ electron is on the average about 4 
times farther from the nucleus than an $n = 1$ electron, an $n = 3$ electron is about 
9 times farther from the nucleus than an $n = 1$ electron, and so forth. 
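
The average radius can be estimated by numerical integration of $r\,P(r)$. The sketch below is a rough illustration: it assumes the standard normalized radial functions for the $n = 1$ and $n = 2$ states (written with $r$ in units of $a_0$), uses a hand-written Simpson rule, and truncates the integrals at $60a_0$. All names are hypothetical.

```python
import math

def simpson(f, a, b, n=4000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

# Radial functions R_{n,l}(r), r in units of a0 (assumed standard forms).
RADIAL = {
    (1, 0): lambda r: 2.0 * math.exp(-r),
    (2, 0): lambda r: (2.0 - r) * math.exp(-r / 2) / (2 * math.sqrt(2)),
    (2, 1): lambda r: r * math.exp(-r / 2) / (2 * math.sqrt(6)),
}

def P(n, l, r):
    """Radial probability density r^2 |R|^2 (Eq. 7.18)."""
    return r**2 * RADIAL[(n, l)](r)**2

def norm(n, l):
    """Integral of P(r) dr; should be 1 for a normalized state."""
    return simpson(lambda r: P(n, l, r), 0.0, 60.0)

def r_avg(n, l):
    """Average radius, the integral of r P(r) dr, in units of a0."""
    return simpson(lambda r: r * P(n, l, r), 0.0, 60.0)

for key in RADIAL:
    print(key, round(norm(*key), 4), round(r_avg(*key), 3))
```

With these functions the integrals give $1.5a_0$ for the $n = 1$ state and $6a_0$ and $5a_0$ for the $n = 2$ states with $l = 0$ and $l = 1$, in line with the closed form $\frac{1}{2}a_0[3n^2 - l(l+1)]$, so the dependence on $l$ is modest compared with the dependence on $n$.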

Another measure of the location of the electron is its most probable radius, 
determined from the location at which $P(r)$ has its maximum value. For each 
$n$, $P(r)$ for the state with $l = n - 1$ has only a single maximum, which occurs at 
the location of the Bohr orbit, $r = n^2a_0$. The following example illustrates this for 
the $n = 2$ state. 


Example 7.5 

Prove that the most likely distance from the origin of an 
electron in the $n = 2$, $l = 1$ state is $4a_0$. 

Solution 
In the $n = 2$, $l = 1$ level, the radial probability density is 

$$P(r) = r^2|R_{2,1}(r)|^2 = r^2\,\frac{1}{24a_0^3}\,\frac{r^2}{a_0^2}\,e^{-r/a_0}$$

We wish to find where this function has its maximum; in 
the usual fashion, we take the first derivative of $P(r)$ and 
set it equal to zero: 

$$\frac{dP(r)}{dr} = \frac{1}{24a_0^5}\,\frac{d}{dr}\left(r^4 e^{-r/a_0}\right) = \frac{1}{24a_0^5}\left[4r^3 e^{-r/a_0} + r^4\left(-\frac{1}{a_0}\right)e^{-r/a_0}\right] = 0$$

or 

$$\frac{1}{24a_0^5}\,e^{-r/a_0}\left(4r^3 - \frac{r^4}{a_0}\right) = 0$$

The only solution that yields a maximum is $r = 4a_0$. 
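
The result of Example 7.5 can be cross-checked numerically. In this sketch $r$ is measured in units of $a_0$, the derivative is the expression obtained in the example, and a simple bisection (a helper written here for illustration, not taken from the text) locates the point where the derivative changes sign.

```python
import math

def P21(r):
    """Radial probability density for n = 2, l = 1 (r in units of a0)."""
    return r**4 * math.exp(-r) / 24.0

def dP21(r):
    """Derivative from Example 7.5: e^{-r} (4 r^3 - r^4) / 24."""
    return math.exp(-r) * (4 * r**3 - r**4) / 24.0

def bisect_root(f, lo, hi, tol=1e-12):
    """Locate a sign change of f between lo and hi by bisection."""
    f_lo = f(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        f_mid = f(mid)
        if (f_lo > 0) == (f_mid > 0):
            lo, f_lo = mid, f_mid   # sign change lies in the upper half
        else:
            hi = mid                # sign change lies in the lower half
    return 0.5 * (lo + hi)

r_max = bisect_root(dP21, 1.0, 10.0)
print(r_max, P21(r_max))
```

The derivative is positive below the root and negative above it, so the root found near $r = 4a_0$ is a maximum; the other root, $r = 0$, is a minimum of $P(r)$.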





Example 7.6 

For the $n = 2$ states ($l = 0$ and $l = 1$), compare the probabilities 
of the electron being found inside the Bohr radius. 

Solution 
For the $n = 2$, $l = 0$ level, 

$$P(r)\,dr = r^2|R_{2,0}(r)|^2\,dr = \frac{r^2}{8a_0^3}\left(2 - \frac{r}{a_0}\right)^2 e^{-r/a_0}\,dr$$

The total probability of finding the electron between $r = 0$ and $r = a_0$ is 

$$P(0:a_0) = \int_0^{a_0} P(r)\,dr$$

Evaluating the integrals using Eq. 7.4, we obtain 

$$P(0:a_0) = 0.034$$

For the $n = 2$, $l = 1$ level, 

$$P(r)\,dr = r^2|R_{2,1}(r)|^2\,dr = \frac{r^2}{24a_0^3}\,\frac{r^2}{a_0^2}\,e^{-r/a_0}\,dr$$

The total probability between $r = 0$ and $r = a_0$ is 

$$P(0:a_0) = \int_0^{a_0} P(r)\,dr = \frac{1}{24a_0^3}\int_0^{a_0} \frac{r^4}{a_0^2}\,e^{-r/a_0}\,dr = 0.0037$$
The results of Example 7.6 are consistent with the radial probability densities 
shown in Figure 7.10: in the $l = 0$ state, the probability of finding the electron 
inside $a_0$ is about 10 times larger than in the $l = 1$ state, as suggested by the small 
peak in the radial probability density for $n = 2$, $l = 0$ at small $r$. There is clearly 
more area under the $P(r)$ curve between $r = 0$ and $r = a_0$ for $n = 2$, $l = 0$ than 
there is for $n = 2$, $l = 1$. 
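
The two probabilities of Example 7.6 can be reproduced with a short numerical integration. This sketch uses the radial probability densities quoted in the example (with $r$ in units of $a_0$) and a hand-written Simpson rule; the function names are illustrative.

```python
import math

def simpson(f, a, b, n=2000):
    """Composite Simpson rule with n (even) subintervals."""
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3

def P20(r):
    """P(r) for n = 2, l = 0: r^2 (2 - r)^2 e^{-r} / 8, r in units of a0."""
    return r**2 * (2.0 - r)**2 * math.exp(-r) / 8.0

def P21(r):
    """P(r) for n = 2, l = 1: r^4 e^{-r} / 24, r in units of a0."""
    return r**4 * math.exp(-r) / 24.0

p_2s = simpson(P20, 0.0, 1.0)   # probability inside one Bohr radius, l = 0
p_2p = simpson(P21, 0.0, 1.0)   # probability inside one Bohr radius, l = 1
print(p_2s, p_2p, p_2s / p_2p)
```

The integrals agree with the values 0.034 and 0.0037 obtained in the example, and their ratio is a little over 9.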

Curiously, Figure 7.10 also shows that for $n = 2$ the $l = 0$ radial probability 
density is also greater than the $l = 1$ probability density at large $r$. Thus the $l = 0$ 
electron spends more time close to the nucleus than the $l = 1$ electron and it also 
spends more time farther away. This is a general result that holds for any value 
of $n$: the smaller the $l$ value, the larger is the probability to find the electron both 
close to the nucleus and far from the nucleus. The classical planetary orbits of 
Figure 7.2 show the same type of behavior: the orbit with $L = 0$ spends more 
time close to the central body and also more time far away from the central body, 
compared with the orbits that have larger $L$. 


7.5 ANGULAR PROBABILITY DENSITIES 


In this section, we consider the angular part of the probability density, which is 
obtained from the squared magnitudes of the angular parts of the wave function: 


$$P(\theta,\phi) = |\Theta_{l,m_l}(\theta)\,\Phi_{m_l}(\phi)|^2 \tag{7.19}$$


Figure 7.11 shows the angular probability densities for the $l = 0$ and $l = 1$ wave 
functions listed in Table 7.1. 

Note that all of the probability densities are cylindrically symmetric: there 
is no dependence on the azimuthal angle $\phi$. The $l = 0$ wave function is also 
spherically symmetric; that is, the probability density is independent of direction. 

The $l = 1$ probability densities have two distinct shapes. For $m_l = 0$, the 
electron is found primarily in two regions of maximum probability along the 
positive and negative z axis, while for $m_l = \pm 1$, the electron is found primarily 
near the xy plane. For $m_l = 0$, the electron’s angular momentum vector lies in the 
xy plane (Figure 7.3). Classically, the angular momentum vector is perpendicular 
to the orbital plane, so it should not be surprising that the electron is most likely 
to be found in a location away from the xy plane, that is, along the z axis. For 
$m_l = \pm 1$, the angular momentum vector has its maximum projection along the z 
axis; the electron, again orbiting perpendicular to $\vec{L}$, spends most of its time near 
the xy plane. These probability densities for locating the electron are consistent 
with the information given by the orientation of the angular momentum vector, 
and the cylindrical symmetry of the probability densities is consistent with the 
uncertainty in the knowledge of the orientation of $\vec{L}$ represented in Figure 7.4. 

FIGURE 7.11 The angular dependence of the $l = 0$ and $l = 1$ probability densities. 


Example 7.7 

For the $n = 2$, $l = 1$ wave functions, find the direction 
in space at which the maximum probability occurs when 
$m_l = 0$ and when $m_l = \pm 1$. 

Solution 
For $l = 1$, $m_l = 0$ we have 

$$P(\theta,\phi) = |\Theta_{1,0}(\theta)\,\Phi_0(\phi)|^2 = \frac{3}{4\pi}\cos^2\theta$$

To find the location of the maximum, we set $dP/d\theta$ equal to zero: 

$$\frac{dP}{d\theta} = -\frac{3}{4\pi}\left(2\cos\theta\sin\theta\right) = 0$$

There are two solutions to this equation: one for $\cos\theta = 0$, 
for which $\theta = \pi/2$, and another for $\sin\theta = 0$, which gives 
$\theta = 0$ or $\pi$. By taking the second derivative, we find that 
$\theta = \pi/2$ leads to a minimum while $\theta = 0$ or $\pi$ gives the 
maximum. There are thus two regions of maximum probability, 
one along the positive z axis ($\theta = 0$) and another 
along the negative z axis ($\theta = \pi$), as in Figure 7.11. 

For $l = 1$, $m_l = \pm 1$ the angular probability density is 

$$P(\theta,\phi) = |\Theta_{1,\pm 1}(\theta)\,\Phi_{\pm 1}(\phi)|^2 = \frac{3}{8\pi}\sin^2\theta$$

We can then find the location of the maximum: 

$$\frac{dP}{d\theta} = \frac{3}{8\pi}\left(2\sin\theta\cos\theta\right) = 0$$

Once again there are two solutions: $\theta = 0, \pi$ or $\theta = \pi/2$. 
However, in this case the maximum occurs for $\theta = \pi/2$ 
and the probability maximum occurs in the xy plane, as in 
Figure 7.11. 
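
The conclusions of Example 7.7 can be confirmed with a simple scan over the polar angle. The sketch below uses the two angular densities from the example, checks that each integrates to unity over all directions (a midpoint rule written here for illustration), and locates the maxima on a grid of 0.1 degree steps.

```python
import math

def P_m0(theta):
    """Angular probability density for l = 1, m_l = 0."""
    return 3.0 / (4.0 * math.pi) * math.cos(theta)**2

def P_m1(theta):
    """Angular probability density for l = 1, m_l = +1 or -1."""
    return 3.0 / (8.0 * math.pi) * math.sin(theta)**2

def total_probability(P, n=20000):
    """Integrate P over all directions: 2*pi times the integral of P sin(theta)."""
    h = math.pi / n
    s = sum(P((i + 0.5) * h) * math.sin((i + 0.5) * h) for i in range(n))
    return 2.0 * math.pi * s * h

thetas = [i * math.pi / 1800 for i in range(1801)]   # 0 to pi
best_m0 = max(thetas, key=P_m0)   # first maximum found (theta = 0; pi is equivalent)
best_m1 = max(thetas, key=P_m1)
print(math.degrees(best_m0), math.degrees(best_m1))
print(total_probability(P_m0), total_probability(P_m1))
```

The scan places the $m_l = 0$ maximum on the z axis and the $m_l = \pm 1$ maximum at $\theta = \pi/2$, in the xy plane.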





7.6 INTRINSIC SPIN 


One way of observing spatial quantization is to place the atom in an externally 
applied magnetic field. From the interaction between the magnetic field and the 
magnetic dipole moment of the atom (which is related to the electron’s orbital 
angular momentum), it is possible both to observe the separate components of 
$\vec{L}$ and also to determine $l$ by counting the number of z components (which, as 
we have seen, is equal to $2l + 1$). However, when this experiment is done, a 
surprising result emerges that indicates an unexpected property of the electron, 
known as intrinsic spin. 




Orbital Magnetic Dipole Moments 


Figure 7.12 shows a classical magnetic dipole moment, which might be produced 
by a current loop or the orbital motion of a charged object. The classical magnetic 
dipole moment $\vec{\mu}$ is defined as a vector whose magnitude is equal to the product 
of the circulating current and the area enclosed by the orbital loop. The direction 
of $\vec{\mu}$ is perpendicular to the plane of the orbit, determined by the right-hand 
rule: with the fingers in the direction of the conventional (positive) current, the 
thumb indicates the direction of $\vec{\mu}$, as shown in Figure 7.12 for a circulating 
negative charge like an electron. 

FIGURE 7.12 A circulating negative charge is represented as a current loop. 
Because the charge is negative, $\vec{L}$ and $\vec{\mu}$ point in opposite directions. 
FIGURE 7.13 According to quantum mechanics, the vectors can be considered to 
precess around the z axis, and so we can specify only the z components of $\vec{L}$ and $\vec{\mu}$. 





As we have seen, quantum mechanics forbids exact knowledge of the direction 
of $\vec{L}$ and therefore of $\vec{\mu}$. Figure 7.13 suggests the relationship between $\vec{L}$ and 
$\vec{\mu}$ that is consistent with quantum mechanics. Only the z components of these 
vectors can be specified. Because the electron has a negative charge, $\vec{L}$ and $\vec{\mu}$ 
have z components of opposite signs. 

We can use the Bohr model with a circular orbit to obtain the relationship 
between $\vec{L}$ and $\vec{\mu}$, which turns out to be identical with the correct quantum 
mechanical result. We regard the circulating electron as a circular loop of current 
$i = dq/dt = q/T$, where $q$ is the charge of the electron ($-e$) and $T$ is the time for 
one circuit around the loop. If the electron moves with speed $v = p/m$ around 
a loop of radius $r$, then $T = 2\pi r/v = 2\pi rm/p$. The magnitude of the magnetic 
moment is 


$$\mu = iA = \frac{q}{2\pi rm/p}\,\pi r^2 = \frac{q}{2m}\,rp = \frac{q}{2m}\,|\vec{L}| \tag{7.20}$$


with $|\vec{L}| = rp$. Writing Eq. 7.20 in terms of vectors and putting $-e$ for the 
electronic charge, we obtain 

$$\vec{\mu}_L = -\frac{e}{2m}\,\vec{L} \tag{7.21}$$

The negative sign, which is present because the electron has a negative charge, 
indicates that the vectors $\vec{L}$ and $\vec{\mu}_L$ point in opposite directions. The subscript 
$L$ on $\vec{\mu}_L$ reminds us that this magnetic moment arises from the orbital angular 
momentum $\vec{L}$ of the electron. 
The z component of the magnetic moment is 





$$\mu_{L,z} = -\frac{e}{2m}L_z = -\frac{e}{2m}m_l\hbar = -\frac{e\hbar}{2m}m_l = -m_l\mu_B \tag{7.22}$$

The quantity $e\hbar/2m$ is defined to be the Bohr magneton 

$$\mu_B = \frac{e\hbar}{2m} \tag{7.23}$$

The value of $\mu_B$ is 

$$\mu_B = 9.274 \times 10^{-24}\ \text{J/T}$$

The Bohr magneton is a convenient unit for expressing atomic magnetic moments, 
which typically have values of the order of $\mu_B$. 
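
The numerical value of the Bohr magneton follows directly from Eq. 7.23. The sketch below uses rounded CODATA constants and hypothetical function names; it also evaluates the z component of the orbital magnetic moment from Eq. 7.22.

```python
E_CHARGE = 1.602176634e-19   # elementary charge, C
HBAR = 1.054571817e-34       # reduced Planck constant, J s
M_E = 9.1093837e-31          # electron mass, kg

def bohr_magneton():
    """Bohr magneton e*hbar/(2m) in J/T (Eq. 7.23)."""
    return E_CHARGE * HBAR / (2.0 * M_E)

def mu_Lz(m_l):
    """z component of the orbital magnetic moment, -m_l * mu_B (Eq. 7.22)."""
    return -m_l * bohr_magneton()

print(bohr_magneton())
for m_l in (-1, 0, 1):
    print(m_l, mu_Lz(m_l))
```

Note the sign: a state with positive $m_l$ has a negative $\mu_{L,z}$, reflecting the negative charge of the electron.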


A Dipole in an External Field 


Before we consider further the behavior of $\vec{\mu}_L$, we discuss the similar behavior of 
an electric dipole, which consists of two equal and opposite charges $q$ separated 
by a distance $r$. The electric dipole moment $\vec{p}$ has magnitude $qr$ and points from 
the negative charge to the positive charge. As shown in Figure 7.14a, in a uniform 
electric field, the vertical forces $F_+$ on the positive charge and $F_-$ on the negative 
charge are of equal magnitude. The dipole experiences a torque that tends to rotate 
it into alignment with $\vec{E}$, but the net force on the dipole is zero. Suppose now 
that the field is not uniform; for example, the field strength decreases from the 
bottom of the figure to the top, as in Figure 7.14b. Now the downward force $F_-$ 
acting on the negative charge is greater than the upward force $F_+$ on the positive 
charge. There is still a net torque that tends to rotate the dipole, but there is also a 
net force that tends to move the dipole, in this case downward. On the other hand, 
if we reverse the locations of the two charges (Figure 7.14c), which is equivalent 
to reversing the electric dipole moment $\vec{p}$, the upward force $F_+$ on the positive 
charge is now greater than the downward force $F_-$ on the negative charge, so the 
net force on the dipole is upward. 

FIGURE 7.14 (a) An electric dipole in a uniform electric field $\vec{E}$ experiences no net 
force. (b) In a nonuniform electric field (decreasing from the bottom of the figure to 
the top), the force $F_-$ is greater than the force $F_+$. There is a net downward force on 
the dipole. (c) If the dipole moment is reversed, the net force is in the opposite 
direction. 

We can state this result in another way that will be more applicable to our 
discussion of magnetic dipole moments. Let the field direction define the z axis. 
Then dipoles with $p_z > 0$ (as in Figure 7.14b) experience a net negative force and 
move in the negative z direction, while dipoles with $p_z < 0$ (as in Figure 7.14c) 
experience a net positive force and move in the positive z direction. 

A magnetic dipole moment $\vec{\mu}$ behaves in an identical way. (In fact, if we 
imagine fictitious N and S poles, the behavior of a magnetic moment would 
be described by illustrations similar to Figure 7.14.) A nonuniform magnetic 
field acting on the magnetic moments gives an unbalanced force that causes a 
displacement. Figure 7.15 illustrates the behavior of magnetic dipole moments 
having different orientations in a nonuniform magnetic field. The two different 
orientations give net forces in opposite directions: if $\mu_z$ is positive the force on 
the dipole is negative, and if $\mu_z$ is negative the force on the dipole is positive. 




The Stern-Gerlach Experiment 


Imagine the following experiment, illustrated schematically in Figure 7.16. A 
beam of hydrogen atoms is prepared in the $n = 2$, $l = 1$ state. The beam consists 
of equal numbers of atoms in the $m_l = -1$, 0, and $+1$ states. (We assume we can 
do the experiment so quickly that the $n = 2$ state doesn’t decay to the $n = 1$ state. 
In practice this may not be possible.) The beam passes through a region in which 
there is a nonuniform magnetic field. The atoms with $m_l = +1$ ($\mu_{L,z} = -\mu_B$) 
experience a net upward force and are deflected upward, while the atoms with 
$m_l = -1$ ($\mu_{L,z} = +\mu_B$) are deflected downward. The atoms with $m_l = 0$ are 
undeflected. 

FIGURE 7.15 Two magnetic dipoles in a nonuniform magnetic field. Oppositely 
directed dipoles experience net forces in opposite directions. 

FIGURE 7.16 Schematic diagram of the Stern-Gerlach experiment. A beam of atoms 
passes from an oven through a region where there is a nonuniform magnetic field and 
then strikes a screen. Atoms with their magnetic dipole moments in opposite 
directions experience forces in opposite directions. 

FIGURE 7.17 The results of the Stern-Gerlach experiment. (a) The image of the slit 
with the field turned off. (b) With the field on, two images of the slit appear. The small 
divisions in the scale at the left represent 0.05 mm. [Source: W. Gerlach and 
O. Stern, Zeitschrift für Physik 9, 349 (1922)] 

After passing through the field, the beam strikes a screen where it makes a 
visible image. When the field is off, we expect to see one image of the slit in the 
center of the screen, because there is no deflection at all. When the field is on, we 
expect three images of the slit on the screen: one in the center (corresponding to 
$m_l = 0$), one above the center ($m_l = +1$), and one below the center ($m_l = -1$). 
If the atom were in the ground state ($l = 0$), we would expect to see one image on the 
screen whether the field was off or on (recall that an $m_l = 0$ atom is not deflected). 
If we had prepared the beam in a state with $l = 2$, we would see five images with 
the field on. The number of images that appears is just the number of different $m_l$ 
values, which is equal to $2l + 1$. With the possible values for $l$ of 0, 1, 2, 3, ..., 
it follows that $2l + 1$ has the values 1, 3, 5, 7, ...; that is, we should always see 
an odd number of images on the screen. However, if we were actually to perform 
the experiment with hydrogen in the $l = 1$ state, we would find not three but 
six images on the screen! Even more confusing, if we did the experiment with 
hydrogen in the $l = 0$ state, we would find not one but two images on the screen, 
one representing an upward deflection and one a downward deflection! In the 
$l = 0$ state, the vector $\vec{L}$ has length zero, and so we expect that there is no magnetic 
moment for the magnetic field to deflect. We observe this not to be true: even 
when $l = 0$, the atom still has a magnetic moment, in contradiction to Eq. 7.21. 

The first experiment of this type was done by O. Stern and W. Gerlach in 1921. 
They used a beam of silver atoms; although the electronic structure of silver is 
more complicated than that of hydrogen (as we discuss in Chapter 8), the same 
basic principle applies: the silver atom must have $l = 0, 1, 2, 3, \ldots$, and so an odd 
number of images is expected to appear on the screen. In fact, they observed the 
beam to split into two components, producing two images of the slits on the screen 
(see Figure 7.17). 

The observation of separated images was the first conclusive evidence of spatial 
quantization; classical magnetic moments would have all possible orientations and 
would make a continuous smeared-out pattern on the screen, but the observation 
of a number of discrete images on the screen means that the atomic magnetic 
moments can take only certain discrete orientations in space. These correspond to 
the discrete orientations of the magnetic moment (or, equivalently, of the angular 
momentum). 

However, the number of discrete images on the screen does not agree with 
our expectations that it be an odd number. We expect $2l + 1$ images, so for 
two images we should have $l = 1/2$, which is not permitted by the Schrödinger 
equation. We can resolve this dilemma if there is another contribution to the 
angular momentum of the atom, the intrinsic angular momentum of the electron. 
An electron in an atom has two kinds of angular momentum, somewhat like the 
Earth as it both orbits the Sun and rotates on its axis. The electron has an orbital 
angular momentum $\vec{L}$, which characterizes the motion of the electron about the 
nucleus, and an intrinsic angular momentum $\vec{S}$, which behaves as if the electron 
were spinning about its axis. For this reason, $\vec{S}$ is usually called the intrinsic spin. 
(However, it is not correct to use the classical analogy to think of the electron as a 
tiny ball of charge spinning about an axis, because the electron is a point particle 
with no physical size.) The idea of electron spin was proposed by S. A. Goudsmit 
and G. E. Uhlenbeck in 1925, and P. A. M. Dirac showed in 1928 that relativistic 
quantum theory for the electron gives the electron spin directly as an additional 
quantum number. 

In order to explain the result of the Stern-Gerlach experiment, we must assign 
to the electron an intrinsic spin quantum number $s$ of 1/2. The intrinsic spin 
behaves much like the orbital angular momentum; there is the quantum number 
$s$ (which we can regard as a label arising from the mathematics), the angular 
momentum vector $\vec{S}$, a z component $S_z$, an associated magnetic moment $\vec{\mu}_S$, and a 
spin magnetic quantum number $m_s$. Figure 7.18 illustrates the vector properties of 
$\vec{S}$, and Table 7.2 compares the properties of orbital and spin angular momentum 
for electrons in atoms. 

The inclusion of spin gives a direct explanation for the Stern-Gerlach experiment. 
The outermost electron in a silver atom occupies a state with $l = 0$. 
(The other electrons do not contribute to the magnetic properties of the atom.) 
The magnetic behavior is therefore due entirely to the spin magnetic moment, 
which has only two possible orientations in the magnetic field (corresponding to 
$m_s = \pm 1/2$) and thus gives the two beams observed emerging from the magnet. 

Every fundamental particle has a characteristic intrinsic spin and a corresponding 
spin magnetic moment. For example, the proton and neutron also have a spin 
quantum number of 1/2. The photon has a spin quantum number of 1, while the pi 
meson (pion) has $s = 0$. 





TABLE 7.2 Orbital and Spin Angular Momentum of Electrons in Atoms 
FIGURE 7.18 The spin angular momentum of an electron and the spatial 
orientation of the spin angular momentum vector. 

Example 7.8 

In a Stern-Gerlach type of experiment, the magnetic 
field varies with distance in the z direction according 
to $dB_z/dz = 1.4$ T/mm. The silver atoms travel a distance 
$x = 3.5$ cm through the magnet. The most probable speed 
of the atoms emerging from the oven is $v = 750$ m/s. Find 
the separation of the two beams as they leave the magnet. 
The mass of a silver atom is $1.8 \times 10^{-25}$ kg, and its 
magnetic moment is about 1 Bohr magneton. 

Solution 
The potential energy of the magnetic moment in the magnetic 
field is 

$$U = -\vec{\mu}\cdot\vec{B} = -\mu_z B_z$$

because the field along the central axis of the magnet has 
only a z component. The force on the atom can be found 
from the potential energy according to 

$$F_z = -\frac{dU}{dz} = \mu_z\,\frac{dB_z}{dz}$$

The acceleration of a silver atom of mass $m$ as it passes 
through the magnet is 

$$a = \frac{F_z}{m} = \frac{\mu_z\,(dB_z/dz)}{m}$$

The vertical deflection $\Delta z$ of either beam can be found 
from $\Delta z = \frac{1}{2}at^2$, where $t$, the time to traverse the magnet, 
equals $x/v$. Each beam is deflected by this amount, so the 
net separation $d$ is $2\,\Delta z$, or 

$$d = \frac{\mu_z\,(dB_z/dz)\,x^2}{mv^2} = \frac{(9.27 \times 10^{-24}\ \text{J/T})(1.4 \times 10^{3}\ \text{T/m})(3.5 \times 10^{-2}\ \text{m})^2}{(1.8 \times 10^{-25}\ \text{kg})(750\ \text{m/s})^2} = 1.6 \times 10^{-4}\ \text{m} = 0.16\ \text{mm}$$

This is consistent with the separation that can be read from 
the scale in Figure 7.17. 
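
The arithmetic of Example 7.8 can be retraced step by step. In this sketch the function and argument names are hypothetical, and all quantities are in SI units; the calculation follows the same sequence as the example (force, acceleration, transit time, deflection).

```python
MU_B = 9.27e-24   # Bohr magneton, J/T (value used in the example)

def beam_separation(mu_z, dB_dz, length, mass, speed):
    """Separation of the two beams on leaving the magnet, in meters."""
    accel = mu_z * dB_dz / mass          # a = F_z / m, with F_z = mu_z dB_z/dz
    transit_time = length / speed        # t = x / v
    deflection = 0.5 * accel * transit_time**2
    return 2.0 * deflection              # the two beams deflect in opposite directions

d = beam_separation(mu_z=MU_B, dB_dz=1.4e3, length=3.5e-2,
                    mass=1.8e-25, speed=750.0)
print(d)
```

The result is about $1.6 \times 10^{-4}$ m. Because the separation scales as $1/v^2$, slower atoms in the beam are deflected considerably more than faster ones.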





7.7 ENERGY LEVELS AND SPECTROSCOPIC NOTATION 





We previously described all of the possible electronic states in hydrogen by three 
quantum numbers $(n, l, m_l)$, but as we have seen, a fourth property of the electron, 
the intrinsic angular momentum or spin, requires the introduction of a fourth 
quantum number. We don’t need to specify the spin $s$, because it is always 1/2 
(we regard it as a fundamental property of the electron, like its electric charge or 
its mass), but we must specify the value of the quantum number $m_s$ ($+1/2$ or $-1/2$), 
which tells us about the z component of the spin. Thus the complete description of 
the state of an electron in an atom requires the four quantum numbers $(n, l, m_l, m_s)$. 

For example, the ground state of hydrogen was previously labeled as $(n, l, m_l) =
(1, 0, 0)$. With the addition of $m_s$, this would become either (1, 0, 0, +1/2)
or (1, 0, 0, −1/2). The degeneracy of the ground state is now 2. The first
excited state would have eight possible labels: (2, 0, 0, +1/2), (2, 0, 0, −1/2),
(2, 1, +1, +1/2), (2, 1, +1, −1/2), (2, 1, 0, +1/2), (2, 1, 0, −1/2), (2, 1, −1, +1/2),
and (2, 1, −1, −1/2). There are now two possible labels for each previous single
label (each $n, l, m_l$ becomes $n, l, m_l, +1/2$ and $n, l, m_l, -1/2$), so the
degeneracy of each level is $2n^2$ instead of $n^2$.
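The counting above can be made concrete by listing the labels explicitly. The following is a minimal sketch (the function name is illustrative, and $m_s$ is stored as twice its value so that only integers appear):

```python
def hydrogen_states(n):
    """List all (n, l, m_l, 2*m_s) labels for principal quantum number n."""
    states = []
    for l in range(n):                    # l = 0, 1, ..., n-1
        for m_l in range(-l, l + 1):      # m_l = -l, ..., +l
            for two_m_s in (+1, -1):      # m_s = +1/2 or -1/2
                states.append((n, l, m_l, two_m_s))
    return states


for n in (1, 2, 3):
    # number of labels found, compared with the 2n^2 rule
    print(n, len(hydrogen_states(n)), 2 * n ** 2)
```

For n = 2 the list contains the eight labels written out in the text.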

It is important to know the direction (z component) of the angular momentum
vectors when an atom is in a magnetic field, but for most other applications the
values of $m_l$ and $m_s$ are of no significance, and it is cumbersome to write them
each time we wish to refer to a certain level of an atom. We therefore use a
different notation, known as spectroscopic notation, to label the levels. In this
system we use letters to stand for the different l values: for l = 0, we use the letter
s (do not confuse this with the quantum number s), for l = 1, we use the letter p,
and so on. The complete notation is as follows:


Value of l     0   1   2   3   4   5   6
Designation    s   p   d   f   g   h   i


(The first four letters stand for sharp, principal, diffuse, and fundamental, which
were terms used to describe atomic spectra before atomic theory was developed.)
In spectroscopic notation, the ground state of hydrogen is labeled 1s, where the
value n = 1 is specified before the s. Figure 7.19 illustrates the labeling of the
hydrogen atom levels in this notation.

Also shown in Figure 7.19 are arrows representing some different photons that
can be emitted when the atom makes a transition from one state to a lower state.
Some of the missing arrows (such as 4d to 3s) would represent transitions that are
not allowed to occur. By solving the Schrödinger equation and using the solutions
to compute transition probabilities, we find that the transitions most likely to
occur are those that change l by one unit. This restriction is called a selection rule,
and for atomic transitions the selection rule is

$$\Delta l = \pm 1 \qquad (7.24)$$

For example, the 3s level cannot emit a photon in a transition to the 2s level
($\Delta l = 0$), but rather must go to the 2p level ($\Delta l = 1$). There is no selection rule
for n, so the 3p level can go to 2s or 1s (but not to 2p).
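The selection rule lends itself to a quick enumeration. The sketch below lists, in spectroscopic notation, the lower levels a given hydrogen level can reach. The helper names are illustrative, and "lower" is taken to mean a smaller n, which assumes (as for hydrogen without fine structure) that the energy depends only on n.

```python
LETTERS = "spdfghi"   # spectroscopic letters for l = 0, 1, ..., 6


def label(n, l):
    """Spectroscopic label such as '3p' for n = 3, l = 1."""
    return f"{n}{LETTERS[l]}"


def allowed_lower_levels(n, l):
    """Levels (n', l') with n' < n that satisfy delta-l = +1 or -1."""
    result = []
    for n_lower in range(1, n):
        for l_lower in range(n_lower):        # l' = 0, ..., n'-1
            if abs(l_lower - l) == 1:         # selection rule, Eq. 7.24
                result.append(label(n_lower, l_lower))
    return result


print(label(3, 0), "->", allowed_lower_levels(3, 0))
print(label(3, 1), "->", allowed_lower_levels(3, 1))
```

The two printed lines reproduce the examples in the text: 3s can reach only 2p, while 3p can reach 1s or 2s.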


*7.8 THE ZEEMAN EFFECT 


Consider for the moment a hypothetical (and less interesting) world in which
the electron has no spin, and therefore no spin magnetic moment. Suppose we
prepared a hydrogen atom in a 2p (l = 1) level and placed it in an external
uniform magnetic field B (supplied by a laboratory electromagnet, for example).
The magnetic moment $\vec{\mu}_L$ associated with the orbital angular momentum then
interacts with the field, and the energy associated with this interaction is

$$U = -\vec{\mu}_L \cdot \vec{B} \qquad (7.25)$$

That is, magnetic moments aligned in the direction of the field have less energy
than those aligned oppositely to the field. Using Eq. 7.22 for the z component of
the magnetic moment (assuming that the field is in the z direction), we have

$$U = -\mu_{L,z} B = m_l \mu_B B \qquad (7.26)$$

in terms of the Bohr magneton $\mu_B$ defined in Eq. 7.23. In the absence of a
magnetic field, the 2p level has a certain energy $E_0$ (−3.4 eV). When the field
is turned on, the energy becomes $E_0 + U = E_0 + m_l \mu_B B$; that is, there are now
three different possible energies for the level, depending on the value of $m_l$.
Figure 7.20 illustrates this situation.


*This is an optional section that may be skipped without loss of continuity. 
FIGURE 7.19 A partial energy level diagram of hydrogen, showing the
spectroscopic notation of the levels and some of the transitions that satisfy
the $\Delta l = \pm 1$ selection rule.






FIGURE 7.20 The splitting of an l = 1 level in an external magnetic field.
(The effects of the electron's spin angular momentum are ignored.) The
energy in a magnetic field is different for different values of $m_l$.
FIGURE 7.21 The normal Zeeman effect. When the field is turned on, the
original wavelength $\lambda$ becomes three separate wavelengths.


Now suppose the atom emits a photon in a transition from the 2p state to the
1s ground state. In the absence of the magnetic field, a single photon is emitted
with an energy of 10.2 eV and a corresponding wavelength of 122 nm. When
the magnetic field is present, three photons can be emitted, with energies of
10.2 eV + $\mu_B B$, 10.2 eV, and 10.2 eV − $\mu_B B$. To determine how a small change
in energy $\Delta E$ affects the wavelength, we differentiate the expression $E = hc/\lambda$
and obtain

$$dE = -\frac{hc}{\lambda^2}\, d\lambda \qquad (7.27)$$

Replacing the differentials with small differences, taking absolute magnitudes,
and solving for $\Delta\lambda$ gives

$$\Delta\lambda = \frac{\lambda^2}{hc}\, \Delta E \qquad (7.28)$$

where $\Delta E$ is the energy splitting between the levels when the field is on
($\Delta E = \mu_B B$). Figure 7.21 illustrates the three transitions, and shows an example
of the result of a measurement of the emitted wavelengths.

In analyzing transitions between different $m_l$ states, often we need to use a
second selection rule: the only transitions that occur are those that change $m_l$ by
0, +1, or −1:

$$\Delta m_l = 0, \pm 1 \qquad (7.29)$$

Changes in $m_l$ of two or more are not permitted.

Example 7.9

Compute the change in wavelength of the 2p → 1s photon when a hydrogen atom
is placed in a magnetic field of 2.00 T.

Solution
The energy of the photon from n = 2 to n = 1 is
$E = -13.6\ \text{eV}\left(\frac{1}{2^2} - \frac{1}{1^2}\right) = 10.2$ eV, and its wavelength
is $\lambda = hc/E = (1240\ \text{eV·nm})/(10.2\ \text{eV}) = 122$ nm. The energy change $\Delta E$ of
the levels is

$$\Delta E = \mu_B B = (9.27 \times 10^{-24}\ \text{J/T})(2.00\ \text{T})
           = 18.5 \times 10^{-24}\ \text{J} = 11.6 \times 10^{-5}\ \text{eV}$$

and so, from Eq. 7.28,

$$\Delta\lambda = \frac{\lambda^2}{hc}\, \Delta E
               = \frac{(122\ \text{nm})^2}{1240\ \text{eV·nm}}\,(11.6 \times 10^{-5}\ \text{eV})
               = 0.00139\ \text{nm}$$

Even for a fairly large magnetic field of 2 T, the change in wavelength is very
small, but it is easily measurable using an optical spectrometer.
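The calculation in Example 7.9 can be repeated for other field strengths with a few lines of code. This is a sketch only: the names are illustrative, the conversion factor $1\ \text{eV} = 1.602 \times 10^{-19}$ J is a standard value not quoted in the example, and the small-shift approximation of Eq. 7.28 is assumed.

```python
HC_EV_NM = 1240.0          # h*c in eV*nm, as used in the example
MU_B_J_PER_T = 9.27e-24    # Bohr magneton, J/T
EV_IN_JOULES = 1.602e-19   # assumed conversion factor, J per eV


def zeeman_shift_nm(wavelength_nm, field_tesla):
    """Return delta-lambda = (lambda^2 / hc) * mu_B * B, in nm (Eq. 7.28)."""
    delta_e_ev = MU_B_J_PER_T * field_tesla / EV_IN_JOULES
    return wavelength_nm ** 2 / HC_EV_NM * delta_e_ev


wavelength = 122.0                       # 2p -> 1s wavelength, nm
shift = zeeman_shift_nm(wavelength, 2.00)
# The higher-energy photon has the shorter wavelength.
components = (wavelength - shift, wavelength, wavelength + shift)
print(f"shift = {shift:.5f} nm")
print("components (nm):", ", ".join(f"{c:.5f}" for c in components))
```

Since the shift is linear in B, a field of 1 T would give half the shift found in the example.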





The experiment we have just considered is an example of the Zeeman
effect: the splitting of a spectral line with a single wavelength into lines with
several different wavelengths when the emitting atoms are in an externally applied
magnetic field. In the normal Zeeman effect a single spectral line splits into three
components; this occurs only in atoms without spin. (All electrons of course have
spin, unlike the hypothetical spinless electrons we considered; however, in certain
atoms with several electrons, the spins can pair off and cancel, so that the atom
behaves like a spinless one.) When spin is present, we must consider not only
the effect of the orbital magnetic moment but also the spin magnetic moment.
The resulting pattern of level splittings is more complicated, and spectral lines
may split into more than three components. This case is known as the anomalous
Zeeman effect, an example of which is shown in Figure 7.22.

FIGURE 7.22 The anomalous Zeeman effect in sodium. (Top) The so-called
sodium D-lines, a close-lying doublet of wavelengths 589.0 and 589.6 nm in the
absence of a magnetic field. (Bottom) Splitting of the lines into six and four
components in a magnetic field. This image was photographed by Peter Zeeman
in 1897.


*7.9 FINE STRUCTURE

A careful inspection of the emission lines of atomic hydrogen shows that many
of them are in fact not single lines but very closely spaced combinations of two
lines. In this section we examine the origin of that effect, known as fine structure.
In this calculation it is more convenient for us to examine the hydrogen atom
from the electron's frame of reference, in which the proton appears to travel
around the electron, just as the Sun appears to travel around the Earth. For
convenience, we treat this problem in the context of the Bohr model to obtain an
estimate of the effect.
Figure 7.23a shows the atom viewed from the ordinary frame of reference of
the proton. We assume the electron to orbit counterclockwise so that the orbital
angular momentum L is in the z direction, and we also assume that the spin S
(which could point either up or down) is also in the z direction. The same situation
is shown in Figure 7.23b from the viewpoint of the electron, with the proton now
appearing to move in a circular orbit around the electron.
In the electron reference frame the motion of the proton in a circular orbit of
radius r can be considered to be a current loop, which causes a magnetic field B
at the electron, as shown in Figure 7.23c. This magnetic field interacts with the
spin magnetic moment of the electron, $\vec{\mu}_S = -(e/m)\vec{S}$. The interaction energy of
the magnetic moment $\vec{\mu}_S$ in a magnetic field is

$$U = -\vec{\mu}_S \cdot \vec{B} \qquad (7.30)$$

We choose the z direction to be the direction of B; with $\vec{\mu}_S = -(e/m)\vec{S}$, we have

$$U = \frac{e}{m}\, \vec{S} \cdot \vec{B} = \frac{e}{m}\, S_z B \qquad (7.31)$$

With $S_z = \pm\tfrac{1}{2}\hbar$, the energy is

$$U = \pm\frac{e\hbar}{2m} B = \pm\mu_B B \qquad (7.32)$$

*This is an optional section that may be skipped without loss of continuity. 
FIGURE 7.23 (a) An electron circulates about the proton in a hydrogen atom. (b)
From the point of view of the electron, the proton circulates about the electron.
(c) The apparently circulating proton is represented by the current i and causes a
magnetic field B at the location of the electron.

FIGURE 7.24 The fine-structure splitting in hydrogen. The state with L and
S parallel is slightly higher in energy than the state with L and S antiparallel.


The situation shown in Figure 7.23 has $S_z = +\tfrac{1}{2}\hbar$, and thus $U = +\mu_B B$. When S
has the opposite orientation, $U = -\mu_B B$. The effect is to split each level into two,
a higher state with L and S parallel and a lower state with L and S antiparallel, as
shown in Figure 7.24. The energy difference between the states is $\Delta E = 2\mu_B B$.

At this point, the result looks rather similar to that of our previous discussion 
of the Zeeman effect, but it is important to note one significant difference: the 
magnetic field B in this case is not a field in the laboratory that can be turned on 
or off; it is, instead, a field that is always present, produced by the relative motion 
between the proton and the electron. 

We can use the Bohr model to make a rough estimate of the magnitude of this
energy splitting. A circular loop of radius r carrying current i establishes at its
center a magnetic field $B = \mu_0 i/2r$. The current i is the charge carried around the
loop (+e in this case) divided by the time T for one orbit. The time for one orbit
is the distance traveled ($2\pi r$) divided by the speed v.

$$B = \frac{\mu_0 i}{2r} = \frac{\mu_0}{2r}\,\frac{e}{T} = \frac{\mu_0}{2r}\,\frac{ev}{2\pi r} \qquad (7.33)$$

The energy difference between the states is then

$$\Delta E = 2\mu_B B = \frac{\mu_0 e v}{2\pi r^2}\, \mu_B = \frac{\mu_0 e^2 \hbar^2 n}{4\pi m^2 r^3} \qquad (7.34)$$

where the last result is obtained by substituting $v = n\hbar/mr$ from Bohr's angular
momentum condition (Eq. 6.26) and $\mu_B = e\hbar/2m$ from Eq. 7.23. Finally,
substituting from Eq. 6.28 for the radius of the orbits in the Bohr atom, we obtain

$$\Delta E = \frac{\mu_0 e^2 \hbar^2 n}{4\pi m^2} \left( \frac{m e^2}{4\pi\epsilon_0 \hbar^2}\, \frac{1}{n^2} \right)^3
          = \frac{\mu_0 m e^8}{256 \pi^4 \epsilon_0^3 \hbar^4}\, \frac{1}{n^5} \qquad (7.35)$$


We can rewrite this in a somewhat simpler form by recalling that $c^2 = 1/\epsilon_0 \mu_0$
and using the dimensionless constant $\alpha$, known as the fine structure constant,

$$\alpha = \frac{e^2}{4\pi\epsilon_0 \hbar c} \qquad (7.36)$$

which gives

$$\Delta E = mc^2 \alpha^4\, \frac{1}{n^5} \qquad (7.37)$$
The value of the fine structure constant is approximately 1/137. For hydrogen in
the n = 2 level, we expect the energy difference between the state with L and S
parallel and the state with L and S antiparallel to be

$$\Delta E = (0.511\ \text{MeV}) \left( \frac{1}{137} \right)^4 \frac{1}{2^5} = 4.53 \times 10^{-5}\ \text{eV}$$

We can compare this estimate with the experimental value, based on the observed
splitting of the first line of the Lyman series, which gives $4.54 \times 10^{-5}$ eV. We see
that in spite of the assumptions we have made, our use of the Bohr model, and our
failure to use the hydrogen wave functions to do this calculation, the agreement
with the experimental value is remarkably good. (In fact, the agreement is so good
as to be embarrassing, for we neglected to consider the important relativistic effect
of the motion of the electron, which contributes to the fine structure about equally
to the spin-orbit interaction discussed in this section. We really should regard this
calculation as an order-of-magnitude estimate, which happens by chance to give
a numerical result close to the observed value.)
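The Bohr-model estimate of Eq. 7.37 is easy to tabulate for several levels. The sketch below uses the rounded values quoted in the text ($mc^2 = 0.511$ MeV and $\alpha \approx 1/137$); the function name is illustrative, and the result should be read as the same order-of-magnitude estimate discussed above, not as a full fine-structure calculation.

```python
ELECTRON_REST_ENERGY_EV = 0.511e6   # m c^2 for the electron, eV
ALPHA = 1.0 / 137.0                 # fine structure constant (approximate)


def fine_structure_estimate_ev(n):
    """Bohr-model estimate delta-E = m c^2 alpha^4 / n^5, in eV (Eq. 7.37)."""
    return ELECTRON_REST_ENERGY_EV * ALPHA ** 4 / n ** 5


for n in (2, 3, 4):
    print(f"n = {n}: delta-E = {fine_structure_estimate_ev(n):.3e} eV")
```

The $1/n^5$ dependence makes the estimated splitting fall off quickly: the n = 3 value is smaller than the n = 2 value by a factor of $(3/2)^5$, roughly 7.6.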


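Chapter Summary

Quantity | Expression | Section
Orbital angular momentum | $\lvert\vec{L}\rvert = \sqrt{l(l+1)}\,\hbar \quad (l = 0, 1, 2, \ldots)$ | 7.2
Orbital magnetic quantum number | $L_z = m_l \hbar \quad (m_l = 0, \pm 1, \pm 2, \ldots, \pm l)$ | 7.2
Spatial quantization | $\cos\theta = \dfrac{L_z}{\lvert\vec{L}\rvert} = \dfrac{m_l}{\sqrt{l(l+1)}}$ | 7.2
Angular momentum uncertainty relationship | $\Delta L_z\, \Delta\phi \geq \hbar$ | 7.2
Hydrogen quantum numbers | $n = 1, 2, 3, \ldots$; $l = 0, 1, 2, \ldots, n-1$; $m_l = 0, \pm 1, \pm 2, \ldots, \pm l$ | 7.3
Hydrogen energy levels | $E_n = -\dfrac{m e^4}{32\pi^2 \epsilon_0^2 \hbar^2}\, \dfrac{1}{n^2}$ | 7.3
Hydrogen wave functions | $\psi_{n,l,m_l}(r, \theta, \phi) = R_{n,l}(r)\, \Theta_{l,m_l}(\theta)\, \Phi_{m_l}(\phi)$ | 7.3
Radial probability density | $P(r) = r^2 \lvert R_{n,l}(r) \rvert^2$ | 7.4
Angular probability density | $P(\theta, \phi) = \lvert \Theta_{l,m_l}(\theta)\, \Phi_{m_l}(\phi) \rvert^2$ | 7.5
Orbital magnetic dipole moment | $\vec{\mu}_L = -(e/2m)\vec{L}$ | 7.6
Spin magnetic dipole moment | $\vec{\mu}_S = -(e/m)\vec{S}$ | 7.6
Spin angular momentum | $\lvert\vec{S}\rvert = \sqrt{s(s+1)}\,\hbar = \sqrt{3/4}\,\hbar$ (for s = 1/2) | 7.6
Spin magnetic quantum number | $S_z = m_s \hbar \quad (m_s = \pm 1/2)$ | 7.6
Spectroscopic notation | s (l = 0), p (l = 1), d (l = 2), f (l = 3), ... | 7.7
Selection rules for photon emission | $\Delta l = \pm 1$, $\Delta m_l = 0, \pm 1$ | 7.7, 7.8
Normal Zeeman effect | $\Delta\lambda = \dfrac{\lambda^2}{hc}\, \Delta E = \dfrac{\lambda^2}{hc}\, \mu_B B$ | 7.8
Fine-structure estimate | $\Delta E = mc^2 \alpha^4 / n^5 \quad (\alpha \approx 1/137)$ | 7.9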







Questions

1. How does the quantum-mechanical interpretation of the hydrogen atom differ
   from the Bohr model?

2. How does a quantized angular momentum vector differ from a classical angular
   momentum vector?

3. What are the meanings of the quantum numbers $n, l, m_l$ according to (a) the
   quantum-mechanical calculation; (b) the vector model; (c) the Bohr (orbital)
   model?

4. List the dynamical quantities that are constant for a specific choice of n
   and l. List the dynamical quantities that are not constant. Compare these
   lists with the Bohr model.

5. How does the orbital angular momentum differ between the Bohr model and the
   quantum-mechanical calculation?

6. What does it mean that L precesses about the z axis? Can we observe the
   precession?

7. In the Bohr model, we calculated the total energy from the potential energy
   and kinetic energy for each orbit. In the quantum-mechanical calculation, is
   the potential energy constant for any set of quantum numbers? Is the kinetic
   energy? Is the total energy?

8. What is meant by the term spatial quantization? Is space really quantized?

9. A deficiency of the Bohr model is the problem of angular momentum
   conservation in transitions between levels. Discuss this problem in relation
   to the quantum-mechanical angular momentum properties of the atom, especially
   the selection rule Eq. 7.24. The photon can be considered to carry angular
   momentum $\hbar$.

10. The 2s electron has a greater probability to be close to the nucleus than
    the 2p electron and also a greater probability to be farther away (see
    Figure 7.10). How is this possible?

11. The probability density $\psi^*\psi$ does not depend on $\phi$ for the wave functions
    listed in Table 7.1. What is the significance of this?

12. How would the wave functions of Table 7.1 change if the nuclear charge were
    Ze instead of e? (Recall how we made the same change in the Bohr model in
    Section 6.5.) What effect would this have on the radial probability
    densities P(r)?

13. Can a hydrogen atom in its ground state absorb a photon (of the proper
    energy) and end up in the 3d state?

14. Is it correct to think of the electron as a tiny ball of charge spinning on
    its axis? Is it useful? Is this situation similar to using the Bohr model
    to represent the electron's orbital motion?

15. The photon has a spin quantum number of 1, but its spin magnetic moment is
    zero. Explain.

16. What are the similarities and differences between Zeeman splitting and
    fine-structure splitting?

17. How would the calculated fine structure be different in an atom with a
    single electron and a nuclear charge of Ze?

18. Does the fine structure, as we have calculated it, have any effect on the
    n = 1 level?

19. How would (a) the Zeeman effect and (b) the fine structure be different in a
    muonic hydrogen atom? (See Problems 41 and 42 in Chapter 6.) The muon has
    the same spin as the electron, but is 207 times as massive.

20. Even though our calculation of the fine structure was based on a very
    simplified model, it does yield a result similar to the more correct
    calculation: the fine-structure splitting decreases as we go to higher
    excited states. Give at least two qualitative reasons for this.


Problems

7.1 A One-Dimensional Atom

1. By substituting the wave function $\psi(x) = Axe^{-bx}$ into Eq. 7.2, show that a
   solution can be obtained only for $b = 1/a_0$, and find the ground-state energy.

2. Show that the probability density for the ground-state solution of the
   one-dimensional Coulomb potential energy has its maximum at $x = a_0$.

3. An electron in its ground state is trapped in the one-dimensional Coulomb
   potential energy. What is the probability to find it in the region between
   $x = 0.99a_0$ and $x = 1.01a_0$?


7.2 Angular Momentum in the Hydrogen Atom

4. An electron is in an angular momentum state with l = 3. (a) What is the
   length of the electron's angular momentum vector? (b) How many different
   possible z components can the angular momentum vector have? List the
   possible z components. (c) What are the values of the angle that the L
   vector makes with the z axis?

5. What angles does the L vector make with the z axis when l = ?


7.3 The Hydrogen Atom Wave Functions

6. List the 16 possible sets of quantum numbers $n, l, m_l$ of the n = 4 level of
   hydrogen (as in Figure 7.6).

7. (a) What are the possible values of l for n = 6? (b) What are the possible
   values of $m_l$ for l = 6? (c) What is the smallest possible value of n for
   which l can be 4? (d) What is the smallest possible l that can have a z
   component of $4\hbar$?

8. Show that the (1, 0, 0) and (2, 0, 0) wave functions listed in Table 7.1 are
   properly normalized.

9. Show by direct substitution that the $n = 2, l = 0, m_l = 0$ and
   $n = 2, l = 1, m_l = 0$ wave functions of Table 7.1 are both solutions of
   Eq. 7.10 corresponding to the energy of the first excited state of hydrogen.

10. Show by direct substitution that the wave function corresponding to
    $n = 1, l = 0, m_l = 0$ is a solution of Eq. 7.10 corresponding to the
    ground-state energy of hydrogen.

11. Consider a thin spherical shell located between $r = 0.49a_0$ and $0.51a_0$. For
    the n = 2, l = 1 state of hydrogen, find the probability for the electron
    to be found in a small volume element that subtends a polar angle of 0.11°
    and an azimuthal angle of 0.25° if the center of the volume element is
    located at: (a) $\theta = 0, \phi = 0$; (b) $\theta = 90°, \phi = 0$;
    (c) $\theta = 90°, \phi = 90°$; (d) $\theta = 45°, \phi = 0$. Do the calculation for
    all possible $m_l$ values.


7.4 Radial Probability Densities

12. Show that the radial probability density of the 1s level has its maximum
    value at $r = a_0$.

13. Find the values of the radius where the n = 2, l = 0 radial probability
    density has its maximum values.

14. What is the probability of finding an n = 2, l = 1 electron between $a_0$ and
    $2a_0$?

15. For a hydrogen atom in the ground state, what is the probability to find
    the electron between $1.00a_0$ and $1.01a_0$? (Hint: It is not necessary to
    evaluate any integrals to solve this problem.)


7.5 Angular Probability Densities

16. Find the directions in space where the angular probability density for the
    $l = 2, m_l = \pm 1$ electron in hydrogen has its maxima and minima.

17. Find the directions in space where the angular probability density for the
    $l = 2, m_l = 0$ electron in hydrogen has its maxima and minima.


7.6 Intrinsic Spin

18. (a) Including the electron spin, what is the degeneracy of the n = 5 energy
    level of hydrogen? (b) By adding up the number of states for each value of
    l permitted for n = 5, show that the same degeneracy as part (a) is
    obtained.

19. For each l value, the number of possible states is 2(2l + 1). Show
    explicitly that the total number of states for each principal quantum
    number is $\sum_{l=0}^{n-1} 2(2l + 1) = 2n^2$. This gives the degeneracy of each
    energy level.

20. Explain why each of the following sets of quantum numbers
    $(n, l, m_l, m_s)$ is not permitted for hydrogen.
    (a) (2, 2, −1, +1/2) (b) (3, 1, +2, −1/2) (c) (4, 1, +1, −3/2)
    (d) (2, −1, +1, +1/2)
7.7 Energy Levels and Spectroscopic Notation

21. List the excited states (in spectroscopic notation) to which the 4p state
    can make downward transitions.

22. (a) A hydrogen atom is in an excited 5g state, from which it makes a series
    of transitions by emitting photons, ending in the 1s state. Show, on a
    diagram similar to Figure 7.19, the sequence of transitions that can occur.
    (b) Repeat part (a) if the atom begins in the 5d state.

23. (a) List in spectroscopic notation all levels with n = 7. (b) An electron
    is initially in the state with n = 7, l = 2. List in spectroscopic notation
    all lower states to which transitions are allowed.


7.8 The Zeeman Effect

24. Consider the normal Zeeman effect applied to the 3d to 2p transition.
    (a) Sketch an energy-level diagram that shows the splitting of the 3d and
    2p levels in an external magnetic field. Indicate all possible transitions
    from each $m_l$ state of the 3d level to each $m_l$ state of the 2p level.
    (b) Which transitions satisfy the $\Delta m_l = \pm 1$ or 0 selection rule?
    (c) Show that there are only three different transition energies emitted.

25. A collection of hydrogen atoms is placed in a magnetic field of 3.50 T.
    Ignoring the effects of electron spin, find the wavelengths of the three
    normal Zeeman components (a) of the 3d to 2p transition; (b) of the 3s to
    2p transition.


7.9 Fine Structure

26. Calculate the wavelengths of the components of the first line of the Lyman
    series, taking the fine structure of the 2p level into account.

27. Calculate the energies and wavelengths of the 3d to 2p transition, taking
    into account the fine structure of both levels. How many component
    wavelengths might there be in the transition?


General Problems

28. Show that the wave function $\psi(x) = A(x + cx^2)e^{-bx}$ gives a solution to the
    Schrödinger equation for the one-dimensional Coulomb potential energy.
    Evaluate the constants A, b, c, and find the energy corresponding to this
    solution.

29. Find the probabilities for the n = 2, l = 0 and n = 2, l = 1 electron
    states in hydrogen to be further than $r = 5a_0$ from the nucleus. Which has
    the greater probability to be far from the nucleus?

30. The mean or average value of the radius r can be found according to
    $r_{\rm av} = \int_0^\infty r P(r)\, dr$. Show that the mean value of r for the 1s state of
    hydrogen is $\tfrac{3}{2}a_0$. Why is this greater than the Bohr radius?

31. Find the value of $r_{\rm av}$ (see Problem 30) for the 2s and 2p levels.

32. The mean or average value of the potential energy of the electron in a
    hydrogen atom can be found from $U_{\rm av} = \int_0^\infty U(r) P(r)\, dr$. Find $U_{\rm av}$ in
    the 1s state and compare with the potential energy computed with the Bohr
    model when n = 1.

33. Suppose the source of atoms in a Stern-Gerlach experiment were an oven of
    temperature 1000 K. Assume the magnetic field gradient to be 10 T/m, and
    take the length of the magnetic field region and the field-free region
    between magnet and screen to be 1 m each. Make any other assumptions you
    may need and estimate the separation of the images observed on the screen.

34. For the 1s, 2s, and 2p states of hydrogen, show that
    $(r^{-1})_{\rm av} = 1/n^2 a_0$. This turns out to be a general result for any state
    of hydrogen. Based on this result, explain why the Bohr model gives such a
    good estimate for the fine-structure splitting as well as for other
    magnetic effects due to the circulating electron.


Chapter 8

MANY-ELECTRON ATOMS

This computer-generated drawing shows the structure of an atom of neon, with the electron
[...]
the 1s electrons, the dark outer sphere is the 2s electrons, and the lobes are the 2p electrons.
This is a more realistic picture of an atom than the "planetary" view developed in Chapter 6.
Physicists often attack complex problems by trying to separate the more important
parts from the less important. For example, in analyzing the motion of the Earth 
in the Solar System, we can start by ignoring all bodies other than the Sun. With 
this simplification, we find that the Earth moves about the Sun in an elliptical 
orbit. Now we can account for the effect of the Moon, which introduces a slight 
“wobble” about the ellipse. Finally, we can introduce the much weaker effect of 
the gravitational pull of the other planets. 

It is tempting to try to use a similar approach to understand the motion of 
electrons in atoms with more than one electron. Unfortunately, we can’t analyze 
the motion of an electron in an atom with more than one electron by separating out 
the more and less influential forces. For example, in a neutral atom with atomic 
number Z, each electron experiences an electrostatic force due to the nucleus with 
a charge of +Ze, but it also experiences an electrostatic force due to all the other 
electrons with a total charge of —(Z — 1)e. The effect of the nucleus is comparable 
to the effect of the other electrons, which can’t be analyzed as a small correction. 

We are thus required to consider simultaneously the effect of the nucleus 
and each of the other electrons. The problem of the mutual interactions of three 
or more objects is an example of what physicists call the many-body problem. 
Exact, closed-form solutions to the Schrödinger equation cannot be found for such
problems. The solutions must be obtained numerically using a computer. In this 
chapter, we consider an approximate set of energy levels for many-electron atoms, 
and we try to understand some of the properties of atoms (chemical, electrical, 
magnetic, optical, etc.) based on those energy levels. 


8.1 THE PAULI EXCLUSION PRINCIPLE 


Let’s begin by considering how the Z electrons in an atom might occupy the atomic 
energy levels. As a first guess, we might expect that all Z electrons will eventually 
cascade down to the lowest energy level, the 1s state. If this were correct, we 
would expect the properties of the atom to vary rather smoothly compared with its 
neighbors having Z ± 1 electrons. Indeed, certain of the properties of atoms, such
as the energies of the emitted X rays, show this smooth variation. However, other 
properties do not vary in this way and thus are not consistent with this model of 
all electrons in the same level. For example, neon (with Z = 10) is an inert gas; 
it is practically unreactive and does not form chemical compounds under most 
conditions. Its neighbors, fluorine (Z = 9) and sodium (Z = 11), are among the 
most reactive of the elements and under most conditions will combine with other 
substances, sometimes violently. As another example, nickel (Z = 28) is strongly 
magnetic (ferromagnetic) and, for a metal, does not have a particularly large 
electrical conductivity. Copper (Z = 29) is an excellent electrical conductor but 
is not magnetic. Such wide variations in properties between neighboring elements
suggest that it is not correct to assume that all electrons occupy the same energy
level.

Wolfgang Pauli (1900-1958, Switzerland). His exclusion principle gave the basis
for understanding atomic structure. He also contributed to the development of
quantum theory, to the theory of nuclear beta decay, and to the understanding of
symmetry in physical laws.

The rule that prevents all of the electrons in an atom from falling into the 1s
level was proposed by Wolfgang Pauli in 1925, based on a study of the transitions
that are present, and those that are expected but not present, in the emission
spectra of atoms. Simply stated, the Pauli exclusion principle is as follows:

No two electrons in a single atom can have the same set of quantum numbers
$(n, l, m_l, m_s)$.

The Pauli principle is the most important rule governing the structure of atoms,
and no study of the properties of atoms can be attempted without a thorough 
understanding of this principle. 

To illustrate how the Pauli principle works, consider the structure of helium
(Z = 2). The first electron in helium, in the 1s ground state, has quantum numbers
$n = 1, l = 0, m_l = 0, m_s = +1/2$ or $-1/2$. The second electron can have the same
$n$, $l$, and $m_l$, but it cannot have the same $m_s$, because the exclusion principle would
be violated. Thus if the first 1s electron has $m_s = +1/2$, the second 1s electron must
have $m_s = -1/2$. Now consider an atom of lithium (Z = 3). Just as with helium,
the first two electrons will have quantum numbers $(n, l, m_l, m_s) = (1, 0, 0, +1/2)$
and $(1, 0, 0, -1/2)$. According to the exclusion principle, the third electron cannot
have the same set of quantum numbers as the first two, so it cannot go into the n =
1 level, because there are only two different sets of quantum numbers available in
the n = 1 level, and both of those sets have already been used. The third electron
must therefore go into one of the n = 2 levels, and experiments indicate that the
2s level is the next available. Without the Pauli principle, lithium would have
three electrons in the 1s level; with the Pauli principle, we expect that lithium has
two electrons in the 1s level and one electron in the 2s level. These two different
possible structures for lithium would give very different physical properties, and
the physical properties of lithium indicate that the structure with one electron in
the 2s level is the correct one.

We can continue this process with beryllium (Z = 4). The fourth electron can
join the third electron in the 2s level, but that now completes the capacity of
the 2s level: one of the electrons might have quantum numbers $(n, l, m_l, m_s) =
(2, 0, 0, +1/2)$ and the other might have $(2, 0, 0, -1/2)$. There are no other sets of
quantum numbers that an additional electron could have in the 2s level without
duplicating one of the sets that has already been assigned and thus violating the
Pauli principle. When we reach boron, with Z = 5, the fifth electron must go
into a different level, one of the 2p levels. We might therefore expect that the
properties of boron, with a 2p electron, would be different from the properties of
lithium or beryllium, which have only 2s electrons.

It is this process of first using up all of the possible quantum numbers for one 
level, and then placing electrons in the next level, that accounts for the variations 
in the chemical and physical properties of the elements. 
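The filling process described above can be sketched in a few lines of code. This is an illustration under stated assumptions, not a general algorithm: the filling order 1s, 2s, 2p is simply the order quoted in the text (so only Z up to 10 is handled), the function names are invented for the example, and the choice of which $m_l$ and $m_s$ a given electron takes within a level is arbitrary here.

```python
FILLING_ORDER = [(1, 0), (2, 0), (2, 1)]   # 1s, 2s, 2p (order from the text)
LETTERS = "spdf"


def level_states(n, l):
    """All distinct (n, l, m_l, 2*m_s) sets available in one level."""
    return [(n, l, m_l, two_m_s)
            for m_l in range(l, -l - 1, -1)
            for two_m_s in (+1, -1)]


def ground_configuration(z):
    """Assign z electrons to distinct quantum-number sets, lowest level first."""
    slots = [s for (n, l) in FILLING_ORDER for s in level_states(n, l)]
    if z > len(slots):
        raise ValueError("this sketch only covers levels up to 2p (Z <= 10)")
    return slots[:z]          # no set is reused: the Pauli principle


def occupancy(z):
    """Number of electrons per level, e.g. {'1s': 2, '2s': 1} for lithium."""
    counts = {}
    for n, l, _, _ in ground_configuration(z):
        key = f"{n}{LETTERS[l]}"
        counts[key] = counts.get(key, 0) + 1
    return counts


for z, name in [(2, "He"), (3, "Li"), (4, "Be"), (5, "B")]:
    print(name, occupancy(z))
```

The capacity of each level, 2(2l + 1), comes out of the enumeration automatically; it is not put in by hand.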





] Example 8.1 


A certain atom has six electrons in the 3d level. (a) What is the maximum 
possible total m_l for the six electrons, and what is the total m_s in that 
configuration? (b) What is the maximum possible total m_s for the six electrons, 
and what would be the largest possible total m_l in that configuration? 

Solution 

(a) For a d state l = 2, so the possible m_l values are +2, +1, 0, −1, and −2. 
At most two electrons can be assigned m_l of +2 according to the Pauli principle 
(one with m_s = +1/2 and one with m_s = −1/2). Similarly, two electrons can be 
assigned m_l of +1 (again, with m_s = +1/2 and m_s = −1/2), and the remaining two 
electrons can be assigned to m_l of 0. That gives a total m_l of +6, with a total 
m_s of 0. 

(b) To maximize m_s, we can assign at most five electrons to m_s = +1/2 (with 
corresponding m_l values of +2, +1, 0, −1, and −2). The sixth electron cannot 
also have m_s = +1/2, because its m_l value would be the same as one already 
assigned, which would violate the Pauli principle by having two electrons with 
the same m_l and m_s labels. The sixth electron must therefore have m_s = −1/2, 
giving a total m_s of +2. The first five electrons give a total m_l of 0, so the 
largest total m_l would be obtained by assigning the sixth electron to m_l of +2, 
giving a total m_l of +2. 
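
The counting argument of Example 8.1 can be checked by brute force. The sketch 
below is only an illustration (the variable names are ours, not the text's): it 
lists every (m_l, m_s) label in a d subshell, picks six distinct labels in all 
possible ways (distinct labels being exactly what the Pauli principle requires), 
and reads off the extreme totals. 

```python
from fractions import Fraction
from itertools import combinations

L_QUANTUM = 2          # d subshell (illustrative constant name)
N_ELECTRONS = 6

# Every (m_l, m_s) label available in the subshell: 2(2l + 1) = 10 of them.
states = [(ml, ms)
          for ml in range(-L_QUANTUM, L_QUANTUM + 1)
          for ms in (Fraction(1, 2), Fraction(-1, 2))]

# Choosing distinct labels enforces the Pauli principle automatically.
totals = [(sum(ml for ml, _ in combo), sum(ms for _, ms in combo))
          for combo in combinations(states, N_ELECTRONS)]

# Part (a): largest total m_l, and the total m_s that goes with it.
max_ml = max(ml for ml, _ in totals)
ms_at_max_ml = {ms for ml, ms in totals if ml == max_ml}

# Part (b): largest total m_s, and the largest total m_l compatible with it.
max_ms = max(ms for _, ms in totals)
max_ml_at_max_ms = max(ml for ml, ms in totals if ms == max_ms)

print("(a) max total m_l =", max_ml, "with total m_s in", ms_at_max_ml)
print("(b) max total m_s =", max_ms, "with largest total m_l =", max_ml_at_max_ms)
```

Exact fractions are used for m_s so that the half-integer sums are not subject 
to rounding. 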
FIGURE 8.1 Atomic subshells, in order of increasing energy. The energy 
groupings are not to scale, but represent the relative energies of the 
subshells. 


8.2 ELECTRONIC STATES IN MANY-ELECTRON ATOMS 





Figure 8.1 illustrates the result of an approximate calculation of the order of the 
filling of energy levels in many-electron atoms as the atomic number Z increases. 
The 1s level is always the lowest energy level to be filled, and the 2s and 2p levels 
are fairly close in energy. The 2s level always lies a bit lower in energy than the 
2p level, and so the 2s level is filled before the 2p. (The fine-structure splitting 
is very small on the scale of this diagram.) We can understand why the 2s level 
lies lower in energy if we recall Example 7.6 and Figure 7.10. An electron in the 
2s level has a greater probability to be found at small radii compared with an 
electron in the 2p level. (Penetrating close to the nucleus, the 2s electron also is 
attracted by the full nuclear charge +Ze, while the 2p electron spends most of 
its time beyond the orbits of the 1s electrons where it is attracted by an effective 
charge that is less than the full charge of the nucleus. We’ll discuss this effect, 
which is called electron screening, in Section 8.3.) These two effects—closer 
penetration to the nucleus and screening—are responsible for the tighter binding 
of the 2s electrons compared with the 2p electrons. 

A more extreme example of the tighter binding of the penetrating orbits occurs 
for the n = 3 levels. The 3s electron penetrates the inner orbits (it has a large 
probability density at small r; see Figure 7.10), and the 3p electron penetrates 
almost as much. The 3d electron has negligible penetration of the inner orbits. As 
a result, the 3s and 3p levels are more tightly bound and therefore lower in energy 
than the 3d level. A similar effect occurs for the n = 4 levels—the tighter binding 
of the 4s and 4p electrons pulls their energy levels down so low that they almost 
coincide with the 3d level, as shown in Figure 8.1. The 3d and 4s levels are very 
close in energy—for some atoms the 3d level is lower and for some atoms the 
4s is lower. This small energy difference is an important factor that contributes to 
the large electrical conductivity of copper, as we discuss later in this chapter. 

The tighter binding of the penetrating s and p orbits also pulls the 5s and 5p 
levels down close to the 4d level, and similarly causes the 6s and 6p levels to 
appear at roughly the same energy as the 5d and 4f levels. 

As we learned in the case of the hydrogen atom, orbits with the same value 
of n all lie at about the same average distance from the nucleus. (The electrons 
in the penetrating orbits spend some of their time closer to the nucleus than the 
nonpenetrating orbits, but also some of their time further from the nucleus; the 
average distance from the nucleus of the penetrating orbits is then about the same 
as the average distance from the nucleus of the nonpenetrating orbits with the 
same value of n. See Problem 31 in Chapter 7 for a verification of this property 
for the hydrogen atom.) The set of orbits with a certain value of n, with about the 
same average distance from the nucleus, is known as an atomic shell. The atomic 
shells are designated by letter, as follows: 


n 1 2 3 4 5 
Shell K L M N O 


The levels with a certain value of n and l (for instance, 2s or 3d) are known 
as subshells. According to the Pauli principle, the maximum number of electrons 
that can be placed in each subshell is 2(2l + 1). The (2l + 1) factor comes from 
the number of different m_l values for each l, because m_l can take the values 
0, ±1, ±2, ±3, ..., ±l. The extra factor of 2 comes from the two different m_s 
values; for each m_l, we can have m_s = +1/2 or m_s = −1/2. According to this 
scheme, the 1s subshell has a capacity of 2(2 × 0 + 1) = 2 electrons; the 3d 
subshell has a capacity of 2(2 × 2 + 1) = 10 electrons. (Note that this capacity 
doesn’t depend on n; any d subshell has a capacity of 10 electrons.) Table 8.1 
shows the ordering and capacity of the subshells. 
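
As a quick check of the capacity rule, the following sketch evaluates 2(2l + 1) 
for the s, p, d, and f subshells. The function names are ours, chosen for 
illustration. The shell total assumes, as for hydrogen in Chapter 7, that l runs 
from 0 to n − 1 within a shell. 

```python
SUBSHELL_LETTERS = "spdf"   # l = 0, 1, 2, 3


def subshell_capacity(l):
    """Maximum number of electrons in a subshell: (2l + 1) m_l values times 2 m_s values."""
    return 2 * (2 * l + 1)


def shell_capacity(n):
    """Sum of the subshell capacities for l = 0 .. n - 1 (assumed range of l)."""
    return sum(subshell_capacity(l) for l in range(n))


for l, letter in enumerate(SUBSHELL_LETTERS):
    print(f"{letter} subshell (l = {l}): capacity {subshell_capacity(l)}")

for n, shell in enumerate("KLMN", start=1):
    print(f"{shell} shell (n = {n}): capacity {shell_capacity(n)}")
```

The capacity depends only on l, which is why every d subshell holds 10 electrons 
regardless of n. 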

It is important to keep in mind exactly what is represented by Figure 8.1 
and Table 8.1. They give the order of filling of the energy levels, and so they 
represent only the “outer” or valence electrons. For example, the first 18 electrons 
fill the levels up through 3p, and the energy levels (subshells) available to the 
19th electron in potassium (Z = 19) or calcium (Z = 20) are well described by 
Figure 8.1. However, the energy levels appropriate to the 19th electron in a heavy 
element such as lead (Z = 82) would be very different. In this case it is more 
correct to describe the atom in terms of shells: all of the n = 3 states (the M 
shell) are grouped together, as are all of the n = 4 states (the N shell), and so 
forth. When we discuss the inner structure of the atom, as in the case of X rays, 
the ordering of Figure 8.1 is not appropriate, and it is more appropriate to group 
the levels by shells, as we do in Section 8.5. 


The Periodic Table 


Figure 8.2 shows the periodic table, which is an orderly array of the chemical 
elements, listed in order of increasing atomic number Z and arranged in such a 
way that the vertical columns, called groups, contain elements with rather similar 
physical and chemical properties. In this section we discuss the way in which the 
filling of electronic subshells helps us understand the arrangement of the periodic 
table. In later sections we examine some of the physical and chemical properties 
of the elements. 
TABLE 8.1 Filling of Atomic Subshells 

n    l    Subshell    Capacity 2(2l + 1) 
1    0    1s           2 
2    0    2s           2 
2    1    2p           6 
3    0    3s           2 
3    1    3p           6 
4    0    4s           2 
3    2    3d          10 
4    1    4p           6 
5    0    5s           2 
4    2    4d          10 
5    1    5p           6 
6    0    6s           2 
4    3    4f          14 
5    2    5d          10 
6    1    6p           6 
7    0    7s           2 
5    3    5f          14 
FIGURE 8.2 The periodic table of the elements. 

In attempting to understand the ordering of subshells and the periodic table, 
we must follow two rules for filling the electronic subshells: 


1. The capacity of each subshell is 2(2l + 1). (This is of course just another way 
of stating the Pauli exclusion principle.) 
2. The electrons occupy the lowest energy states available. 


To indicate the electron configuration of each element, we use a notation in 
which the identity of the subshell and the number of electrons in it are listed. The 
identity of the subshell is indicated in the usual way, and the number of electrons 
in that subshell is indicated by a superscript. Thus hydrogen has the configuration 
1s¹, for one electron in the 1s subshell, and helium has the configuration 1s². Helium 
has both a filled subshell (the 1s) and a closed major shell (the K shell) and 
thus is an extraordinarily stable and inert element. With lithium (Z = 3), we 
begin to fill the 2s subshell; lithium has the configuration 1s²2s¹. With beryllium 
(Z = 4, 1s²2s²) the 2s subshell is full, and the next element must begin filling 
the 2p subshell (boron, Z = 5, 1s²2s²2p¹). The 2p subshell has a capacity of six 
electrons, and with neon (Z = 10, 1s²2s²2p⁶) both the 2p subshell and the L shell 
(n = 2) are complete. 

The next row (or period) begins with sodium (Z = 11, 1s²2s²2p⁶3s¹), and the 
3s and 3p subshells are filled in much the same way as the 2s and 2p subshells, 
ending with the inert gas argon (Z = 18, 1s²2s²2p⁶3s²3p⁶). The elements of the 
third row (period) are chemically similar to the corresponding elements of the 
second row (period), and so are written directly under them. The next electron 
might be expected to go into the 3d level. However, the highly penetrating orbit 
of the 4s electron causes the 4s level to appear at a slightly lower energy than the 
3d level, so the 4s subshell normally fills first. The configurations of potassium 
(Z = 19) and calcium (Z = 20) are therefore respectively 1s²2s²2p⁶3s²3p⁶4s¹ 
and 1s²2s²2p⁶3s²3p⁶4s². These elements have properties similar to, and therefore 
appear directly under, the corresponding elements with one and two s-subshell 
electrons in the second and third periods. 

We now begin to fill the 3d subshell. Because there is no 1d or 2d subshell, 
we would expect the first element with a d-subshell configuration to have rather 
different chemical properties from the elements we have placed previously; 
thus it should not appear in any of our previously occupied groups (columns), 
and so we begin a new group with scandium (Z = 21, 1s²2s²2p⁶3s²3p⁶4s²3d¹). 
The 3d subshell eventually closes with zinc (Z = 30, 1s²2s²2p⁶3s²3p⁶4s²3d¹⁰). 
Along the way there are some minor variations; the most important is copper, 
with Z = 29. For this case the 3d level lies slightly lower than the 4s level, 
and so the 3d subshell fills before the 4s, resulting in the configuration 
1s²2s²2p⁶3s²3p⁶3d¹⁰4s¹. As we discuss later, this configuration is responsible 
for the large electrical conductivity of copper. 
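
The two filling rules can be turned into a short procedure. The sketch below is 
an illustration under a stated assumption: it applies the fixed subshell order 
described in the text (1s, 2s, 2p, 3s, 3p, 4s, 3d, ...) to every element, so it 
cannot reproduce the variations noted above. The function name and the "4s^1" 
output notation are ours. 

```python
# Filling order as described in the text (Figure 8.1 and Table 8.1).
FILLING_ORDER = ["1s", "2s", "2p", "3s", "3p", "4s", "3d", "4p", "5s",
                 "4d", "5p", "6s", "4f", "5d", "6p", "7s", "5f"]
L_VALUES = {"s": 0, "p": 1, "d": 2, "f": 3}


def naive_configuration(Z):
    """Fill subshells in FILLING_ORDER, each up to its capacity 2(2l + 1)."""
    remaining = Z
    parts = []
    for subshell in FILLING_ORDER:
        if remaining == 0:
            break
        capacity = 2 * (2 * L_VALUES[subshell[1]] + 1)
        count = min(capacity, remaining)
        parts.append(f"{subshell}^{count}")
        remaining -= count
    if remaining:
        raise ValueError("Z exceeds the subshells listed in FILLING_ORDER")
    return " ".join(parts)


for name, Z in [("lithium", 3), ("neon", 10), ("potassium", 19), ("copper", 29)]:
    print(f"{name:10s} Z={Z:2d}  {naive_configuration(Z)}")
```

For copper this fixed-order rule ends in 4s^2 3d^9, whereas the observed ground 
state ends in 3d¹⁰4s¹. The discrepancy is a reminder that the 3d and 4s levels 
are nearly degenerate, so a fixed ordering is only a first approximation. 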

In the next series of elements, the 4p subshell is filled, from gallium (Z = 31) 
to the inert gas krypton (Z = 36). When we move to the next period, we fill the 5s 
subshell before the 4d subshell, and the series of 10 elements corresponding to the 
filling of the 4d subshell is written directly under the series that had unfilled config- 
urations in the 3d subshell. (Silver, with Z = 47, corresponds exactly to copper in 
the fourth period, with the 4d subshell filling before the 5s.) After the completion of 
the 4d subshell, the 5p subshell is filled, ending with the inert gas xenon (Z = 54). 

The next period begins with cesium and barium filling the 6s subshell. As was 
the case in the previous periods, the 5d and 6s lie at almost the same energy. 
However, there is yet another subshell at about the same energy as the 6s and 5d: 
the 4f subshell, which now begins to fill, from lanthanum to ytterbium. This series 
of elements, called the lanthanides or rare earths, is usually written separately 
in the periodic table, because there have been no other f-subshell elements under 
which to write them. The 4f subshell has a capacity of 14 electrons, and so there 
are 14 elements in the lanthanide series. Once the 4f subshell is complete, we 
return to filling the 5d subshell, writing those elements in the groups under the 
corresponding 3d and 4d elements, and then complete the period with the filling of 
the 6p subshell, ending with the inert gas radon (Z = 86). The seventh period is 
filled much like the sixth, with a series known as the actinides, written under 
the lanthanides, corresponding to the filling of the 5f subshell. 

TABLE 8.2 Electronic Configurations of Some Elements 

H    1s¹            Mn   [Ar]4s²3d⁵          La   [Xe]6s²5d¹ 
He   1s²            Cu   [Ar]4s¹3d¹⁰         Ce   [Xe]6s²5d¹4f¹ 
Li   1s²2s¹         Zn   [Ar]4s²3d¹⁰         Pr   [Xe]6s²4f³ 
Be   1s²2s²         Ga   [Ar]4s²3d¹⁰4p¹      Gd   [Xe]6s²5d¹4f⁷ 
B    1s²2s²2p¹      Kr   [Ar]4s²3d¹⁰4p⁶      Dy   [Xe]6s²4f¹⁰ 
Ne   1s²2s²2p⁶      Rb   [Kr]5s¹             Yb   [Xe]6s²4f¹⁴ 
Na   [Ne]3s¹        Y    [Kr]5s²4d¹          Lu   [Xe]6s²5d¹4f¹⁴ 
Al   [Ne]3s²3p¹     Mo   [Kr]5s¹4d⁵          Re   [Xe]6s²5d⁵4f¹⁴ 
Ar   [Ne]3s²3p⁶     Ag   [Kr]5s¹4d¹⁰         Au   [Xe]6s¹5d¹⁰4f¹⁴ 
K    [Ar]4s¹        In   [Kr]5s²4d¹⁰5p¹      Hg   [Xe]6s²5d¹⁰4f¹⁴ 
Sc   [Ar]4s²3d¹     Xe   [Kr]5s²4d¹⁰5p⁶      Tl   [Xe]6s²5d¹⁰4f¹⁴6p¹ 
Cr   [Ar]4s¹3d⁵     Cs   [Xe]6s¹             Rn   [Xe]6s²5d¹⁰4f¹⁴6p⁶ 

A symbol in brackets [ ] means that the atom has the configuration of the 
previous inert gas plus the additional electrons listed. 

What is most remarkable about this scheme is that the arrangement of the 
periodic table was known well before the introduction of atomic theory. The 
elements were organized into groups and periods based on their physical and 
chemical properties by Dmitri Mendeleev in 1869; understanding that organization 
in terms of atomic levels is a great triumph for the atomic theory. This way of 
organizing the elements gives us great insight into their physical and chemical 
properties, as we discuss in the next sections. 

Table 8.2 lists the electronic configurations of some of the elements. 


Example 8.2 








Copper has the electronic configuration [Ar]4s¹3d¹⁰ in its ground state. By 
adding a small amount of energy (about 1 eV) to a copper atom, it is possible to 
move one of the 3d electrons to the 4s level and change the configuration to 
[Ar]4s²3d⁹. By adding still more energy (about 5 eV), one of the 3d electrons can 
be moved to the 4p level so that the configuration becomes [Ar]4s¹3d⁹4p¹. For each 
of these configurations, determine the maximum value of the total m_s of the 
electrons. 

Solution 

The electrons in the filled shells (Ar core) have a total m_s of zero. In fact, 
any filled subshell has equal numbers of electrons in m_s = +1/2 and m_s = −1/2 
states, which also gives a total of zero. In the 4s¹3d¹⁰ configuration, only the 
single 4s electron contributes to m_s, and its maximum value is +1/2. In the 
4s²3d⁹ configuration, the two 4s electrons give a total m_s of zero. In the 3d 
subshell, there are 5 different m_l values, so we can have at most 5 electrons 
with m_s = +1/2. The remaining 4 electrons must have m_s = −1/2, so we have a 
total m_s of 5 × (+1/2) + 4 × (−1/2) = +1/2. In the 4s¹3d⁹4p¹ configuration, each 
of the three subshells contributes a maximum m_s of +1/2, so the maximum total 
m_s is +3/2. 
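
The reasoning of Example 8.2 amounts to a simple rule: in each subshell, put as 
many electrons as possible (at most 2l + 1) into m_s = +1/2 and the rest into 
m_s = −1/2. The following sketch applies that rule. The dictionary format and 
function name are our own illustrative choices, and the filled Ar core is left 
out because it contributes zero. 

```python
from fractions import Fraction

L_VALUES = {"s": 0, "p": 1, "d": 2, "f": 3}


def max_total_ms(configuration):
    """Largest total m_s for a configuration given as {subshell: electron count}."""
    total = Fraction(0)
    for subshell, count in configuration.items():
        n_ml = 2 * L_VALUES[subshell[-1]] + 1      # number of distinct m_l values
        if count > 2 * n_ml:
            raise ValueError(f"{subshell} cannot hold {count} electrons")
        spin_up = min(count, n_ml)                 # one m_s = +1/2 per m_l value
        spin_down = count - spin_up                # the rest must pair up
        total += Fraction(spin_up - spin_down, 2)
    return total


# The three copper configurations of Example 8.2, outside the Ar core.
print(max_total_ms({"4s": 1, "3d": 10}))           # ground state
print(max_total_ms({"4s": 2, "3d": 9}))
print(max_total_ms({"4s": 1, "3d": 9, "4p": 1}))
```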
FIGURE 8.3 Electron structure in lithium, as might be seen from the 
average location of an outer (2s) electron. The dashed line represents 
a spherical Gaussian surface at that location. 


8.3 OUTER ELECTRONS: SCREENING AND OPTICAL TRANSITIONS 





The electronic configurations of the alkali elements (those in the first column of 
the periodic table) all show a single s electron outside an inert gas core. These 
elements are very reactive, meaning they can easily give up the s electron to 
another element to form a chemical bond. For example, lithium (1s²2s¹) readily 
gives up its 2s electron to form the positive ion Li⁺. 

It may at first seem somewhat surprising that Li gives up its electron so easily. 
The ionization energy of Li is 5.39 eV. This is smaller than the ionization energy 
of hydrogen (13.6 eV), even though from Eq. 6.38 we might expect that the 
energies of electrons in atoms should increase in proportion to Z². 

We can understand this effect from the diagram of Figure 8.3. The lithium 
atom can be roughly characterized by an inner atomic shell consisting of two 1s 
electrons and a single electron in the 2s subshell. As was the case in the one- 
electron atoms we considered in Chapters 6 and 7, the principal quantum number 
n determines the average distance of an electron from the nucleus. Although there 
is no simple formula that allows us to calculate the average orbital radius in atoms 
with more than one electron, it is certainly reasonable to expect that the 2s electron 
is most likely to be found much farther from the nucleus than the 1s electrons. 

The net electric force on the 2s electron can be estimated using Gauss’s law. 
Imagine a spherical surface centered at the nucleus having a radius equal to the 
average orbital radius of the 2s electron. The electric field at that distance is 
determined, according to Gauss’s law, by the net charge contained within the 
sphere. The electrons in the n = 1 orbit have nearly a 100% probability of being 
found within the sphere. Thus the net charge inside the sphere must include the 
nucleus (+3e) and the two n = 1 electrons (−2e) for a total net charge of +e. To 
a good approximation, for some applications a lithium atom looks very much like 
a one-electron atom with the electron in the n = 2 orbit about a nucleus with an 
effective charge of +e. (Recall from electrostatics that if the charge distribution is 
spherically symmetric, we can replace an extended charge distribution with a point 
charge at the center of the sphere.) Equation 6.38 gives the energy of such an 
electron in the n = 2 orbit in an atom with an effective nuclear charge of 
Z_eff e = +e as 

\[ E_n = (-13.6\ \text{eV})\,\frac{Z_{\text{eff}}^2}{n^2} = -3.40\ \text{eV} \tag{8.1} \]

This simple model predicts that the ionization energy of a neutral lithium atom 
is 3.40 eV. The measured value is 5.39 eV. The agreement is not extremely 
good, but the estimated value is off by much less than a factor of Z² = 9, so the 
calculation is probably on the right track. 

The difference between the measured and estimated values can be accounted 
for by an effect that we have already discussed: the penetration of the s electrons 
through the inner shells to be occasionally found close to the nucleus. The 2s 
electron sometimes finds itself much closer to the nucleus than its average orbital 
radius, and may occasionally be inside the n = 1 shell. In this case Gauss’s law 
tells us that the electron feels the full +3e charge of the nucleus, which results in 
an increase in the binding energy. 

Let’s instead consider an excited state of lithium, in which the 2s electron 
moves to the 2p state. The 2p electron penetrates the inner shell hardly at all. The 
energy of the 2p electron in lithium is −3.54 eV, in almost exact agreement with 
the prediction of our simple model. The small discrepancy might indicate a small 
degree of penetration of the 2p electron inside some of the 1s probability 
distribution, which gives a small increase to the binding energy. If we instead move 
the outer electron to the 3d state, the measured energy is −1.51 eV, in exact agreement 
with the prediction of Eq. 8.1 for n = 3. The 3d electron has almost no penetration 
inside the 1s shell, and so that electron is very well described by Z_eff = 1. 

This effect is called electron screening. To an outer electron, the charge of the 
nucleus can be screened or shielded by the electrons in the inner shells. This is 
one case in which the formulas we derived for the energies of a one-electron atom 
can be used to determine approximately the energy of an electron in an atom with 
more than one electron. For the outer electron in lithium, the 3 positive charges 
in the nucleus are screened by the negative charges of the two inner electrons, 
giving a net charge of one unit. The less penetrating is the orbit of the outer 
electron, the more accurate is the prediction of Eq. 8.1. In lithium, for example, 
the 3d orbit has almost no penetration of the inner shells and so the formula gives 
a very accurate representation of the binding of that electron. The 2p orbit in 
lithium has relatively little penetration, so again the approximate formula gives a 
good prediction. It is less accurate for the 2s electron, which does occasionally 
penetrate through the inner 1s orbits. 
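
The comparison between the screening model and the measured lithium energies 
quoted above can be tabulated in a few lines. This is only a sketch: the function 
name is ours, Z_eff = 1 is the assumption of complete screening by the two 1s 
electrons, and the measured values are those given in the text. 

```python
RYDBERG_EV = 13.6   # magnitude of the hydrogen ground-state energy, in eV


def screened_energy_ev(n, z_eff=1.0):
    """Energy of Eq. 8.1: one electron in level n around an effective charge z_eff."""
    return -RYDBERG_EV * z_eff**2 / n**2


# Measured energies of the outer electron in lithium, as quoted in the text (eV).
measured_ev = {"2s": -5.39, "2p": -3.54, "3d": -1.51}

for level, measured in measured_ev.items():
    n = int(level[0])
    model = screened_energy_ev(n)
    print(f"{level}: model {model:6.2f} eV, measured {measured:6.2f} eV, "
          f"difference {measured - model:+.2f} eV")
```

The difference shrinks from the 2s to the 2p to the 3d level, following the 
decreasing penetration of those orbits into the 1s shell. 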

Electron screening can also be used in a qualitative way to help understand the 
ionization energies of atoms. Consider helium, for example. In ionized helium, the 
single electron has an energy of —54.4 eV in its ground state. If we add a second 
electron to make neutral helium (with both electrons in the 1s state), the ionization 
energy is 24.6 eV. The screening of one electron by a portion of the probability dis- 
tribution of the other is responsible for reducing the ionization energy from 54.4 eV 
when no second electron is present to 24.6 eV when the second electron is present. 
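
One rough way to put a number on this qualitative picture is to invert Eq. 8.1 
and ask what effective charge would give the measured binding energy. The sketch 
below does this. It is a heuristic of ours rather than a method used in the text, 
and the function name is illustrative. 

```python
import math

RYDBERG_EV = 13.6   # magnitude of the hydrogen ground-state energy, in eV


def effective_charge(n, binding_energy_ev):
    """Z_eff that reproduces a given binding energy in Eq. 8.1 (heuristic inversion)."""
    return n * math.sqrt(binding_energy_ev / RYDBERG_EV)


# (label, n, binding energy in eV), using values quoted in the text.
cases = [
    ("He+ 1s (one electron)", 1, 54.4),
    ("He  1s (neutral atom)", 1, 24.6),
    ("Li  2s", 2, 5.39),
    ("Li  2p", 2, 3.54),
    ("Li  3d", 3, 1.51),
]

for label, n, energy in cases:
    print(f"{label}: Z_eff = {effective_charge(n, energy):.2f}")
```

Values between the fully screened and the unscreened charge indicate partial 
screening. For neutral helium the result lies between 1 and 2, consistent with 
each 1s electron screening only part of the nuclear charge from the other. 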





Example 8.3 

The ground state of helium has the configuration 1s². Use the electron 
screening model to predict the energies of the following excited states of 
helium: (a) 1s¹2s¹ (measured value −4.0 eV); (b) 1s¹2p¹ (−3.4 eV); 
(c) 1s¹3d¹ (−1.5 eV). 

Solution 

(a) For the outer electron in helium, the nuclear charge of +2e is screened by 
the single 1s electron, so the effective charge seen by the outer electron is +e. 
From Eq. 8.1, we have 

\[ E_n = (-13.6\ \text{eV})\,\frac{Z_{\text{eff}}^2}{n^2} = (-13.6\ \text{eV})\,\frac{1^2}{2^2} = -3.4\ \text{eV} \]

The measured value is −4.0 eV, suggesting that the 2s electron has a small 
penetration through the 1s distribution and thus experiences a somewhat tighter 
binding than this simple model predicts. 

(b) Because Eq. 8.1 depends on n but not l, the calculation for the 2p excited 
state gives the same result as the calculation for the 2s excited state (−3.4 eV). 
Now the agreement is almost exact, because the 2p has less penetration than 
the 2s. 

(c) For the 3d excited state, Eq. 8.1 gives 

\[ E_n = (-13.6\ \text{eV})\,\frac{Z_{\text{eff}}^2}{n^2} = (-13.6\ \text{eV})\,\frac{1^2}{3^2} = -1.5\ \text{eV} \]

The agreement is again very good, suggesting little penetration of the 3d 
electron inside the 1s probability distribution. 
FIGURE 8.5 A small portion of the energy level diagram for helium. Note 
the Δl = ±1 transitions. 


Optical Transitions 


When we excite one of the outer electrons to a higher energy level or remove 
it completely from the atom, the resulting vacancy can be filled by electrons 
dropping into the empty state. The energy lost by these electrons usually appears 
as emitted photons, which are in the visible range of the spectrum and are thus 
known as optical transitions. The binding energies of the outer electrons in a 
typical atom are of the order of several electron-volts, and so it takes relatively 
little energy to move an outer electron and produce an optical transition. In 
fact, it is the absorption and reemission of light by these outer electrons that 
are responsible for the colors of material objects (although in solids the electron 
energy levels are usually very different from those in isolated atoms). In contrast 
with X-ray spectra, which vary slowly and smoothly from one element to the 
next, optical spectra can show large variations between neighboring elements, 
especially those that correspond to filled subshells. 

Beyond hydrogen, the simplest energy-level diagrams to understand are those 
of the alkali metals, which have a single s electron outside an inert core. Many of 
the excited states then correspond to the excitation of this single electron, and the 
resulting spectra are very similar to the spectrum of hydrogen, because the nuclear 
charge of +Ze is screened by the other (Z — 1) electrons. Figure 8.4 shows the 
energy levels of Li and Na along with some of the emitted transitions, which follow 
the same Δl = ±1 selection rule as the transitions in hydrogen (see Figure 7.19). 

The ground-state configuration of lithium is 1s²2s¹ and the ground-state 
configuration of sodium is 1s²2s²2p⁶3s¹. The excited states in both cases can be 
obtained by moving the outer electron to a higher state. For example, the first 
excited state of Li is 1s²2p¹, with the 2s electron moving to the 2p level. (The 
energy necessary to accomplish this can be provided by various means, such as 
by absorption of a photon or by passage of an electric current through the material 
as in a gas discharge tube.) The excited electron in the 2p state rapidly drops back 
to the 2s state, with the emission of a photon of wavelength 670.8 nm. The inert 
core doesn’t participate in this excitation or emission, so to a good approximation 
we can ignore all but the outer electron in studying the levels and transitions in 
the alkali elements. 

FIGURE 8.4 (a) Energy-level diagram of lithium, showing some of the transitions (labeled with wavelength in nm) in 
the optical region. (b) Energy-level diagram for sodium. Because of the fine-structure splitting, the 3p level in sodium is 
actually a very closely spaced pair of levels, so all transitions involving that level show two closely spaced wavelengths. 
The wavelength shown here is the average of the two. The fine-structure splitting is negligible for the other levels in sodium 
and for all of the levels in lithium. 

The ground-state configuration of helium is 1s². We can produce an excited 
state by moving one of these electrons up to a higher level, and so some possible 
excited-state configurations might be 1s¹2s¹, 1s¹2p¹, 1s¹3s¹, and so forth. Photons 
are emitted when the excited electron drops back to the 1s level. The Δl = ±1 
selection rule for transitions once again limits those that can occur. Figure 8.5 
shows a portion of the energy level diagram for helium. 

The phenomenon of fluorescence is responsible for the appearance of objects 
under so-called “black light,” which is a source of ultraviolet radiation. Photons 
in the ultraviolet region, invisible to the human eye, have higher energies than 
those in the visible region, and hence if an ultraviolet photon is absorbed by an 
atom, the outer electron (which is responsible for the optical transitions) can be 
excited to high levels. These electrons make transitions back to their ground state, 
accompanied by the emission of photons in the visible region. Objects seen in 
ultraviolet light often show colors in the blue or violet end of the spectrum that 
are not present when the objects are viewed in sunlight. We can understand this 
effect by considering the composition of sunlight and the optical excited states of 
a hypothetical atom shown in Figure 8.6. The intensity of sunlight is concentrated 
in the center of the visible spectrum, in the yellow region; very little intensity 
is present in the red or blue ends of the visible spectrum. The “yellow” photons 
have enough energy to excite the hypothetical atom to levels 1 and 2 shown in 
Figure 8.6, but not enough to reach level 3 or 4. However, the higher-energy 
ultraviolet photons have sufficient energy to reach the higher levels, so the light 
emitted by the atom has a stronger blue component when that atom is excited by 
ultraviolet light than when excited by sunlight. 


FIGURE 8.6 Excited states of a hypothetical atom. Only excited states 1 and 
2 can be easily reached by exposure to sunlight; exposure to ultraviolet light 
populates state 4, which in turn populates state 3. Under ultraviolet light, a 
stronger blue or violet (413 nm) color is revealed than under sunlight. 


Example 8.4 





Calculate the energy difference between the 3d and 2p states in lithium, and 
compare with the corresponding energy difference in hydrogen. 

Solution 

From Figure 8.4, the wavelength of the photon emitted in the 3d to 2p 
transition is 610.4 nm. The energy difference is then 

\[ \Delta E = \frac{hc}{\lambda} = \frac{1240\ \text{eV}\cdot\text{nm}}{610.4\ \text{nm}} = 2.03\ \text{eV} \]

The energy difference between corresponding levels in hydrogen (Figure 6.20) 
is E_3 − E_2 = −1.51 eV − (−3.40 eV) = 1.89 eV. Due to electron screening, we 
expect the outer electron in lithium to behave similarly to the electron in 
hydrogen, so the energy differences are in rough agreement. 
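
The arithmetic of Example 8.4 can be reproduced with the short sketch below. 
The function names are ours. The value hc = 1240 eV·nm is the one used in the 
example, and the 670.8 nm wavelength of the lithium 2p to 2s transition is taken 
from the earlier discussion of Figure 8.4. 

```python
HC_EV_NM = 1240.0    # hc in eV·nm, as used in the example
RYDBERG_EV = 13.6    # magnitude of the hydrogen ground-state energy, in eV


def photon_energy_ev(wavelength_nm):
    """Photon energy hc / wavelength, in eV."""
    return HC_EV_NM / wavelength_nm


def hydrogen_level_ev(n):
    """Hydrogen energy level E_n = -13.6 eV / n^2."""
    return -RYDBERG_EV / n**2


lithium_3d_2p = photon_energy_ev(610.4)
hydrogen_3_2 = hydrogen_level_ev(3) - hydrogen_level_ev(2)
lithium_2p_2s = photon_energy_ev(670.8)

print(f"Li 3d -> 2p: {lithium_3d_2p:.2f} eV")
print(f"H  n=3 -> 2: {hydrogen_3_2:.2f} eV")
print(f"Li 2p -> 2s: {lithium_2p_2s:.2f} eV")
```

The last line offers a consistency check: the 2p to 2s photon energy should 
match the difference between the measured lithium energies of −3.54 eV and 
−5.39 eV quoted in Section 8.3. 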





8.4 PROPERTIES OF THE ELEMENTS 


In this section we briefly study the way our knowledge of atomic structure helps us 
to understand the physical and chemical properties of the elements. Our discussion 
is based on the following two principles: 


1. Filled subshells are normally very stable configurations. An atom with one 
electron beyond a filled shell will readily give up that electron to another atom 
to form a chemical bond. Similarly, an atom lacking one electron from a filled 
shell will readily accept an additional electron from another atom in forming 
a chemical bond. 

2. Filled subshells do not normally contribute to the chemical or physical 
properties of an atom. Only the electrons in the unfilled subshells need be 
considered. (X-ray energies, discussed in the next section, are an exception 
to this rule.) Sometimes only a single outer electron is the primary factor 
influencing the physical properties of an element. 


We consider a number of different physical properties of the elements, and try to 
understand those properties based on atomic theory. 




























































































1. Atomic Radii. The radius of an atom is not a precisely defined quantity, 
because the electron probability density determines the “size” of an atom. The 
radii are also difficult to define experimentally, and in fact different kinds of 
experiments may give different values for the radii. One way of defining the 
radius is by means of the spacing between the atoms in a crystal containing 
that element. Figure 8.7 shows how such typical atomic radii vary with Z. 

2. Ionization Energy. Table 8.3 gives the ionization energies of some of the 
elements, and Figure 8.8 shows the variation of ionization energy with atomic 
number Z. 

TABLE 8.3 Ionization Energies (in eV) of Neutral Atoms of Some Elements 

H    13.60      Ar   15.76 
He   24.59      K     4.34 
Li    5.39      Cu    7.72 
Be    9.32      Kr   14.00 
Ne   21.56      Rb    4.18 
Na    5.14      Au    9.22 

3. Electrical Resistivity. In bulk materials, an electric current flows when a 
potential difference (voltage) is applied across the material. The current i and 
voltage V are related according to the expression V = iR, where R is the 
electrical resistance of the material. If the material is uniform with length L 
and cross-sectional area A, then the resistance is 

\[ R = \rho\,\frac{L}{A} \tag{8.2} \]

The resistivity ρ is characteristic of the kind of material and is measured 
in units of Ω·m (ohm-meter). A good electrical conductor has a small 
resistivity (ρ = 1.7 × 10⁻⁸ Ω·m for copper); a poor conductor has a large 
resistivity (ρ = 2 × 10¹⁵ Ω·m for sulfur). From the atomic point of view, 
current depends on the movement of relatively loosely bound electrons, which 
can be removed from their atoms by the applied potential difference, and also 
on the ability of the electrons to travel from one atom to another. Thus 
elements with s electrons, which are the least tightly bound and which also 
travel farthest from the nucleus, are expected to have small resistivities. 
Figure 8.9 shows the variation of electrical resistivity with atomic number. 
4. Magnetic Susceptibility. When a material is placed in a magnetic field of 
intensity B, the material becomes “magnetized” and acquires a magnetization 
M, which for many materials is proportional to B: 


$$\mu_0 M = \chi B \qquad (8.3)$$


where χ is a dimensionless constant called the magnetic susceptibility.
(Materials for which χ > 0 are known as paramagnetic, and those for which
χ < 0 are called diamagnetic; materials that remain permanently magnetized
even when B is removed are known as ferromagnetic, and χ is undefined for
such materials.)

From the atomic point of view, the magnetism of atoms depends on the 
L and S of the electrons in unfilled subshells, because the atomic magnetic
moments μ_L and μ_S are proportional to L and S (recall Table 7.2). This
effect is responsible for paramagnetic susceptibilities and occurs in all atoms 
in which L or S is nonzero. Diamagnetism is caused by the following effect: 
when a varying magnetic field occurs in an area bounded by an electric 
circuit, an induced current flows in the circuit; the induced current sets up a 
magnetic field which tends to oppose the changes in the applied field (Lenz’s 
law). In the atomic physics case, the electric circuit is the circulating electron, 
and the induced current consists of a slight speeding up or slowing down of 
the electron in its orbit when a magnetic field is applied. This produces a
contribution to the magnetization of the material that is opposite to the applied
field B, and so the diamagnetic contribution to χ is negative.
Figure 8.10 shows the magnetic susceptibilities of the elements.

FIGURE 8.10 Magnetic susceptibilities of the elements.
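
As a numerical illustration of Eq. 8.2 from item 3 above, the following minimal sketch compares the resistance of two samples of identical shape. The resistivities are the values quoted in the text for copper and sulfur; the sample dimensions (1 m long, 1 mm² cross section) and the function name `resistance` are assumptions chosen only for illustration.

```python
def resistance(rho_ohm_m, length_m, area_m2):
    """Resistance R = rho * L / A (Eq. 8.2) of a uniform sample, in ohms."""
    return rho_ohm_m * length_m / area_m2


RHO_COPPER = 1.7e-8  # ohm*m, value quoted in the text
RHO_SULFUR = 2e15    # ohm*m, value quoted in the text

# Hypothetical sample geometry (not from the text): 1 m long, 1 mm^2 area.
length, area = 1.0, 1.0e-6

r_cu = resistance(RHO_COPPER, length, area)
r_s = resistance(RHO_SULFUR, length, area)
print(f"copper: {r_cu:.3g} ohm   sulfur: {r_s:.3g} ohm")
```

For the same geometry the two resistances differ by the same factor as the resistivities, roughly 23 orders of magnitude, which is why resistivity is such a sensitive probe of how loosely the outer electrons are bound.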


Just by examining Figures 8.7 to 8.10, you can see the remarkable regularities 
in the properties of the elements. Notice especially how similar the properties of 
the different sequences of elements are—for example, the electrical resistivity of 
the d-subshell elements or the magnetic susceptibility of the p-subshell elements. 
We now look at how the atomic structure is responsible for these properties. 


Inert Gases 


The inert gases occupy the last column of the periodic table. Because they have 
only filled subshells, the inert gases do not generally combine with other elements 
to form compounds; these elements are very reluctant to give up or to accept 
an electron. At room temperature they are monatomic gases. Their atoms don’t 
easily join together, so the boiling points are very low (typically −200°C). Their
ionization energies are much larger than those of neighboring elements, because 
of the extra energy needed to break open a filled subshell. 


p-Subshell Elements 


The elements of the column (group) next to the inert gases are the halogens 
(F, Cl, Br, I, At). These atoms lack one electron from a closed shell and have 
the configuration np⁵. A filled p subshell is a very stable configuration, so
these elements readily form compounds with other atoms that can provide an 
extra electron to complete the p subshell. The halogens are therefore extremely 
reactive. 

As we move across the series of six elements in which the p subshell is being 
filled, the atomic radius decreases. This “shrinking” occurs because the nuclear 
charge is increasing and pulling all of the orbits closer to the nucleus. Notice from 
Figure 8.7 that the halogens have the smallest radii within each p subshell series. 
(The ionic crystal radii of the inert gases are not known.) 

As we increase the nuclear charge, the p electrons also become more tightly
bound; Figure 8.8 shows how the ionization energy increases systematically as 
the p subshell is filled. 

From Figure 8.10 we see that each p subshell series is diamagnetic, with a 
characteristic negative magnetic susceptibility. 


s-Subshell Elements 


The elements of the first two columns (groups) are known as the alkalis
(configuration ns¹) and alkaline earths (ns²). The single s electron makes the alkalis quite
reactive. The alkaline earths are similarly reactive, in spite of the filled s subshell.
This occurs because the s electron wave functions can extend rather far from the
nucleus, where the electrons are screened (by Z − 2 other electrons) from the
nuclear charge and therefore not tightly bound. (Notice from Figure 8.7 that the
ns¹ and ns² configurations give the largest atomic radii, and from Figure 8.8 that
they have the smallest ionization energies.) For the same reasons, the ns¹ and ns²
elements are relatively good electrical conductors. From Figure 8.10 we see that
these elements are paramagnetic; for l = 0, there is no diamagnetic contribution
to the magnetism.


Transition Metals 


The three rows of elements in which the d subshell is filling (Sc to Zn, Y to Cd, Lu 
to Hg) are known as the transition metals. Many of their chemical properties are 
determined by the outer electrons—those whose wave functions extend furthest 
from the nucleus. For the transition metals, these are always s electrons, which have 
a larger mean radius than the d electrons. (Remember that the mean radius depends 
mostly on n; the s electrons of the transition metals have a larger n than the d 
electrons. For example, in the first row of transition metals, the 3d subshell is filling 
but the 4s subshell is already filled.) As the atomic number increases across the 
transition metal series, we add one d electron and one unit of nuclear charge; the net 
effect on the s electron is very small, because the additional d electron screens the 
s electron from the additional nuclear charge. Properties of the transition metals, 
which are in large part determined by the outermost electrons, can therefore be 
very similar, as the small variation in radius and ionization energy shows. 

The electrical resistivity of the transition metals shows two interesting features: 
a sharp rise at the center of the sequence, and a sharp drop near the end (Figure 8.9). 
The sharp drop near the end of the sequence indicates the small resistivity (large 
conductivity) of copper, silver, and gold. If we filled the d subshell in the expected 
sequence, copper would have the configuration 4s²3d⁹; however, the filled d
subshell is more stable than a filled s subshell, and so one of the s electrons
transfers to the d subshell, resulting in the configuration 4s¹3d¹⁰. This relatively
free, single s electron makes copper an excellent conductor. Silver (5s¹4d¹⁰) and
gold (6s¹5d¹⁰) behave similarly.

At the center of the sequence of transition metals there is a sharp rise in the 
resistivity; apparently a half-filled shell is also a stable configuration, and so Mn 
(3d⁵), Tc (4d⁵), and Re (5d⁵) have larger resistivities than their neighbors. A
similar rise in resistivity is seen at the center of the 4f sequence. 

The transition metals have similar paramagnetic susceptibilities, due to the 
large orbital angular momentum of the d electrons and also to the large number 
of d-subshell electrons that can couple their spin magnetic moments. These two 
effects are large enough to overcome the diamagnetism of the orbital motion. It is
the d electrons that are also responsible for the ferromagnetism of iron, nickel, and
cobalt. As soon as the d subshell is filled, however, the orbital and spin magnetic
moments no longer contribute to the magnetic properties (all of the m_l and m_s
values, positive as well as negative, are taken); for this reason, copper and zinc 
are diamagnetic, not paramagnetic like their transition metal neighbors. 


Lanthanides (Rare Earths) 


The lanthanide (or rare earth) elements are contained in the series of 14 elements 
from La to Yb; this series is usually drawn at the bottom of the periodic chart 
of the elements. The rare earths are rather similar to the transition metals in that 
an “inner” subshell (the 4f) is being filled after an “outer” subshell (the 6s) is
already filled. For the same reasons discussed above, the chemical properties of 
the rare earths should be rather similar, because they are determined mainly by 
the 6s electrons; the radii and ionization energies show that this is true. 

Because of the larger orbital angular momentum of f-subshell electrons (l = 3)
and also because of the larger number of f-subshell electrons (up to 14) that can 
align their spin magnetic moments, the paramagnetic susceptibilities of the rare 
earths are even larger than those of the transition metals. Even the ferromagnetism 
of the rare earths is substantially stronger than that of the iron group. Generally, 
we think of iron as the most magnetic of the elements. The internal magnetic field 
within a magnetized piece of iron is about 28 T. Magnetized holmium metal, a 
rare earth, has an internal magnetic field of 800 T, roughly 30 times that of iron! 
Most of the other rare earths have similar magnetic properties. (The rare earth 
metals do not reveal their ferromagnetic properties at room temperature, but must 
be cooled to lower temperatures. Holmium must be cooled to 20 K to reveal its 
ferromagnetic properties.) 


Actinides 


The actinide series of elements, which corresponds to the filling of the 5f subshell, 
is usually shown in the periodic table directly under the lanthanide series. These 
elements should have chemical and physical properties similar to those of the rare 
earths. Unfortunately, most of the actinide elements (those beyond uranium) are 
radioactive and do not occur in nature. They are artificially produced elements 
and are available only in microscopic quantities. We are thus unable to determine 
many of their bulk properties. 


8.5 INNER ELECTRONS: ABSORPTION EDGES AND X RAYS





Let’s imagine doing the Franck-Hertz experiment (see Section 6.6), in which we 
accelerate a beam of electrons that then passes through a chamber filled with 
mercury vapor. However, instead of using accelerating voltages in the range of 
10 V, we’ll use voltages in the range of 10⁵ V. Figure 8.11 shows the current
passing through the tube as a function of the accelerating voltage. A sudden 
drop in the current occurs at 83.1 kV. Low accelerating voltages correspond to 
interactions that push the outer electrons in the mercury atoms to higher excited
states (or ionize the atom). The drop in the current at 83.1 kV occurs when the 
mercury atom absorbs energy from the electron beam that ionizes the atom by 
knocking loose one of the tightly bound inner electrons. The binding energy of 
the inner electron in this case is 83.1 keV. 

A similar experiment can be done by passing a beam of X rays through a 
thin film of mercury and measuring the absorption of the photon intensity. If we 
are able to vary the wavelength of the X rays, the absorption as a function of 
wavelength might look like Figure 8.12. Photons are absorbed from the beam by 
the photoelectric effect, in which electrons are knocked loose from mercury atoms. 
As the photon wavelength is increased (or as the photon energy is decreased), 
we reach a point at which the photons do not have enough energy to produce at 
least one component of photoelectrons, and thus there is a sudden decrease in the 
photon absorption. The wavelength at which this occurs, 0.0149 nm, corresponds 
to an energy of 83.1 keV, in agreement with the value deduced from electron 
scattering (Figure 8.11). 
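
The correspondence between the edge wavelength and the edge energy quoted above can be checked with the photon relation E = hc/λ. The sketch below is only an illustration: the constant hc ≈ 1239.84 eV·nm is a standard value that is not quoted in this section, and the function names are hypothetical.

```python
HC_EV_NM = 1239.84  # h*c in eV*nm (standard rounded value; an assumption here)


def photon_energy_kev(wavelength_nm):
    """Photon energy in keV for a wavelength given in nm, from E = hc / lambda."""
    return HC_EV_NM / wavelength_nm / 1000.0


def photon_wavelength_nm(energy_kev):
    """Photon wavelength in nm for an energy given in keV."""
    return HC_EV_NM / (energy_kev * 1000.0)


# K absorption edge of mercury, using the numbers quoted in the text.
e_edge = photon_energy_kev(0.0149)
lam_edge = photon_wavelength_nm(83.1)
print(f"0.0149 nm -> {e_edge:.1f} keV")
print(f"83.1 keV  -> {lam_edge:.4f} nm")
```

The two results agree with the quoted pair (0.0149 nm, 83.1 keV) to within the rounding of the three-digit wavelength.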

The sudden drop in the electron current or in the photoelectron emission is 
called the absorption edge. It corresponds to the release of an inner electron from 
the atom. In the case of mercury, the most tightly bound (1s) electrons have 
a binding energy of 83.1 keV. In the electron scattering experiment, when the 
energy of the electrons in the beam exceeds 83.1 keV, the collision of an electron 
with a mercury atom can transfer an energy of 83.1 keV to the atom and result in 
the ejection of one of the 1s electrons. Similarly, when the photon energy exceeds 
83.1 keV (or when its wavelength is below 0.0149 nm), the photons can eject a 
photoelectron from the 1s level, but when the photon energy is below 83.1 keV 
that is not possible. 

As discussed in Section 8.3, the n = 1 level is also known as the K shell. So far
we have been discussing the K absorption edge in mercury, which corresponds
to the release of an electron from the K shell. It is also possible to release a less
tightly bound electron from the L shell (n = 2), in which case we would speak
of the L absorption edge. In mercury, the L absorption edge is about 14 keV. 
(Because of the fine-structure splitting, there are actually three different states in 
the L shell with slightly different energies.) 

Figure 8.13 shows the K absorption edges of the elements. There is a very
noticeable difference between the data shown in Figure 8.13 and those shown in
Figures 8.7 to 8.10: the K absorption edges show no evidence for any shell effects.
Instead, there is a smooth dependence on the atomic number over the entire range
of elements. As the nuclear charge increases, the 1s electrons are pulled into
smaller and more tightly bound orbits, but this is a gradual process that is largely
unaffected by the stacking of electrons into higher energy shells. There are no
sudden changes in the 1s properties as a higher shell is filled and a still higher
shell begins filling.

FIGURE 8.13 K absorption edges of the elements.


X-Ray Transitions 


X rays, as we discussed in Chapter 3, are electromagnetic radiations with
wavelengths from approximately 0.01 to 10 nm (energies from 100 eV to 100 keV). 
In Chapter 3 we discussed the continuous X-ray spectrum emitted by accelerated 
electrons. In this section we are concerned with the discrete X-ray line spectra 
emitted by atoms. 

X rays are emitted in transitions between the more tightly bound inner electron 
energy levels of an atom. Under normal conditions all of the inner shells of an 
atom are filled, so X-ray transitions do not occur between these levels. However, 
when we remove one of the inner electrons, such as by ejecting a K electron 
following electron scattering or a photoelectric process, an electron from a higher 
subshell will rapidly make a transition to fill that vacancy, emitting an X-ray 
photon in the process. The energy of the photon is equal to the energy difference 
of the initial and final atomic levels of the electron that makes the transition. 

When we remove a 1s electron, we are creating a vacancy in the K shell. The X
rays that are emitted in the process of filling this vacancy are known as K-shell X
rays, or simply K X rays. (These X rays are emitted in transitions that come from
the L, M, N, ... shells, but they are known by the vacancy that they fill, not by the
shell from which they originate.) The K X ray that originates with the n = 2 shell
(L shell) is known as the Kα X ray, and the K X rays originating from the M shell
are known as Kβ X rays. Figure 8.14 illustrates these transitions.

If the bombarding electrons or photons knock loose an electron from the L 
shell, electrons from higher levels will drop down to fill this vacancy. The photons 
emitted in these transitions are known as L X rays. The lowest-energy X ray of the 
L series is known as Lα, and the other L X rays are labeled in order of increasing
energy as shown in Figure 8.14. 

It is possible to have an L X ray emitted directly following the Kα X ray. A
vacancy in the K shell can be filled by a transition from the L shell, with the
emission of the Kα X ray. However, the electron that made the jump from the L
shell left a vacancy there, which can be filled by an electron from a higher shell, 
with the accompanying emission of an L X ray. 

In a similar manner, we label the other X-ray series by M, N, and so forth.
Figure 8.15 shows a sample X-ray spectrum emitted by silver. 


Moseley’s Law 


Let us consider in more detail the Kα X ray, which (as shown in Figure 8.14) is
emitted when an electron from the L shell drops down to fill a vacancy in the
K shell. An electron in the L shell is normally screened by the two 1s electrons,
and so it sees an effective nuclear charge of Z_eff = Z − 2. When one of those
1s electrons is removed in the creation of a K-shell vacancy, only the remaining
single 1s electron shields the L shell, and so Z_eff = Z − 1. (In this calculation, we
neglect the small screening effect of the outer electrons; their probability densities
are not zero within the L-shell orbits, but they are sufficiently small that their
effect on Z_eff can be neglected.) To a very good approximation, the Kα X ray
can thus be analyzed as a transition from the n = 2 level to the n = 1 level in a
one-electron atom with Z_eff = Z − 1. Using Eq. 6.38 for the Bohr atom, we can
find the energy of the Kα transition in an atom of atomic number Z:


$$\Delta E = E_2 - E_1 = (-13.6\ \text{eV})(Z-1)^2\left(\frac{1}{2^2} - \frac{1}{1^2}\right) = (10.2\ \text{eV})(Z-1)^2 \qquad (8.4)$$

FIGURE 8.15 Characteristic X-ray spectrum of silver, such as might
be produced by 30 keV electrons striking a silver target. The continuous
distribution is a bremsstrahlung spectrum.

Just as was the case for the K absorption edge, the energies of the Kα X
rays vary smoothly with atomic number and show no effects of atomic shells. If
we plot √ΔE as a function of Z, we expect to obtain a straight line with slope
(10.2 eV)^(1/2) = 3.19 eV^(1/2). Figure 8.16 is an example of such a plot. The measured
slope is 3.22 eV^(1/2), in excellent agreement with what is expected from Eq. 8.4. The
straight line intersects the x axis at a value very close to 1, as we expect from Eq. 8.4.
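
To make the expected straight line concrete, the following minimal sketch evaluates Eq. 8.4 and the quantity √ΔE for a few atomic numbers. The function name `k_alpha_energy_ev` and its `screening` parameter are hypothetical names introduced for illustration; the 13.6 eV constant and the default screening value of 1 are the ones used in Eq. 8.4.

```python
import math

RYDBERG_EV = 13.6  # hydrogen ground-state binding energy used in Eq. 8.4


def k_alpha_energy_ev(z, screening=1.0):
    """K-alpha energy in eV from the Bohr-model estimate of Eq. 8.4.

    The L-shell electron is treated as moving in a one-electron atom of
    effective charge Z_eff = z - screening and dropping from n = 2 to n = 1.
    """
    z_eff = z - screening
    return RYDBERG_EV * z_eff**2 * (1.0 / 1**2 - 1.0 / 2**2)


# sqrt(Delta E) should rise by the same amount for each unit step in Z.
for z in (11, 20, 30, 47):
    e = k_alpha_energy_ev(z)
    print(f"Z = {z:2d}  Delta E = {e / 1000:7.3f} keV  sqrt = {math.sqrt(e):7.2f} eV^(1/2)")

slope = math.sqrt(k_alpha_energy_ev(31)) - math.sqrt(k_alpha_energy_ev(30))
print(f"slope = {slope:.2f} eV^(1/2) per unit Z")
```

Because the model is purely hydrogen-like, the values it gives should be read as estimates to be compared with measurement, as the text does for the slope of Figure 8.16.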
This method gives us a powerful and direct way to determine the atomic 
number Z of an atom, as was first demonstrated in 1913 by the British physicist H. 
G. J. Moseley, who measured the Kα (and other) X-ray energies of the elements
and thus determined their atomic numbers. The dependence of the X-ray energies 
on Z given by Eq. 8.4 is known as Moseley’s law. Moseley was the first to 
demonstrate the type of linear relationship shown in Figure 8.16; such graphs 
are now known as Moseley plots. His discovery provided the first direct means 
of measuring the atomic numbers of the elements. Previously, the elements had 
been ordered in the periodic table according to increasing mass. Moseley found 
certain elements listed out of order, in which the element of higher Z happened to 
have the smaller mass (for example, cobalt and nickel or iodine and tellurium). 
He also found gaps corresponding to yet undiscovered elements; for example, the 
naturally radioactive element technetium (Z = 43) does not exist in nature and 
was not known at the time of Moseley’s work, but Moseley showed the existence 
of such a gap at Z = 43. 

The straight-line plot of Figure 8.16 is independent of our assumption regarding 
the exact value of the screening correction. That is, we could have written 
Z_eff = Z − k, where k is some unknown number, probably close to 1. The only
change in our plot would be in the intercept. We would still have a straight line 
with the same slope. 
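
This statement can be verified by taking the square root of Eq. 8.4 with a general screening constant k in place of 1. The short derivation below is a sketch under the same Bohr-model assumption as Eq. 8.4, with the rounded slope 3.19 eV^(1/2) taken from the discussion of Figure 8.16.

```latex
\begin{align}
\Delta E &= (10.2\ \text{eV})\,(Z - k)^2 \\
\sqrt{\Delta E} &= (3.19\ \text{eV}^{1/2})\,(Z - k)
                 = (3.19\ \text{eV}^{1/2})\,Z - (3.19\ \text{eV}^{1/2})\,k
\end{align}
```

The coefficient of Z does not contain k, so the slope is unchanged; k appears only in the constant term, and the line crosses the Z axis at Z = k.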

Moseley’s work was of great importance in the development of atomic physics.
Working in the same year as Rutherford and Bohr, Moseley not only provided
confirmation of the Rutherford-Bohr model, but he also demonstrated a direct link
between atomic structure and the periodic table, which was previously a rather
arbitrary ordering scheme of the elements but subsequently became a classification
based on their electronic configurations.

Henry G. J. Moseley (1887-1915, England). His work on X-ray spectra
provided the first link between the chemical periodic table and atomic physics,
but his brilliant career was cut short when he died on a World War I
battlefield.


Example 8.5

Compute the energy of the Kα X ray of sodium (Z = 11).

Solution

The energy can be found with the help of Eq. 8.4,

ΔE = (10.2 eV)(Z − 1)² = (10.2 eV)(10)² = 1.02 keV

The measured value is 1.04 keV. The small discrepancy may be due to the
screening correction in Z_eff, which is not exactly equal to 1.








Example 8.6

Some measured X-ray energies in silver (Z = 47) are
ΔE(Kα) = 21.990 keV and ΔE(Kβ) = 25.145 keV. The
binding energy of the K electron in silver is E(K) =
25.514 keV. From these data, find: (a) the energy of the Lα
X ray, and (b) the binding energy of the L electron.

Solution

(a) From Figure 8.14, we see that the energies are
related by:

ΔE(Lα) + ΔE(Kα) = ΔE(Kβ)

or

ΔE(Lα) = ΔE(Kβ) − ΔE(Kα)
       = 25.145 keV − 21.990 keV = 3.155 keV

(b) Again from Figure 8.14, we see that

ΔE(Kα) = E(L) − E(K)

or

E(L) = E(K) + ΔE(Kα)
     = −25.514 keV + 21.990 keV = −3.524 keV

The binding energy of the L electron is therefore
3.524 keV.
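
The bookkeeping in this example can be laid out as a small energy-level calculation. The sketch below uses only the three measured silver values quoted in the example and assumes, as the example's relation among the X-ray energies implies, that Kα is the L to K transition, Kβ is M to K, and Lα is M to L. The variable names are illustrative.

```python
# Measured values for silver quoted in the example, in keV.
DE_K_ALPHA = 21.990   # L -> K transition energy
DE_K_BETA = 25.145    # M -> K transition energy
BINDING_K = 25.514    # binding energy of a K electron

# Level energies are negative; the binding energy is their magnitude.
level_k = -BINDING_K
level_l = level_k + DE_K_ALPHA   # the L level lies DE_K_ALPHA above K
level_m = level_k + DE_K_BETA    # the M level lies DE_K_BETA above K

de_l_alpha = level_m - level_l   # (a) L-alpha: M -> L
binding_l = -level_l             # (b) binding energy of an L electron

print(f"L-alpha energy:   {de_l_alpha:.3f} keV")
print(f"L binding energy: {binding_l:.3f} keV")
```

Working with level energies rather than individual differences makes it easy to read off any other combination from the same data.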


*8.6 ADDITION OF ANGULAR MOMENTA 


The properties of an alkali atom such as sodium are determined primarily by the 
single outer electron; if that electron has quantum numbers (n, l, m_l, m_s) then the
entire atom behaves as if it had those same quantum numbers. In atoms with
several electrons outside of filled subshells, this is not the case. For example,
the electronic configuration of carbon (Z = 6) is 1s²2s²2p². To find the angular
momentum of carbon, we must combine the angular momenta of the two 2p
electrons to find the total orbital angular momentum quantum number L and total
magnetic quantum number M_L that characterize the entire atom.





*This is an optional section that may be skipped without loss of continuity. 

Suppose we have an atom with two electrons outside of filled subshells. These
electrons have quantum numbers (n_1, l_1, m_l1, m_s1) and (n_2, l_2, m_l2, m_s2). The total
orbital angular momentum of the atom is determined by the vector sum of the 
orbital angular momenta of the two electrons: 


$$\vec{L} = \vec{L}_1 + \vec{L}_2 \qquad (8.5)$$


Each vector is related to its corresponding angular momentum quantum 
number by 


$$|\vec{L}_1| = \sqrt{l_1(l_1+1)}\,\hbar \qquad |\vec{L}_2| = \sqrt{l_2(l_2+1)}\,\hbar \qquad |\vec{L}| = \sqrt{L(L+1)}\,\hbar \qquad (8.6)$$


These vectors do not add like ordinary vectors, but have special addition rules 
associated with quantized angular momentum. These rules enable us to find L and 
its associated magnetic quantum number M_L.


1. The maximum value of the total orbital angular momentum quantum 
number is 


$$L_{\max} = l_1 + l_2 \qquad (8.7)$$


2. The minimum value of the total orbital angular momentum quantum 
number is 


$$L_{\min} = |l_1 - l_2| \qquad (8.8)$$


3. The permitted values of L range from L_min to L_max in integer steps:


$$L = L_{\min},\ L_{\min}+1,\ L_{\min}+2,\ \ldots,\ L_{\max} \qquad (8.9)$$


4. The z component of the total angular momentum vector is found from the sum
of the z components of the individual vectors:


$$L_z = L_{1z} + L_{2z} \qquad (8.10)$$

or, in terms of the magnetic quantum numbers,

$$M_L = m_{l1} + m_{l2} \qquad (8.11)$$


The permitted values of the total magnetic quantum number M_L range from
−L to +L in integer steps:


$$M_L = -L,\ -L+1,\ \ldots,\ -1,\ 0,\ +1,\ \ldots,\ L-1,\ L \qquad (8.12)$$


An identical set of rules holds for coupling the spin angular momentum vectors 
to give the total spin angular momentum S. For two electrons, each of which has 
s = 1/2, the total spin quantum number S can be 0 or 1.

All filled subshells have L = 0 and S = 0, so we don’t need to consider filled 
subshells in analyzing the angular momentum of an atom. For this reason, filled 
subshells ordinarily do not contribute to the magnetic properties of atoms. 

For coupling more than two electrons, the procedure is first to couple the 
angular momenta of two electrons to give the maximum and minimum values of 
L. Then couple each allowed L to the angular momentum of the third electron to 
find the largest maximum and smallest minimum. This continues for all of the 
electrons in the unfilled subshell. 
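
The coupling rules above (Eqs. 8.7 to 8.9, applied repeatedly for more than two electrons) can be sketched in a few lines. The function names `couple_two` and `couple_many` are hypothetical, and `Fraction` is used only so that half-integer spins stay exact. Note that this sketch lists every value permitted by the addition rules alone; it does not apply the Pauli principle, which may remove some combinations.

```python
from fractions import Fraction


def couple_two(j1, j2):
    """Allowed totals for two angular momenta: |j1 - j2| up to j1 + j2
    in integer steps (Eqs. 8.7 to 8.9)."""
    j1, j2 = Fraction(j1), Fraction(j2)
    low, high = abs(j1 - j2), j1 + j2
    return [low + step for step in range(int(high - low) + 1)]


def couple_many(values):
    """Couple the first two quantum numbers, then couple every allowed
    total to the next one, and so on; return the sorted allowed totals."""
    totals = {Fraction(values[0])}
    for j in values[1:]:
        totals = {t for prev in totals for t in couple_two(prev, j)}
    return sorted(totals)


half = Fraction(1, 2)
print("two p electrons,   L:", [str(x) for x in couple_many([1, 1])])
print("three p electrons, L:", [str(x) for x in couple_many([1, 1, 1])])
print("two spins,         S:", [str(x) for x in couple_many([half, half])])
print("three spins,       S:", [str(x) for x in couple_many([half] * 3)])
```

The same routine serves for orbital and spin quantum numbers because, as stated above, an identical set of rules holds for both.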

Example 8.7

Find the total orbital and spin quantum numbers for
carbon.

Solution

Carbon has two 2p electrons outside filled subshells. Each
of these electrons has l = 1. According to the rules for
adding angular momenta, we have

$$L_{\max} = 1 + 1 = 2, \qquad L_{\min} = |1 - 1| = 0$$

Thus L = 0, 1, or 2. For the spin angular momentum, we
have

$$S_{\max} = \tfrac{1}{2} + \tfrac{1}{2} = 1, \qquad S_{\min} = \left|\tfrac{1}{2} - \tfrac{1}{2}\right| = 0$$

and so S = 0 or 1. Some combinations of L and S might
be forbidden by the Pauli principle. For example, to obtain
L = 2, the two electrons must both have m_l = +1. The two
electrons must therefore have different values of m_s, so
S = 1 is not allowed when L = 2.





Example 8.8

Find the total orbital and spin quantum numbers for
nitrogen.

Solution

Nitrogen has three 2p electrons, each with l = 1, outside
filled subshells. If we add the first two, we get L_max = 2 and
L_min = 0, as in Example 8.7, so that L = 0, 1, or 2. We now
couple the third l = 1 electron to each of these values to find
the largest maximum and smallest minimum, which give

$$L_{\max} = 2 + 1 = 3, \qquad L_{\min} = |1 - 1| = 0$$

and so L = 0, 1, 2, or 3. For the spin vectors, we again
couple the first two to give S_max = 1 and S_min = 0. Adding
the third s = 1/2 electron, we have

$$S_{\max} = 1 + \tfrac{1}{2} = \tfrac{3}{2}, \qquad S_{\min} = \left|0 - \tfrac{1}{2}\right| = \tfrac{1}{2}$$

The resulting values of S are 1/2 and 3/2 (from the minimum
to the maximum in integer steps). Once again, the
Pauli principle may forbid certain combinations of L and
S. The state with L = 3 cannot exist at all, because all three
electrons must have m_l = +1, and assigning m_s quantum
numbers will then result in two electrons with the same m_l
and m_s, which is forbidden by the Pauli principle.



The two 2p electrons of carbon can combine to give L = 0, 1, or 2 and S = 0 or 
1. The ground state of carbon will be identified by only one particular choice of L 
and S. How do we know which of these combinations will be the ground state? The 
rules for finding the ground state quantum numbers are known as Hund’s rules: 


1. First find the maximum value of the total spin magnetic quantum number M_S
consistent with the Pauli principle. Then


$$S = M_{S,\max} \qquad (8.13)$$


2. Next, for that M_S, find the maximum value of M_L consistent with the Pauli
principle. Then


$$L = M_{L,\max} \qquad (8.14)$$


In the case of carbon, the maximum value of M_S is +1, obtained when the two
valence electrons both have m_s = +1/2. Thus S = 1. With only two electrons in
the 2p subshell, the Pauli principle places no restrictions on S; in fact, three electrons
in the 2p subshell can be assigned m_s = +1/2. Our next task is to find the maximum
value of M_L. The maximum value of m_l for the first p electron is +1. The second
p electron cannot also have m_l = +1, because that would give both electrons the
same set of quantum numbers, in violation of the Pauli principle. The maximum
value of m_l for the second electron is 0, so M_L,max = +1 and L = 1. The ground
state of carbon is therefore characterized by S = 1 and L = 1.





Example 8.9

Use Hund’s rules to find the ground-state quantum numbers
of nitrogen.

Solution

The electronic configuration of nitrogen is 1s²2s²2p³. We
begin by maximizing the total M_S for the three 2p electrons.
Three electrons in the p subshell are permitted by
the Pauli principle to have m_s = +1/2, so the maximum
value of M_S is 3/2, and therefore S is 3/2. Each of the three
electrons has quantum numbers (2, 1, m_l, +1/2). To maximize
M_L we assign the first electron the maximum value
of m_l, namely +1. The maximum value of m_l left for
the second electron is 0, and the third electron must therefore
have m_l = −1. The total M_L is 1 + 0 + (−1) = 0, so
L = 0. Thus L = 0, S = 3/2 are the ground-state quantum
numbers for nitrogen.





Example 8.10

Find the ground-state L and S of oxygen (Z = 8).

Solution

The electronic configuration of oxygen is 1s²2s²2p⁴.
Because only three electrons in the p subshell can
have m_s = +1/2, the fourth must have m_s = −1/2, so
M_S,max = 1/2 + 1/2 + 1/2 + (−1/2) = +1, and it follows
that S = 1. To find L, we note that, as for nitrogen, the three
electrons with m_s = +1/2 have m_l = +1, 0, and −1, and
we maximize M_L by giving the fourth electron m_l = +1.
Thus M_L,max = +1, and L = 1.
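
The procedure used in the last three examples can be written as a short routine. The sketch below is one possible way to apply the two rules stated in Eqs. 8.13 and 8.14 to N equivalent electrons in a single subshell: it fills the spin-up slots first, starting from the largest m_l, and only then the spin-down slots. The function name `hund_ground_state` is hypothetical, and the routine covers only the S and L rules given in this section.

```python
from fractions import Fraction


def hund_ground_state(l, n_electrons):
    """Return (S, L) for n_electrons equivalent electrons in a subshell
    with orbital quantum number l, using Eqs. 8.13 and 8.14."""
    capacity = 2 * (2 * l + 1)
    if not 0 <= n_electrons <= capacity:
        raise ValueError("subshell cannot hold that many electrons")

    half = Fraction(1, 2)
    ml_values = range(l, -l - 1, -1)  # +l, +l-1, ..., -l
    # Each (m_l, m_s) slot may be used once (Pauli principle). Taking the
    # spin-up slots first maximizes M_S; taking large m_l first maximizes M_L.
    slots = [(ml, half) for ml in ml_values] + [(ml, -half) for ml in ml_values]

    m_s_total = sum((ms for _, ms in slots[:n_electrons]), Fraction(0))
    m_l_total = sum(ml for ml, _ in slots[:n_electrons])
    return m_s_total, m_l_total  # S = M_S,max and L = M_L,max


for name, n in (("carbon", 2), ("nitrogen", 3), ("oxygen", 4)):
    s, big_l = hund_ground_state(1, n)
    print(f"{name:8s} 2p^{n}:  S = {s}, L = {big_l}")
```

Calling the function with l = 1 and 2, 3, or 4 electrons reproduces the carbon, nitrogen, and oxygen results worked out above.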





Let us look now at the energy levels of helium. The ground-state configuration
of helium is 1s². Both electrons are s electrons, with l = 0, and so the only possible
value of L is zero. Because both electrons have m_l = 0, the Pauli principle requires
that the spin of the two electrons be opposite, so that one has m_s = +1/2 and the
other has m_s = −1/2. The only possible total M_S is therefore zero, so the ground
state of helium has L = 0 and S = 0. The first excited state has configuration
1s¹2s¹. Both electrons still have l = 0, so we must again have L = 0. However,
the total spin S can now be 0 or 1, because the Pauli principle does not restrict m_s
in this case: the two electrons already have different principal quantum numbers
n, and so there is nothing to prevent them from having the same m_s. There are,
therefore, two “first excited states” of helium, one with L = 0 and S = 0, and
another with L = 0 and S = 1. (Both of these states have configuration 1s¹2s¹.)
A state with S = 0 is called a singlet state (because there is only a single possible
M_S value), and a state with S = 1 is called a triplet state (because there are three
possible M_S values: +1, 0, −1).

The classification of states into singlet and triplet is important when we 
consider the selection rules for transitions between states; these selection rules 
tell us which transitions are allowed (and therefore likely) to occur and which are 
not. The selection rules, which involve both L and S, are 


$$\Delta L = 0, \pm 1 \qquad (8.15)$$

$$\Delta S = 0 \qquad (8.16)$$


(There are no selection rules for n.) Of course, the selection rule Δl = ±1 for the
single electron that makes the transition still applies. For the two 1s¹2s¹ states
of helium, the Δl rule does not permit either state to make transitions to the 1s²
ground state (2s to 1s would be Δl = 0), and in addition, the ΔS rule forbids the
triplet (S = 1) states from decaying to the S = 0 ground state. These transitions
can thus occur only by violating these selection rules. Because that is a very
unlikely event, the transitions occur with very low probability. Energy levels that
have a low probability of decay must “live” for a long time before they decay;
such states are known as metastable states.

FIGURE 8.17 Energy-level diagram for helium. The states are grouped
into singlets (S = 0) and triplets (S = 1). Some of the transitions in
the optical and ultraviolet regions are shown. Transitions marked with
an X would violate the Δl = ±1 selection rule.

Figure 8.17 shows the energy levels and transitions in helium. The singlet 
and triplet levels are grouped separately, because transitions between singlet and 
triplet levels would violate the ΔS = 0 selection rule.
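
The three conditions discussed above (the rules on the total L and S, plus the single-electron rule on l) can be combined into one check. The sketch below is illustrative only: the `State` record and the function `transition_allowed` are hypothetical names, and the 1s2p states are included as an assumed example of levels that can decay; their L and S values follow from the coupling rules of this section.

```python
from collections import namedtuple

# L and S are the totals for the atom; l is the orbital quantum number of
# the single electron that changes its state in the transition.
State = namedtuple("State", ["label", "L", "S", "l"])


def transition_allowed(initial, final):
    """Apply the rules stated in the text: Delta L = 0 or +-1, Delta S = 0,
    and Delta l = +-1 for the electron that makes the transition."""
    delta_big_l = final.L - initial.L
    delta_s = final.S - initial.S
    delta_l = final.l - initial.l
    return abs(delta_big_l) <= 1 and delta_s == 0 and abs(delta_l) == 1


ground = State("1s2  singlet", L=0, S=0, l=0)
s2s_singlet = State("1s2s singlet", L=0, S=0, l=0)
s2s_triplet = State("1s2s triplet", L=0, S=1, l=0)
s2p_singlet = State("1s2p singlet", L=1, S=0, l=1)  # assumed example state
s2p_triplet = State("1s2p triplet", L=1, S=1, l=1)  # assumed example state

pairs = [
    (s2s_singlet, ground),
    (s2s_triplet, ground),
    (s2p_singlet, ground),
    (s2p_triplet, s2s_triplet),
    (s2p_triplet, ground),
]
for initial, final in pairs:
    verdict = "allowed" if transition_allowed(initial, final) else "forbidden"
    print(f"{initial.label} -> {final.label}: {verdict}")
```

Both 1s2s states come out as forbidden to decay to the ground state, which is the sense in which the text calls them metastable.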

Figure 8.18 shows the energy-level diagram of carbon. Notice the increasing
complexity of the diagram, compared with the alkali metals and even with
helium. This follows from the coupling of two electrons, both of whose l values
may be different from zero. We have already discussed how the 2p² configuration
can give L = 0, 1, or 2 and S = 0 or 1. Only one of these (L = 1, S = 1) is
the ground state of carbon; the others are excited states. More excited states
can be obtained by promoting one of the 2p electrons to a higher level, giving
configurations of 2p¹3s¹ (L = 1; S = 0 or 1), 2p¹3p¹ (L = 0, 1, or 2; S = 0 or
1), 2p¹3d¹ (L = 1, 2, or 3; S = 0 or 1), and so forth. Imagine the difficulty of
analyzing the energy level diagram of the rare earths or actinides, which have f
subshells (l = 3) with as many as 14 electrons!

FIGURE 8.18 Energy-level diagram for carbon. Each group of levels is
labeled with the electron configuration. Each individual level is labeled
with the total L and S.


*8.7 LASERS


There are three means by which radiation can interact with the energy levels of
atoms (depicted in Figure 8.19). The first two we have already discussed. In the
first kind of interaction, an atom in an excited state makes a transition to a lower
state, with the emission of a photon. (In all the examples we consider here, the
photon energy is equal to the energy difference of the two atomic states.) This is
spontaneous emission, which we represent as


atom* → atom + photon


where the asterisk indicates an excited state. 

The second interaction, induced absorption, is responsible for absorption 
spectra and resonance absorption. An atom in the ground state absorbs a photon 
(of the proper energy) and makes a transition to an excited state. Symbolically: 


atom + photon → atom*


The third interaction, which is responsible for the operation of the laser, is 
induced (or stimulated) emission. In this process, an atom is initially in an excited 
state. A passing photon of just the right energy (again, equal to the energy 
difference of the two levels) induces the atom to emit a photon and make a 
transition to the lower, or ground, state. (Of course, it would eventually have 
made that transition left on its own, but it makes it sooner after being prodded by 
the passing photon.) Symbolically, 


atom* + photon → atom + 2 photons


The significant detail is that the two photons that emerge are traveling in exactly the 
same direction with exactly the same energy, and the associated electromagnetic 
waves are perfectly in phase (coherent). 

Suppose we have a collection of atoms, all in the same excited state, as shown 
in Figure 8.20. A photon passes the first atom, causing induced emission and 
resulting in two photons. Each of these two photons causes an induced emission 
process, resulting in four photons. This process continues, doubling the number 
of photons at each step, until we build up an intense beam of photons, all coherent 
and moving in the same direction. In its simplest interpretation, this is the basis 
of operation of the laser. (The word laser is an acronym for Light Amplification
by Stimulated Emission of Radiation.) 
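
The doubling argument above can be put into numbers with a short sketch. It assumes the idealized picture just described, in which every photon induces exactly one emission per step and no photons are lost to absorption or spontaneous emission; the function name `photons_after` is a hypothetical helper, not something defined in the text.

```python
def photons_after(steps: int, initial: int = 1) -> int:
    """Photon count after `steps` rounds of induced emission.

    Idealized assumption: each photon meets one excited atom per round
    and leaves as two photons, so the count doubles every round.
    """
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return initial * 2 ** steps


# Growth of the beam over a few rounds of induced emission.
for n in (1, 2, 10, 20):
    print(n, photons_after(n))
```

Even in this simplified model the count grows exponentially with the number of steps, which is why a single initial photon can in principle seed an intense beam.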

This simple model for a laser will not work, for several reasons. First, it
is difficult to keep a collection of atoms in their excited states until they are
stimulated to emit the photon (we don’t want any spontaneous emission). A
second reason is that atoms that happen to be in their ground state undergo
absorption and thus remove photons from the beam as it builds up.

FIGURE 8.19 Interactions of radiation with atomic energy levels.

FIGURE 8.20 Buildup of intense beam in a laser. Each emitted photon interacts
with an excited atom and produces two photons.

FIGURE 8.21 A three-level atom.

FIGURE 8.22 A four-level atom. [Diagram labels: short-lived state, metastable
state, lasing transition, pumping, short-lived state, ground state.]

To solve these problems, we must achieve a population inversion—in a 
collection of atoms, there must be more atoms in the upper state than in the lower 
state. This is called an “inversion” because under normal conditions at thermal 
equilibrium, the lower state always has the greater population. The “inversion” is 
thus an unnatural situation that must be achieved by artificial means, because it is 
essential for the operation of the laser. 
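
A rough estimate shows how strongly thermal equilibrium favors the lower state. The sketch below assumes the standard Boltzmann factor, N_upper/N_lower = exp(−ΔE/kT), with equal degeneracies for the two levels; both the formula and the illustrative numbers (a 1.96 eV gap, roughly the photon energy of red visible light, at 300 K) are assumptions brought in for illustration rather than values taken from this section.

```python
import math

K_B_EV_PER_K = 8.617e-5  # Boltzmann constant in eV/K


def boltzmann_ratio(delta_e_ev: float, temperature_k: float) -> float:
    """Equilibrium population ratio N_upper / N_lower for two levels
    separated by delta_e_ev, assuming equal degeneracies."""
    if temperature_k <= 0:
        raise ValueError("temperature must be positive")
    return math.exp(-delta_e_ev / (K_B_EV_PER_K * temperature_k))


# Illustrative values (assumed): an optical transition at room temperature.
ratio = boltzmann_ratio(delta_e_ev=1.96, temperature_k=300.0)
print(f"N_upper / N_lower = {ratio:.2e}")
```

For any positive temperature the ratio is below 1, so heating alone cannot produce an inversion; the upper level has to be populated by some nonthermal means.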

The first laser, which was constructed by T. H. Maiman in 1960, was based on a
three-level atom (Figure 8.21). The laser medium is a solid ruby rod, in which the 
chromium atoms are responsible for the action of the laser. The atoms, originally 
in the ground state, are “pumped” into the excited state by an external source 
of energy (a burst of light from a flash lamp that surrounds the ruby rod). The 
excited state decays very rapidly (by spontaneous emission) to a lower excited 
state, which is a metastable state: the atom remains in that level for a relatively
long time, perhaps 10⁻³ s, compared with 10⁻⁸ s for the short-lived states. The
transition from the metastable state to the ground state is the “lasing” transition, 
resulting from stimulated emission by a passing photon. 

If the pumping action is successful, there are more atoms in the metastable 
state than in the ground state, and we have achieved a population inversion. 
However, as the lasing transition occurs, the population of the ground state is 
increased, thereby upsetting the population inversion. This excess of population 
in the ground state allows absorption of the lasing transition, thereby removing 
photons that might contribute to the lasing action. 

The four-level laser illustrated in Figure 8.22 relieves this remaining difficulty. 
The ground state is pumped to an excited state that decays rapidly to the 
metastable state, as with the three-level laser. The lasing transition proceeds from 
the metastable state to yet another excited state, which in turn decays rapidly to 
the ground state. The atom in its ground state thus cannot absorb at the energy of 
the lasing transition, and we have a workable laser. Because the lower short-lived 
state decays rapidly, its population is always smaller than that of the metastable 
state, which maintains the population inversion. 

A common example of the four-level laser is the familiar helium-neon laser, 
which operates with a mixture of helium and neon gas (about 90% helium). The 
important energy levels of He and Ne are shown in Figure 8.23. An electrical 
current in the gas “pumps” the helium from its ground state to the excited state 
at an energy of about 20.6 eV. This is a metastable state of helium: the atom
remains in that state for a relatively long time because a 2s electron is not permitted
to return to the 1s level by photon emission. Occasionally, an excited helium
atom collides with a ground-state neon atom. When this occurs, the 20.6 eV of
excitation energy may be transferred to the neon atom, because neon happens to 
have an excited state at 20.6 eV, and the helium atom returns to its ground state. 
Symbolically, 


helium* + neon → helium + neon*


where the excited state is indicated by the asterisk. The excited state of neon 
corresponds to removing one electron from the filled 2p subshell and promoting 
it to the 5s subshell. From there it decays to the 3p level and eventually returns to 
the 2p ground state. Figure 8.23 illustrates this sequence of events and the level 
schemes. (The level shown with a dashed line, the neon 3s level, is not important
for the basic operation of the laser, but it is necessary as an intermediate step in
the return to the neon ground state, because the Δl = 0 transition 3p → 2p is not
allowed, but the sequence 3p → 3s → 2p is permitted.)

FIGURE 8.23 Sequence of transitions in a He-Ne laser.

At any given time, there are more neon atoms in the 5s state than in the 3p 
state, because the good energy matchup of the 5s state with the helium excited 
state gives a high probability of the 5s state in neon being excited. The 3p state, 
on the other hand, decays rapidly. This provides the population inversion that is 
needed for the laser. 

In the helium-neon laser, the gases are enclosed in a narrow tube (Figure 8.24). 
Occasionally a neon atom in the 5s state spontaneously emits a photon (at a 
wavelength of 632.8 nm) parallel to the axis of the tube. This photon causes
stimulated emission by other atoms, and a beam of coherent (in-phase) radiation 
eventually builds up traveling along the tube axis. Mirrors are carefully aligned at
the ends of the tube to help in the formation of the coherent wave, as it bounces
back and forth between the two ends of the tube, causing additional stimulated
emission. One of the mirrors is only partially silvered, allowing a portion of the
beam to escape through one end.

FIGURE 8.24 Schematic diagram of a He-Ne laser.

The laser is not a particularly efficient device. The small helium-neon lasers
you have probably seen used for laboratory or demonstration experiments have
a light output of perhaps a few milliwatts; the electric power required to operate
such a device may be of the order of 10 to 100 W, and thus the efficiency
(power out ÷ power in) of such a device is only about 10⁻⁴ to 10⁻⁵. It is the
coherence and directionality of the laser beam and its energy density that make
the laser such a useful device: its power can be concentrated in a beam only
a few millimeters in diameter, and thus even a small laser can deliver 100 to
1000 W/m². Larger lasers in the megawatt (10⁶ W) range are presently readily
available, and research laboratories are using lasers in the 100 terawatt (10¹⁴ W)
range for special applications. These powerful lasers do not operate continuously,
but are instead pulsed, producing short (perhaps 10⁻⁹ s) pulses at rates of order
100 Hz. (Such a pulse is, in fact, an excellent example of a wave packet.) 
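
The efficiency and intensity estimates quoted above are easy to check. The sketch below uses illustrative values that are assumptions, not data from the text: a 1 mW beam, a 2 mm beam diameter, and 10 W of electrical input. The helper names `beam_intensity` and `efficiency` are likewise hypothetical.

```python
import math


def beam_intensity(power_w: float, diameter_m: float) -> float:
    """Average intensity (W/m^2) of a beam of given power, assuming the
    power is spread uniformly over a circular cross section."""
    radius_m = diameter_m / 2
    return power_w / (math.pi * radius_m ** 2)


def efficiency(power_out_w: float, power_in_w: float) -> float:
    """Ratio of optical output power to electrical input power."""
    return power_out_w / power_in_w


# Assumed illustrative values: 1 mW output, 2 mm beam, 10 W electrical input.
intensity = beam_intensity(power_w=1.0e-3, diameter_m=2.0e-3)
eff = efficiency(power_out_w=1.0e-3, power_in_w=10.0)
print(f"intensity = {intensity:.0f} W/m^2, efficiency = {eff:.0e}")
```

With these inputs the intensity falls inside the 100 to 1000 W/m² range and the efficiency is of order 10⁻⁴, consistent with the estimates in the paragraph above.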
Questions


1. Continue Figure 8.1 upward, showing the next two major groups. What will be
the atomic number of the next inert gas below Rn? What will be the structure of
the eighth row (period) of the periodic table? Where do you expect the first g
subshell to begin filling? What properties would you expect the g-subshell
elements to have? What will be the atomic number of the second inert gas below
radon?

2. Why do the 4s and 3d subshells appear so close in energy, when they belong to
different principal quantum numbers n?

3. Would you expect element 107 to be a good conductor or a poor conductor? How
about element 111? Do you expect element 112 to be paramagnetic or diamagnetic?

4. Zirconium frequently is present as an impurity in hafnium metal. Why?

5. Do you expect ytterbium (Yb) to become ferromagnetic at sufficiently low
temperatures? What type of magnetic behavior would be expected at ordinary
temperatures for polonium (Po)? For francium (Fr)?

6. As we move across the series of transition metal or rare earth elements, we
add electrons to the d or f subshells. In chemical compounds, these elements
often show valence states of +2, which correspond to removing two s electrons.
Explain this apparent paradox.


7. Why do the rare earth (lanthanide) elements have such similar chemical
properties? What property might you use to distinguish lanthanide atoms from one
another?

8. Explain why the Bohr theory gives a poor accounting of optical transitions but
does well in predicting the energies of X-ray transitions.

9. What can you conclude about the electronic configuration of an atom that has
both L = 0 and S = 0 in the ground state?

10. Suppose we do a Stern-Gerlach experiment using an atom that has angular
momentum quantum numbers L and S in its ground state. Into how many components
will the beam split? Do you expect them to be equally spaced?

11. What is the degeneracy of a state of total orbital angular momentum L that has
S = 0? What is the degeneracy of a state of total spin angular momentum S that has
L = 0? What is the total degeneracy of a state in which both L and S are nonzero?

12. What L and S values must an atom have in order to show the normal Zeeman
effect? Does this apply only to the ground state or to excited states also? Can an
atom show the normal Zeeman effect in some transitions and the anomalous Zeeman
effect in other transitions? Could the same atom even show no Zeeman effect at all
in some transitions?

13. Based on the rules for coupling electron l and s values to give the total L
and S, explain why filled subshells don’t contribute to the magnetic properties of
an atom.

14. If an atom in its ground state has S = 0, can you infer whether it has an even
or an odd number of electrons? What if L = 0?

15. The L atomic shell actually contains three distinct levels: a 2s level and two
2p levels (a fine-structure doublet). If we look carefully at the Kα X ray under
high resolution, we see two, not three, different components. Explain this
discrepancy.

16. The Kα energies computed using Eq. 8.4 are about 0.1% low for Z = 20, 1% low
for Z = 40, and 10% low for Z = 80. Why does the simple theory fail for large Z?
Could it be because the screening effect has not been handled correctly and that
Z_eff is not Z − 1? If not, can you suggest an alternative reason?

17. The first excited state in sodium is a fine-structure doublet; the wavelengths
emitted in the decay of these states are 589.59 nm and 589.00 nm, a difference of
0.59 nm. The excited 4s¹ state in sodium (see Figure 8.4) decays to the 3p doublet
with the emission of radiation at the wavelengths 1138.15 nm and 1140.38 nm, a
difference of 2.23 nm. Explain how the 3p fine structure can give a wavelength
difference of 0.59 nm in one case and 2.23 nm in the other case.

18. Suppose we had a three-level atom, like that of Figure 8.21, in which the
metastable state were the higher excited state; the lasing transition would then
be the upper transition. Does this atom solve the problem of absorption of the
lasing transition? Would such an atom make a good laser?

19. How does a laser beam differ from a point source of light? Contrast the change
in beam intensity with distance from the source for a laser and a point source.

20. Explain what is meant by a population inversion and why it is necessary for
the operation of a laser.

21. How could you demonstrate that laser light is coherent? What would be the
result of the same experiment using an ordinary monochromatic source? A white
light source?


Problems


8.1 The Pauli Exclusion Principle


1. (a) List the six possible sets of quantum numbers (n, l, m_l, m_s) of a 2p
electron. (b) Suppose we have an atom such as carbon, which has two 2p electrons.
Ignoring the Pauli principle, how many different possible combinations of quantum
numbers of the two electrons are there? (c) How many of the possible combinations
of part (b) are eliminated by applying the Pauli principle? (d) Suppose carbon is
in an excited state with configuration 2p¹3p¹. Does the Pauli principle restrict
the choice of quantum numbers for the electrons? How many different sets of
quantum numbers are possible for the two electrons?

2. Nitrogen (Z = 7) has three electrons in the 2p level (in addition to two
electrons each in the 1s and 2s levels). (a) Consistent with the Pauli principle,
what is the maximum possible value of the total m_s of all seven electrons?
(b) List the quantum numbers of the three 2p electrons that result in the largest
total m_s. (c) If the electrons in the 2p level occupy states that maximize m_s,
what would be the maximum possible value for the total m_l? (d) What would be the
maximum possible total m_l if the three 2p electrons were in states that did not
maximize m_s?

3. (a) How many different sets of quantum numbers (n, l, m_l, m_s) are possible
for an electron in the 4f level? (b) Suppose a certain atom has three electrons in
the 4f level. What is the maximum possible value of the total m_s of the three
electrons? (c) What is the maximum possible total m_l of three 4f electrons?
(d) Suppose an atom has ten electrons in the 4f level. What is the maximum
possible value of the total m_s of the ten 4f electrons? (e) What is the maximum
possible total m_l of ten 4f electrons?


8.2 Electronic States in Many-Electron Atoms


4. (a) Suppose a beryllium atom (Z = 4) absorbs energy (such as from a beam of
photons) that pushes one of the electrons to an excited state. If the photon
energy is set at the minimum necessary for this to occur, from which subshell
does the electron make the transition and to which subshell does it jump?
(b) Suppose the same experiment is done with neon (Z = 10). At the minimum energy
for absorption, from which subshell does the electron make the transition and
to which subshell does it jump? (c) Would you expect the minimum absorption
energy for beryllium to be larger or smaller than the minimum energy for neon?
Explain.

(a) List all elements with a p* configuration. (b) List all 
elements with a d’ configuration. 

6. Give the electronic configuration of (a) P; (b) V; (c) Sb; (d) Pb.

7. (a) What is the electronic configuration of Fe? (b) In its ground state, what
is the maximum possible total m_s of its electrons? (c) When the electrons have
their maximum possible total m_s, what is the maximum total m_l? (d) Suppose one
of the d electrons is excited to the next highest level. What is the maximum
possible total m_s, and when m_s has its maximum total what is the maximum
total m_l?


8.3 Outer Electrons: Screening and Optical Transitions


8. The ground state of singly ionized lithium (Z = 3) is 1s². Use the electron
screening model to predict the energies of the 1s¹2p¹ and 1s¹3d¹ excited states
in singly ionized lithium. Compare your predictions with the measured energies
(respectively −13.4 eV and −6.0 eV).

9. The ground state of neutral beryllium (Z = 4) is 1s²2s². Use the electron
screening model to predict the energies of the following excited states:
1s²2s¹3p¹ (measured −2.02 eV) and 1s²2s¹4d¹ (−0.90 eV).

10. Using the wavelengths given in Figure 8.4, compute the energy difference
between the 3d and 4d states in lithium; do the same for sodium. Compare those
values with the corresponding n = 4 to n = 3 energy difference in hydrogen. Why
is the agreement so good, considering the different values of Z?

11. (a) Using the information for lithium given in Figure 8.4, compute the energy
difference of the 3p and 3d states. (b) Compute the energy of the 3s, 4s, and 5s
states above the ground state. (c) The ionization energy of lithium in its ground
state is 5.39 eV. What is the ionization energy of the 2p state? Of the 3s state?


8.5 Inner Electrons: Absorption Edges and X Rays


12. A certain element emits a Kα X ray of wavelength 0.1940 nm. Identify the
element.

13. Compute the Kα X ray energies of calcium (Z = 20), zirconium (Z = 40), and
mercury (Z = 80). Compare with the measured values of 3.69 keV, 15.8 keV, and
70.8 keV. (See Question 16.)


8.6 Addition of Angular Momenta


14. Chromium has the electron configuration 4s¹3d⁵ beyond the inert argon core.
What are the ground-state L and S values?

15. Use Hund’s rules to find the ground-state L and S of (a) Ce, configuration
[Xe]6s²4f¹5d¹; (b) Gd, configuration [Xe]6s²4f⁷5d¹; (c) Pt, configuration
[Xe]6s¹4f¹⁴5d⁹.

16. Using Hund’s rules, find the ground-state L and S of (a) fluorine (Z = 9);
(b) magnesium (Z = 12); (c) titanium (Z = 22); (d) iron (Z = 26).

17. A certain excited state of an atom has the configuration 4d¹5d¹. What are the
possible L and S values?

18. Use the degeneracies of the states with all possible total L and S to find
how many different levels the 2p¹3p¹ excited state of carbon includes. (See
Figure 8.18.) Compare this result with the result of counting the individual m_l
and m_s values from Problem 1(d). (See also Question 11.)


8.7 Lasers


19. A small helium-neon laser produces a light beam with an average power of
3.5 mW and a diameter of 2.4 mm. (a) How many photons per second are emitted by
the laser? (b) What is the amplitude of the electric field of the light wave?
Compare this result with the electric field at a distance of 1 m from an
incandescent light bulb that emits 100 W of visible light.


General Problems


20. (a) How many different possible ways are there to assign the sets of quantum
numbers to the four 2p electrons in oxygen (Z = 8)? (b) List all possible values
of the total m_s for the four electrons. (c) List all possible values of the
total m_l of the four electrons. (d) If the total m_s has its largest possible
value, what are the possible values of the total m_l? (e) If the total m_l has
its largest possible value, what are the possible values of the total m_s?

21. (a) The ionization energy of sodium is 5.14 eV. What is the effective charge
seen by the outer electron? (b) If the 3s electron of a sodium atom is moved to
the 4f state, the measured binding energy is 0.85 eV. What is the effective
charge seen by an electron in this state?

22. Draw a Moseley plot, similar to Figure 8.16, for the Kβ X rays using the
following energies in keV:

Ne  0.858     Mn  6.51     Zr  17.7
P   2.14      Zn  9.57     Rh  22.8
Ca  4.02      Br  13.3     Sn  28.4

Determine the slope and compare with the expected value. (Equation 8.4 applies
only to Kα X rays; you will need to derive a similar equation for the Kβ X rays.)
Determine the z-axis intercept and give its interpretation.

Draw a Moseley plot, similar to Figure 8.16, for the L, X 
rays using the following energies in keV: 


Mn 0.721 Rh 2.89 
Zn 1.11 Sn 3.71 
Br 1.60 Cs 4.65 
Zr 2.06 Nd 5.72 


Give interpretations of the slope and intercept. 

24. Because of the fine-structure splitting of the 3p state, the 3p → 3s
transition in sodium actually consists of two closely spaced lines of wavelengths
589.00 nm and 589.59 nm. Assuming a magnetic moment of one Bohr magneton, find
the effective magnetic field that produces the fine-structure splitting of the
3p state of sodium.


25. (a) What is the longest wavelength of the absorption spectrum of lithium?
(b) What is the longest wavelength of the absorption spectrum of helium? In what
region of the spectrum does this occur? (c) What are the shortest wavelengths
in the absorption spectra of helium and lithium? In what region of the
electromagnetic spectrum are these?


26. Using the wavelengths given in Figure 8.17, compute the energy difference
between the 1s¹4p¹ and 1s¹3p¹ singlet (S = 0) states in helium. Compare this
energy difference with the value expected using the Bohr model, assuming that
the p electron is screened by the s electron. Repeat the calculation for the 3d
and 4d triplet (S = 1) states.





MOLECULAR STRUCTURE 
such as DNA. The photo shows a computer model of C₆₀, a spherical arrangement of 60
carbon atoms in pentagons and hexagons, known as a “buckyball.”







In this chapter we consider the combination of atoms into molecules, the excited 
states of molecules, and the ways that molecules can absorb and emit radiation. 
From a variety of experiments we learn that the spacing of atoms in molecules is 
of the order of 0.1 nm, and that the binding energy of an atom in a molecule is 
of the order of electron-volts. This spacing and binding energy are characteristic 
of electronic orbits, which suggests that the forces that bind molecules together 
originate with the electrons. The negatively charged electrons provide the binding 
that overcomes the Coulomb repulsion of the positively charged nuclei of the 
atoms in the molecule. 

When atoms are brought together to form molecules, the atomic states of the 
electrons change into molecular states. These states are filled in the order of 
increasing energy by the valence electrons of the atoms of the molecule. The 
probability densities of the occupied molecular states determine the nature of 
molecular bonds and the structure and properties of molecules, including their 
geometrical shapes. 

Just as we began to study atomic physics by looking at the simplest atom, we 
begin our study of molecular physics with the simplest molecule, H₂⁺, the singly
ionized hydrogen molecule. We next turn to other simple molecules, such as H₂
and NaCl, and finally we look at how our previous knowledge of atomic wave 
functions can help us to understand the molecular states that form the basis of 
organic chemistry. 

We will also study ways other than electronic excitations that molecules can 
absorb and emit electromagnetic radiation. These radiations give a distinctive 
signature of the molecule and its structure. Molecular spectroscopy, the study 
of these radiations, finds application in such diverse areas as identification of 
atmospheric pollutants and the search for life in outer space. 


9.1 THE HYDROGEN MOLECULE 


Let’s first look at how the wave functions of the atomic electrons can lead to 
the binding together of atoms into stable molecules. Even though the negatively 
charged electrons provide the attractive force that overcomes the Coulomb 
repulsion of the positively charged atomic nuclei, it is perhaps not immediately 
obvious how stable molecules form at all because there is also a Coulomb repulsion 
of the electrons of one atom for those of another. The key to understanding this 
problem is the existence of the spatial probability densities of atomic orbits, such 
as we calculated for hydrogen and illustrated in Chapter 7. These probability 
densities are frequently not spherically symmetric, and very often may show 
overwhelming preferences for one spatial direction over another. 

A complete understanding of the effect of the electrons on molecular binding 
is in general made difficult by what also complicates atomic structure—there are 
too many electrons present for us to be able to write down and solve the equations 
that govern the structure of the atom or molecule. We therefore use the same 
tactic to study molecular structure that we used for atomic structure: we begin 
with a molecule that has only one electron. Such a molecule is H₂⁺, the hydrogen
molecule ion, which results when we remove an electron from a molecule of
ordinary hydrogen, H₂.

Before we discuss the wave mechanical properties of H₂⁺, let’s try to guess
what holds this molecule together. We first realize that it is not correct to think
of H₂⁺ as an atom of hydrogen (proton plus electron) joined to a second proton.
The atom of hydrogen in such a combination is electrically neutral, so there is
no electrostatic Coulomb force to hold the two pieces together. In this kind of
molecule, at least, it is apparently not correct to identify the electron as belonging
exclusively to one or the other of the components. The electron must somehow be
shared between the two parts. The electron must spend a significant part of its time
in the region between the two protons. In the language of quantum mechanics, the
electron’s probability density must have a large value in that region.

FIGURE 9.1 The electron wave functions for two hydrogen atoms separated by a
large distance.

As we learned in Chapter 7, an electron in the ground state of hydrogen 
has an energy of −13.6 eV, a wave function ψ = (πa₀³)^(−1/2) e^(−r/a₀), where a₀ is
the Bohr radius, and a probability density proportional to ψ². Figure 9.1 shows
the wave function for an electron that could be bound to either of two protons 
separated by a large distance. As we bring the two protons closer together, the 
wave functions begin to overlap, and we must combine them according to the 
rules of quantum mechanics—first add the wave functions, then square the result 
to find the combined probability density. (Note that this gives a very different 
result from first squaring, then adding.) 

We can combine these two wave functions in two different ways, depending on 
whether they have the same signs or opposite signs. The absolute sign of a wave 
function is arbitrary. When we calculate the normalization constant of a wave 
function, we actually compute its square. We could choose either the positive or 
the negative root; for convenience we usually choose the positive one. When we 
calculate probability densities ψ² for a single wave function, the choice of sign
becomes irrelevant. However, when we combine different wave functions, their 
relative signs determine whether the two functions add or subtract, which can 
result in very different probability densities. 

Consider the two different combinations of wave functions shown in Figure 9.2. 
In one case (Figure 9.2a), the two wave functions have the same sign, and in the 
other case (Figure 9.2b) they have opposite signs. This has a substantial effect 
on the probability distributions, which are shown in Figure 9.3. The probability 
density obtained from squaring ψ₁ + ψ₂ (Figure 9.3a) has relatively large values
in the region between the two protons. This suggests a concentration of negative
charge between the protons, which can supply the Coulomb attraction to pull
the two protons together and form a stable molecule. The square of ψ₁ − ψ₂
(Figure 9.3b), however, gives a vanishing probability density midway between 
the protons and thus a small density of negative charge in the region between the 
protons. There is not enough negative charge to overcome the Coulomb repulsion 
of the protons, and as a result this combination of wave functions does not lead to 
the formation of a stable molecule. 
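
The difference between adding wave functions before squaring and squaring before adding can be checked numerically. The sketch below evaluates the hydrogen 1s wave function at a point on the line joining the two protons. The proton separation of two Bohr radii, the choice of units (lengths in units of the Bohr radius), and the omission of the overall normalization of the combined wave functions are all assumptions made for illustration.

```python
import math

A0 = 1.0  # Bohr radius; all lengths are in units of a0 (assumed convention)


def psi_1s(r: float) -> float:
    """Hydrogen ground-state wave function (pi*a0^3)^(-1/2) * exp(-r/a0)."""
    return math.exp(-r / A0) / math.sqrt(math.pi * A0 ** 3)


def densities_on_axis(z: float, separation: float) -> dict:
    """Unnormalized probability densities at position z on the axis joining
    two protons located at z = -separation/2 and z = +separation/2."""
    psi1 = psi_1s(abs(z + separation / 2))
    psi2 = psi_1s(abs(z - separation / 2))
    return {
        "plus": (psi1 + psi2) ** 2,              # same signs: add, then square
        "minus": (psi1 - psi2) ** 2,             # opposite signs: add, then square
        "sum_of_squares": psi1 ** 2 + psi2 ** 2,  # square first, then add
    }


# Densities at the midpoint between the protons.
midpoint = densities_on_axis(z=0.0, separation=2.0)
print(midpoint)
```

At the midpoint the two wave functions are equal, so the same-sign combination gives twice the density obtained by squaring first and then adding, while the opposite-sign combination gives zero, as described above.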


Binding Energy of H₂⁺


There are two contributions to the energy of the H₂⁺ molecule: the Coulomb
repulsion of the two positively charged protons for each other, and the attraction
of the combination of the two protons for the negatively charged electron.
The Coulomb repulsion energy of the protons is positive, and the energy of
the attraction of the electron by the protons is negative. For a stable molecule
to form, the total energy must be negative, so the critical question is whether
the electrons can provide enough negative energy of attraction to overcome the
positive repulsion energy of the protons.

FIGURE 9.2 The overlap of two hydrogenic wave functions. The wave functions are
indicated by the dashed lines, and their sum by the solid line. In (a), the two
wave functions have the same sign, while in (b) they have opposite signs.

FIGURE 9.3 The probability densities corresponding to the two combined wave
functions of Figure 9.2.

FIGURE 9.4 Dependence of energy on separation distance R for H₂⁺.

To find the conditions necessary for a stable H₂⁺ ion to form, let’s look at how
the various contributions to the energy of the ion depend on the separation distance
R between the two protons. The Coulomb potential energy that characterizes the
repulsion of the bare protons is U_p = e²/4πε₀R; this function is plotted in
Figure 9.4. To find the electron energy as a function of R, we first consider the
case when the two protons are very far apart. In this case the electron is in the
ground state orbit about one of the protons, for which E = −13.6 eV. As we
bring the protons together, the electron becomes more tightly bound (because it
is attracted by both protons) and its energy becomes more negative. As R → 0,
the system approaches a single atom with Z = 2. For the wave function ψ₁ + ψ₂
(Figure 9.2a), the combined wave function has a maximum at R = 0 and resembles
the ground-state wave function of an atom with Z = 2. Recalling the result from
Chapter 6 for the electron energy in a hydrogen-like atom,


\[ E_n = (-13.6\ \mathrm{eV})\,\frac{Z^2}{n^2} \tag{9.1} \]


where n is the principal quantum number, we find the energy of a ground-state
electron for Z = 2 to be −54.4 eV. The energy corresponding to the sum of the
two wave functions, which we label E₊, therefore has the value −13.6 eV at large
R and approaches the value −54.4 eV at small R. The result of an exact calculation
of E₊ is shown in Figure 9.4.

For the combination corresponding to the difference between the two wave 
functions, the energy is once again −13.6 eV for large R. As R → 0, the
combined wave function approaches 0 (Figure 9.2b). The lowest energy level
with a wave function that vanishes at R = 0 is the 2p state, for which the energy
in a Z = 2 hydrogenlike atom is −13.6 eV. The energy E₋ corresponding to the
wave function ψ₁ − ψ₂ therefore has the value −13.6 eV for both large R and
small R. Its exact form is shown in Figure 9.4. 
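
The limiting values quoted above follow directly from Equation 9.1. The short sketch below evaluates them; the function and variable names are illustrative choices, not notation from the text.

```python
RYDBERG_EV = 13.6  # magnitude of the hydrogen ground-state energy in eV (value used in the text)

def hydrogenlike_energy(Z, n):
    """Energy in eV of level n in a one-electron atom of nuclear charge Z (Eq. 9.1)."""
    return -RYDBERG_EV * Z**2 / n**2

e_large_r = hydrogenlike_energy(Z=1, n=1)        # separated limit: electron on one proton
e_plus_small_r = hydrogenlike_energy(Z=2, n=1)   # united-atom limit for the sum (1s state)
e_minus_small_r = hydrogenlike_energy(Z=2, n=2)  # united-atom limit for the difference (2p state)

print(f"large R: {e_large_r:.1f} eV")
print(f"small R, sum: {e_plus_small_r:.1f} eV")
print(f"small R, difference: {e_minus_small_r:.1f} eV")
```

The sum of the wave functions drops from −13.6 eV to −54.4 eV as R shrinks, while the difference returns to −13.6 eV, which is why only the sum can offset the proton repulsion.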

The total energy of the hydrogen molecule ion is the sum of the proton
energy U_p and the electron energy E₊ or E₋. These two sums are also plotted
in Figure 9.4. You can see that the combination U_p + E₋ has no minimum and
therefore no stable bound state. The wave function ψ₁ − ψ₂ does not lead to a
stable configuration for the hydrogen molecule ion, just as we originally suspected.

The sum U_p + E₊ gives the stable configuration of the ion, for which the
equilibrium condition occurs at the point where U_p + E₊ has its minimum value.
The minimum occurs at a separation R_eq = 0.106 nm and an energy of −16.3 eV.
The binding energy B of H₂⁺ is the energy necessary to take apart the ion into H
and H⁺ and corresponds to the depth of the potential energy minimum of U_p + E₊
in Figure 9.4:


$$B = E(\mathrm{H} + \mathrm{H}^+) - E(\mathrm{H}_2^+) = -13.6\ \mathrm{eV} - (-16.3\ \mathrm{eV}) = 2.7\ \mathrm{eV} \qquad (9.2)$$


Note that we have defined molecular binding energy as the energy difference
between the separate components (H and H⁺) and the combined system (H₂⁺).

It is interesting to note that the stability is achieved at R_eq = 2a₀. In Chapter 7
we learned that the radial probability density for the 1s state of hydrogen has its
maximum value at r = a₀. Thus the stable configuration of the H₂⁺ ion is such
that the maximum in the radial probability density for a single H atom would
fall exactly in the middle of the molecule! This is once again consistent with our
expectations for the structure of H₂⁺: the electron must spend most of its time
between the two protons.
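
As a rough numerical check of this observation, the sketch below locates the peak of the hydrogen 1s radial probability density on a grid and compares it with half the equilibrium separation. It assumes the standard form P(r) = 4r² exp(−2r/a₀)/a₀³ and a Bohr radius of 0.0529 nm; the grid spacing and names are arbitrary illustrative choices.

```python
import math

A0_NM = 0.0529    # Bohr radius in nm (assumed value)
R_EQ_NM = 0.106   # equilibrium separation of the hydrogen molecule ion quoted in the text

def radial_probability_1s(r_nm):
    """Radial probability density 4 r^2 exp(-2 r / a0) / a0^3 of the hydrogen 1s state."""
    return 4.0 * r_nm**2 * math.exp(-2.0 * r_nm / A0_NM) / A0_NM**3

# Scan r from 0.00001 nm to 0.5 nm and pick the grid point where P(r) is largest.
grid = [i * 1e-5 for i in range(1, 50001)]
r_peak = max(grid, key=radial_probability_1s)

print(f"peak of P(r): {r_peak:.4f} nm")
print(f"R_eq / 2:     {R_EQ_NM / 2:.4f} nm")
```

The two numbers agree to within the rounding of R_eq, which is the sense in which the single-atom probability maximum falls at the midpoint of the ion.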

In summary, from our study of this simple molecule we have learned that an 
important feature of molecular bonding concerns the sharing of a single electron 
by two atoms of the molecule. This sharing is responsible for the stability of the 
molecule. With this in mind we can now add a second electron and consider the
H₂ molecule.


The H₂ Molecule


Suppose we have two hydrogen atoms separated by a very large distance.
Associated with each atom there is a 1s electronic state, at an energy of −13.6 eV,
because the atoms are so far apart that there is no interaction between the electrons.
As we bring the atoms closer together to form an H₂ molecule, the electron wave
functions begin to overlap, so that the electrons are “shared” between the two
atoms. As we have seen in the previous discussion, this can occur in such a way
that the two electron wave functions add in the region between the two protons,
giving a stable molecule, or subtract, leading to no stable molecule. The separate,
individual electronic states of the atoms now become molecular states.

Notice that, as shown in Figure 9.5, the number of states does not change as
the separation R is reduced. When the atoms are separated by a large distance,
there are two states, each at −13.6 eV, so the total energy at R = ∞ is −27.2 eV.
When the separation is reduced, there are still two states, but now at different
energies. One state corresponds to the sum of the two wave functions and leads
to a stable H₂ molecule; the other state corresponds to the difference of the two
wave functions and does not give a stable molecule. The molecular state that
leads to a stable molecule is known as a bonding state, and the one that does not
lead to a stable molecule is an antibonding state.

As we found previously for H₂⁺, in order to form a molecule, the electron
probability distribution must be large in the region between the two protons. In the
case of H₂, this is true for both electrons, and it is certainly our expectation, based
on the Pauli principle, that for the two electrons both to occupy the molecular state
leading to the large probability in the central region, their spins must be oppositely
directed; that is, one must have m_s = +1/2 and the other m_s = −1/2. As long as
the two electrons have opposite spins, they can both occupy the bonding state,
leading to a stable molecule.

The energy of the bonding state for H₂ is shown in Figure 9.6; as you can see,
there is a minimum with E = −31.7 eV at R = 0.074 nm. The molecular binding
energy of H₂ is the difference between the energy of the separated neutral H
atoms and the energy of the combined system:


$$B = E(\mathrm{H} + \mathrm{H}) - E(\mathrm{H}_2) = 2(-13.6\ \mathrm{eV}) - (-31.7\ \mathrm{eV}) = 4.5\ \mathrm{eV} \qquad (9.3)$$
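
The two binding energies in Equations 9.2 and 9.3 use the same definition: energy of the separated fragments minus energy of the combined system. A minimal sketch of that bookkeeping follows; the helper name `binding_energy` is hypothetical, and the energies are the values quoted in the text.

```python
E_H_ATOM = -13.6   # eV, ground-state energy of a neutral H atom
E_PROTON = 0.0     # eV, a bare proton contributes no electronic energy

def binding_energy(separated_ev, combined_ev):
    """Energy needed to take the combined system apart into the separated fragments."""
    return separated_ev - combined_ev

b_h2_plus = binding_energy(E_H_ATOM + E_PROTON, -16.3)  # Eq. 9.2: ion -> H + proton
b_h2 = binding_energy(2 * E_H_ATOM, -31.7)              # Eq. 9.3: molecule -> H + H

print(f"B(H2+) = {b_h2_plus:.1f} eV")
print(f"B(H2)  = {b_h2:.1f} eV")
```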


Comparing Figures 9.4 and 9.6, you can immediately see the effect of adding
an additional electron to H₂⁺: the binding energy is greater (the molecule is more
tightly bound), and the protons are drawn closer together. Both of these effects
are due to the presence of the increased electron density in the region between the
two protons.

FIGURE 9.6 Bonding and antibonding in H₂.

We can also understand why He does not form the molecule He₂: as two
He atoms are brought together, the bonding and antibonding states are formed in
much the same way as with H₂. The He₂ molecule would have four electrons; at
most two can be in the bonding state, so the other two must be in the antibonding
state. The net effect is that no stable molecule forms. (However, He₂⁺ is stable,
with two bonding electrons and only one antibonding electron. The binding
energy of He₂⁺ is 3.1 eV and the separation is 0.108 nm, remarkably close to the
corresponding values of H₂⁺.)
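
The counting argument for these four systems can be sketched in a few lines. The code below is only an illustration of the filling rule (bonding state first, at most two electrons per state); treating the sign of "bonding minus antibonding" as the stability criterion is the simplified picture used in the text, not a quantitative calculation.

```python
def fill_1s_molecular_states(n_electrons):
    """Place electrons in the 1s bonding state first, then the 1s antibonding state.

    Each molecular state holds at most two electrons (Pauli principle).
    Returns the pair (bonding, antibonding) of occupancies.
    """
    if not 0 <= n_electrons <= 4:
        raise ValueError("the two 1s molecular states hold at most four electrons")
    bonding = min(n_electrons, 2)
    antibonding = n_electrons - bonding
    return bonding, antibonding

for name, n in [("H2+", 1), ("H2", 2), ("He2+", 3), ("He2", 4)]:
    bonding, antibonding = fill_1s_molecular_states(n)
    net = bonding - antibonding
    verdict = "bound" if net > 0 else "not bound"
    print(f"{name:5s} bonding={bonding} antibonding={antibonding} net={net} -> {verdict}")
```

Only the four-electron case ends with no net bonding electrons, matching the absence of a stable He₂ molecule.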


9.2 COVALENT BONDING IN MOLECULES 


The sharing of electrons in a molecule such as H₂ is the origin of the covalent
bond; this type of bonding occurs commonly in molecules containing two identical
atoms, in which case it is called homopolar or homonuclear bonding.

The essential features of covalent bonding are: 


1. As two atoms are brought together, the electrons interact and the separate 
atomic states and energy levels are transformed into molecular states. 

2. In one of the molecular states, the electron wave functions overlap in such 
a way as to give a lower energy than the separated atoms had; this is the 
bonding state that leads to the formation of stable molecules. 

3. The other molecular state (the antibonding state) has an increased energy 
relative to the separated atoms and does not lead to the formation of stable 
molecules. 

4. The restrictions of the Pauli principle apply to molecular states just as they 
do to atomic states; each molecular state has a maximum occupancy of two 
electrons, corresponding to the two different orientations of electron spin. 


Other hydrogenlike atoms with a single s electron can also form stable
molecules through covalent bonding. For example, two Li atoms (Z = 3, configuration
1s²2s¹) can form a molecule of Li₂. The four 1s electrons (two from
each atom) fill the 1s bonding and antibonding states, and the remaining two 2s
electrons can both occupy the 2s bonding state. The binding energy of Li₂ is
1.10 eV, which is considerably smaller than the binding energy of H₂ (4.52 eV),
and the equilibrium separation distance of the atoms in the molecule is 0.267 nm,
much larger than that of H₂ (0.074 nm).

Other homonuclear molecules formed from s-state bonds are listed in Table 9.1.
It is customary to characterize the molecular bond strength in terms of the
dissociation energy rather than the binding energy; the two terms are usually
equivalent and indicate the energy needed to break the molecule into neutral atoms.
The dissociation energy is weakly temperature dependent. Some tabulations list
the values at room temperature (as in Table 9.1), while others list values at 0 K.
The room-temperature values are higher than the 0 K values by about 1.5kT ≈
0.04 eV.
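
The size of this thermal offset is easy to check. The sketch below evaluates 1.5kT, assuming a Boltzmann constant of 8.617 × 10⁻⁵ eV/K and taking "room temperature" to mean 300 K (an assumption; the text does not give a specific temperature).

```python
BOLTZMANN_EV_PER_K = 8.617e-5  # Boltzmann constant in eV/K
ROOM_TEMPERATURE_K = 300.0     # assumed value for room temperature

thermal_shift_ev = 1.5 * BOLTZMANN_EV_PER_K * ROOM_TEMPERATURE_K
print(f"1.5 kT at {ROOM_TEMPERATURE_K:.0f} K = {thermal_shift_ev:.3f} eV")
```

The result rounds to 0.04 eV, small compared with every dissociation energy in Table 9.1.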

As Z increases, meaning that the s electrons are associated with increasing
principal quantum numbers n, the dissociation energy decreases and the equilibrium
separation increases. This is consistent with the behavior of the s electron
in atoms as n increases: as Figure 8.7 shows, the radius of the orbit of the s
electron increases with increasing n for the alkali elements.

TABLE 9.1 Properties of s-Bonded Molecules*

Molecule    Dissociation Energy (eV)    Equilibrium Separation (nm)
H₂          4.52                        0.074
Li₂         1.10                        0.267
Na₂         0.80                        0.308
K₂          0.59                        0.392
Rb₂         0.47                        0.422
Cs₂         0.43                        0.450
LiH         2.43                        0.160
LiNa        0.91                        0.281
NaH         2.09                        0.189
KNa         0.66                        0.347
NaRb        0.61                        0.359

*Values taken from the Handbook of Chemistry and Physics and the American Institute of
Physics Handbook.


We can also form molecular bonds with two different alkali elements (or with an
alkali element and hydrogen). Some of these are listed in Table 9.1. The dissociation
energies and equilibrium separations are consistent with those of the corresponding
homonuclear molecules. For example, the dissociation energy and equilibrium
separation of LiH lie between those of H₂ and Li₂.
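
One way to test this consistency is to check each mixed molecule in Table 9.1 against its two homonuclear "parents". The sketch below does so with the tabulated values; the dictionaries and the helper `lies_between` are illustrative names only.

```python
# (dissociation energy in eV, equilibrium separation in nm), from Table 9.1
HOMONUCLEAR = {
    "H": (4.52, 0.074), "Li": (1.10, 0.267), "Na": (0.80, 0.308),
    "K": (0.59, 0.392), "Rb": (0.47, 0.422),
}
HETERONUCLEAR = {
    ("Li", "H"): (2.43, 0.160), ("Li", "Na"): (0.91, 0.281), ("Na", "H"): (2.09, 0.189),
    ("K", "Na"): (0.66, 0.347), ("Na", "Rb"): (0.61, 0.359),
}

def lies_between(value, a, b):
    """True if value falls in the closed interval spanned by a and b."""
    return min(a, b) <= value <= max(a, b)

results = {}
for (x, y), (energy, separation) in HETERONUCLEAR.items():
    (ex, rx), (ey, ry) = HOMONUCLEAR[x], HOMONUCLEAR[y]
    results[x + y] = (lies_between(energy, ex, ey), lies_between(separation, rx, ry))
    print(f"{x + y:5s} D = {energy:.2f} eV (parent mean {(ex + ey) / 2:.2f}), "
          f"R = {separation:.3f} nm (parent mean {(rx + ry) / 2:.3f})")
```

Every mixed molecule falls between its parents in both quantities; the alkali pairs sit close to the parent mean, while the hydrides lie somewhat below it in dissociation energy.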

Atoms with valence electrons in p states can also form diatomic molecules 
through covalent bonds—oxygen and nitrogen, for example. There are three 
atomic p states, so there will be six molecular p states, and the classification of 
levels can become quite tedious, but we can understand the structure of molecules 
composed of atoms with p electrons based on the geometry of atomic p states. 

In Chapter 7 we solved the Schrödinger equation for the H atom and showed
the spatial probability distributions for the various possible electronic wave
functions. Of course, these solutions for hydrogen will not be correct for other
atoms, but the essential features of the geometry of the atomic states remain
correct. We identified three different p states, corresponding to m_l = −1, 0, and
+1. The probability distributions corresponding to these m_l values were shown in
Figure 7.11.


We can imagine these distributions to have a sort of “figure-eight” shape with
two distinct lobes of large probability. In the m_l = 0 case, the figure eight has
its long axis along the z axis, and the two lobes of maximum probability occur
in the +z and −z directions. In the m_l = ±1 cases, the probability distribution
can be regarded as arising from a figure-eight probability distribution in the
xy plane that is rotating about the z axis, counterclockwise for m_l = +1 and
clockwise for m_l = −1. Because of the uncertainty principle, we can’t observe
the two probability lobes in the xy plane; all we can observe is the smeared-out
“donut-shaped” distribution shown in Figure 7.11.

FIGURE 9.7 Probability distributions
of three different p electrons.

FIGURE 9.9 (a) Overlap of p_x probability distributions. The vectors indicate
the force on the nuclei due to the overlap. (b) Overlap of p_y probability
distributions. The off-axis forces give a smaller resultant force along the axis.

For our purposes here, it is not as convenient to use the m_l notation as it is to
use a different representation in which we assign each of the three possible p states
a label that gives the direction in space corresponding to the lobes of maximum
probability. Thus p_z is the state with regions of large electron probability along
the z axis, and similarly for p_x and p_y. Figure 9.7 shows a schematic representation
of these probability distributions. (The p_z state corresponds exactly to m_l = 0; p_x
and p_y correspond to mixtures of m_l = +1 and m_l = −1.) Just as the uncertainty
principle does not allow us to observe the two lobes of probability in the xy plane,
it also forbids us from observing the separate p_x and p_y probability distributions.
However, the distributions do exist (even though we can’t observe them), and
two atoms can interact with one another by means of these electron clouds.
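
For readers who want the mixtures written out, one common way to express them is in terms of the l = 1 spherical harmonics. The block below assumes the usual phase convention, in which Y₁^{±1} ∝ ∓ sin θ e^{±iφ}; that convention is an assumption here, since the text does not specify one.

```latex
\begin{align*}
p_z &\propto Y_1^{0} \propto \cos\theta, \\
p_x &\propto \frac{1}{\sqrt{2}}\left(Y_1^{-1} - Y_1^{+1}\right) \propto \sin\theta\cos\phi, \\
p_y &\propto \frac{i}{\sqrt{2}}\left(Y_1^{-1} + Y_1^{+1}\right) \propto \sin\theta\sin\phi .
\end{align*}
```

Each combination has two lobes along a single Cartesian axis, which is the geometric picture used in the rest of this section.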

We consider the structure of molecules containing p electrons based on this
model of the three mutually perpendicular p states p_x, p_y, and p_z. We discuss three
applications of this type of covalent bonding: pp bonds, sp directed bonds, and sp
hybrid states.


pp Covalent Bonds 


Consider what happens when we bring together two p-shell atoms, whose
probability distributions are each similar to Figure 9.7. We assume that the atoms
approach along the x axis, as in Figure 9.8. As the atoms are brought together,
the p_x states overlap (Figure 9.9a), giving (if the two wave functions add) an
increased electron charge density between the two nuclei and contributing to the
bonding of the atoms in the molecule. There is a much weaker overlap between
the p_y states (Figure 9.9b) and also between the p_z states (which are not shown in
the figure). Because the overlap of the p_y states is not along the line connecting
the nuclei, there are components to the binding force that oppose one another,
and only a much smaller resultant force acts along the line connecting the nuclei
(Figure 9.9b). In addition, there is less overlap of the p_y states. The net result is
that the p_y states (and also the p_z states) are less effective in binding the molecule
than the p_x states.

This somewhat oversimplified model suggests that the p_x state should have a
much greater bonding effect (and also a greater antibonding effect) than the p_y
and p_z states. It also suggests that the bonding and antibonding effects of p_y states
should be the same as those of p_z states.

FIGURE 9.8 Two atoms with p electrons. The p_z probability distribution,
which extends perpendicular to the page, is not shown.

Now we can consider the energies of the molecular states as a function of the
nuclear separation distance R. We assume that we are dealing with two atoms
having filled 1s and 2s states and valence electrons in the 2p shell. When the 1s
states of the two atoms overlap, the result is 1s bonding and antibonding molecular
states, just as in the case of H₂. There are altogether four 1s electrons in the
molecule, and with two in each state the 1s bonding and antibonding molecular
states are filled to capacity. The same is true of the 2s states. The atomic 2s levels
form bonding and antibonding molecular states; because each atom has a filled 2s
shell, the four 2s electrons fill both the bonding and antibonding molecular states.

The atoms have partially filled 2p shells, so the final molecular bonding
depends critically on the molecular 2p states. For each atomic p state (p_x, p_y, p_z)
there are corresponding bonding and antibonding molecular states. However, the
bonding and antibonding effects of these states are not equivalent, as Figure 9.9
illustrates. The p state that happens to lie along the line of approach (p_x) has an
effect that is significantly greater than the p states that lie off the line of approach
(p_y, p_z). The p_x bonding state must therefore lie lower in energy than the p_y and
p_z bonding states, and the p_x antibonding state must lie higher in energy than the
p_y and p_z antibonding states. Figure 9.10 illustrates the energies of the molecular
states. The relative stability of a molecule can be determined based on the filling
of the bonding and antibonding states with electrons (two per state, corresponding
to spin up and spin down electrons). The following example illustrates how these
states are filled.


FIGURE 9.10 Bonding and antibonding 2p states.


Example 9.1


Based on the filling of the bonding and antibonding states,
predict the relative stability of the molecules (a) N₂,
(b) O₂, and (c) F₂.


Solution 

(a) Nitrogen (1s²2s²2p³) has seven electrons: two each in
the filled 1s and 2s shells, and three electrons in the 2p shell.
In the N₂ molecule, there are therefore 14 electrons. We
start in Figure 9.10 with two in the bonding 1s state, then
two in the antibonding 1s, then two more in the bonding
2s, followed by two in the antibonding 2s for a total of
eight electrons in the s states. That leaves six 2p electrons
for the 2p molecular states. We can place two each in the
three lowest 2p bonding molecular states, thus filling those
states. No electrons go into the 2p antibonding states. With
only bonding 2p electrons, N₂ forms a very stable diatomic
molecule.

(b) Oxygen (1s²2s²2p⁴) has eight electrons, so the O₂
molecule has a total of 16 electrons. As in N₂, the first eight
electrons fill the 1s and 2s states, leaving eight additional
electrons for the 2p states. The first six of those fill the three
bonding states, and so the remaining two must go into 2p
antibonding states. With six bonding and two antibonding
valence electrons, we would expect that O₂ is less stable
than N₂, which has only bonding valence electrons.

(c) Fluorine (1s²2s²2p⁵) has nine electrons, so of the 18
electrons in the F₂ molecule 10 must be placed in the 2p
states: six in the bonding states and four in the antibonding
states. Thus F₂ should be less stable than O₂.





How well do the properties of these molecules agree with our predictions? N₂
has a dissociation energy of 9.8 eV and is not reactive under most circumstances.
O₂ has a smaller dissociation energy (5.1 eV); the O₂ molecular bonds can be
broken by relatively modest chemical reactions, as, for example, the oxidation
of metals exposed to air. F₂ has an even smaller dissociation energy (1.6 eV);
fluorine gas reacts quite violently with many substances, and the F₂ molecule
can be broken apart by exposure to visible light (which has photon energies of
2–4 eV) in a process known as photodissociation. The properties of these 2p
molecules are thus quite consistent with expectations based on the filling of the
bonding and antibonding states. Similar relationships occur for the 3p, 4p, 5p, and
6p homonuclear molecules.
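
The filling procedure of Example 9.1 can be sketched as a short script. It is only a sketch of the counting rule: the three 2p bonding states (six electrons) are assumed to fill before any antibonding state, and the "net bonding" count is compared with the dissociation energies quoted above. All names are illustrative.

```python
# Dissociation energies quoted in the text, in eV.
DISSOCIATION_EV = {"N2": 9.8, "O2": 5.1, "F2": 1.6}
# Number of 2p electrons contributed by each atom of the molecule.
P_ELECTRONS_PER_ATOM = {"N2": 3, "O2": 4, "F2": 5}

def fill_2p_molecular_states(n_2p_electrons):
    """Fill the three 2p bonding states (six electrons) before the antibonding ones."""
    bonding = min(n_2p_electrons, 6)
    antibonding = n_2p_electrons - bonding
    return bonding, antibonding

net_bonding = {}
for molecule, per_atom in P_ELECTRONS_PER_ATOM.items():
    bonding, antibonding = fill_2p_molecular_states(2 * per_atom)
    net_bonding[molecule] = bonding - antibonding
    print(f"{molecule}: bonding={bonding}, antibonding={antibonding}, "
          f"net={net_bonding[molecule]}, D={DISSOCIATION_EV[molecule]} eV")
```

The net count falls from six to four to two across N₂, O₂, and F₂, in the same order as the measured dissociation energies.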


sp Molecular Bonds 


It is often the case that a stable molecule is formed from two atoms, one with an
s-state valence electron and the other with one or more p-state valence electrons.
Consider, for example, the HF molecule. The F atom has five electrons in the
p shell, so of the three 2p atomic states, two will each have their capacity of
two electrons, and the third will have a single electron. We ignore the four
paired p electrons, which do not significantly affect the molecular bonding, and
concentrate instead on the single unpaired p electron. The two-lobed probability
distribution corresponds to a two-lobed p-state wave function, in which the signs
of ψ are opposite for the two lobes. The 1s wave function of H has only one sign
(Figure 9.11). As the H and F atoms approach each other from a large distance,
the H wave function and the F wave function can combine to give an increased
electron probability in the region between the nuclei, and hence a bonding sp state
is formed. It is also possible to have antibonding sp states, which result from the
H and F wave functions having opposite signs and producing a reduced electron
probability density between the nuclei.

Table 9.2 gives dissociation energies and nuclear separation distances for some 
sp-bonded diatomic molecules. 

TABLE 9.2 Properties of sp-Bonded Molecules

Molecule    Dissociation Energy (eV)    Equilibrium Separation (nm)
HF          5.90                        0.092
HCl         4.48                        0.128
HBr         3.79                        0.141
HI          3.10                        0.160
LiF         5.98                        0.156
LiCl        4.86                        0.202
NaF         4.99                        0.193
NaCl        4.26                        0.236
KF          5.15                        0.217
KCl         4.43                        0.267

Consider now the structure of the water molecule, H₂O. Oxygen has eight
electrons, four of which occupy the 2p shell. When we place these electrons in the
2p atomic states, we begin with one electron each in the p_x, p_y, and p_z states, and
then the fourth 2p electron must pair with one of the first three. An oxygen atom
therefore has two unpaired 2p electrons, each of which can form a bond with the
1s electron of H to form a molecule of H₂O. Figure 9.12 shows a representation
of the electron probability distributions we might expect for an oxygen atom and
for a molecule of H₂O. Such a molecule has directed bonds, which have a fixed,
measurable relative direction in space. The expected angle between the two bonds
is 90°; this angle can be measured experimentally by, for example, measuring the
electric dipole moment of the molecule, and the result, 104.5°, is somewhat larger
than we expect. This discrepancy can be interpreted as arising from the Coulomb
repulsion of the two H atoms, which tends to spread the bond angle somewhat.

FIGURE 9.12 Overlap of electronic wave functions in H₂O.

As another example, consider the NH₃ (ammonia) molecule. With Z = 7, the
nitrogen atom has three unpaired p electrons, one each in the p_x, p_y, and p_z atomic
states. Each of these can form a bond with a H atom to form the NH₃ molecule,
and we expect to find three mutually perpendicular sp bonds (Figure 9.13). The
measured bond angle is 107.3°, again indicating some repulsion between the
H atoms.
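
The number of directed bonds in these examples is set by the number of unpaired p electrons. The sketch below counts them using the filling order described above (one electron in each of the three p states before any pairing); the function name is a hypothetical helper, not notation from the text.

```python
def unpaired_p_electrons(n_p_electrons):
    """Count unpaired electrons when n electrons fill the three p states.

    One electron goes into each of the three states before any pairing occurs.
    """
    if not 0 <= n_p_electrons <= 6:
        raise ValueError("a p shell holds between 0 and 6 electrons")
    occupancy = [0, 0, 0]            # electrons in the x, y, and z states
    for i in range(n_p_electrons):
        occupancy[i % 3] += 1        # first pass singly occupies, second pass pairs
    return sum(1 for n in occupancy if n == 1)

for atom, n_p in [("N", 3), ("O", 4), ("F", 5)]:
    print(f"{atom}: {unpaired_p_electrons(n_p)} unpaired p electron(s)")
```

Nitrogen, oxygen, and fluorine come out with three, two, and one unpaired p electrons, matching the three bonds of ammonia, the two of water, and the single bond of HF.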





Table 9.3 lists some bond angles measured for other molecules that have sp
directed bonds. As you can see, the bond angle does indeed approach 90° in many
cases. Based on the discussion given above, you should be able to explain why
this happens as the Z of the central atom increases.

TABLE 9.3 Bond Angles of sp Directed Bonds

Molecule    Bond Angle
H₂O         104.5°
H₂S         93.3°
H₂Se        91.0°
H₂Te        89.5°
NH₃         107.3°
PH₃         93.3°
AsH₃        91.8°
SbH₃        91.3°

FIGURE 9.13 Overlap of electronic wave functions in NH₃.


sp Hybrid States


One example of a 2p atom we have so far not considered is carbon, and for a
special reason: carbon forms a great variety of molecular bonds, with a resulting
diversity in the type and complexity of molecules containing carbon. It is this
diversity that is the basis for the many kinds of organic molecules that can form,
based on various kinds of carbon molecular bonds, and so an understanding of
the physics of carbon molecular bonds is essential to the understanding of many
fundamental questions of structure and processes in molecular biology.

Carbon, with six electrons, has the configuration 1s²2s²2p², so we expect
carbon under ordinary circumstances to show a valence of 2, with the two 2p
electrons contributing to the structure, and we might therefore expect to form
stable molecules such as CH₂, with directed sp bonding (similar to H₂O) and a
bond angle of roughly 90°. Instead, what forms is CH₄ (methane) in a tetrahedral
structure (Figure 9.14), with four equivalent bonds. For another example, the
elements of the third column of the periodic table (boron, aluminum, gallium, . . .)
have the outer configuration ns²np¹ (n = 2 for boron, n = 3 for aluminum, etc.),
and we expect these elements to form compounds as if they had a single valence
p electron. We therefore expect halides such as BCl or GaF, oxides such as B₂O
or Al₂O, nitrides such as B₃N or Al₃N, hydrides such as BH or GaH, and so
forth. Instead we find that boron, aluminum, and gallium generally behave as if
they had three valence electrons, and form compounds such as BCl₃, Al₂O₃, AlN,
and B₂H₆. Furthermore, the three valence electrons seem to be equivalent; there
seems to be no way, for example, to associate two of the valence electrons with
s states and one with a p state. The bonds formed by the three electrons make
equivalent angles of 120° with one another.














It is the effect of sp hybridization that is responsible for the valence of three
(rather than one) in boron and four (rather than two) in carbon. The four bonds in
CH₄ are equivalent and identical, which would not be expected if we had two ss
bonds and two sp bonds; similarly, in BF₃ or BCl₃, the three bonds are identical
and are clearly not identified with two sp bonds and one pp bond.

FIGURE 9.14 Tetrahedral arrangement of molecular bonds in CH₄.


The normal meaning of hybrid is an offspring resulting from the union of
parents of different types, in which the offspring is not exactly like either parent,
but retains some of the attributes of each. In the case of molecules, hybridization
refers to a process by which the states can no longer be identified as either s or
p states, but rather are mixtures of s and p states. The formation of sp hybrids is
normally as follows:


1. In an atom with the configuration 2s²2pⁿ, one of the 2s electrons is excited to
the 2p shell, giving a configuration 2s¹2pⁿ⁺¹.

2. The hybrid states are formed by taking equal mixtures of the wave functions
representing the 2s state and each of the 2p states. For example, in the case of
boron, the configuration of 2s²2p¹ is converted to 2s¹2p². Assuming the 2p
states to be 2p_x and 2p_y, the resultant hybrid wave functions can be represented
as different combinations of ψ_2s, ψ_2px, and ψ_2py, such as


$$\psi = \psi_{2s} + \psi_{2p_x} + \psi_{2p_y} \quad\text{or}\quad \psi = \psi_{2s} - \psi_{2p_x} + \psi_{2p_y} \quad\text{or}\quad \psi = \psi_{2s} + \psi_{2p_x} - \psi_{2p_y}$$


Illustrations of the probability distributions expected for sp, sp², and sp³ hybrids
are shown in Figure 9.15. Keep in mind that these do not yet represent molecular
states; they are merely the contribution of one of the atoms to the bonding
electronic distributions of the molecule.

The tetrahedral structure of CH₄ is therefore merely the result of the symmetrical
spatial arrangement of the four sp³ hybrid states of C, with each hybrid state
bonded to one H. The bond angle of such a symmetrical tetrahedron is 109.5°,
in good agreement with the measured bond angles of CH₄ and other sp³ hybrids,
shown in Table 9.4.
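
The 109.5° figure is purely geometric and can be verified from the directions of the four bonds. The sketch below places the bonds toward alternating corners of a cube (a standard construction of a regular tetrahedron, assumed here for illustration) and, for comparison, computes the angle for three evenly spaced coplanar bonds.

```python
import math

def angle_deg(u, v):
    """Angle in degrees between two 3-vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return math.degrees(math.acos(dot / (norm_u * norm_v)))

# Bond directions from the central atom toward alternating corners of a cube.
tetrahedral = [(1, 1, 1), (1, -1, -1), (-1, 1, -1), (-1, -1, 1)]
# Three coplanar directions spaced evenly around a circle.
trigonal = [(math.cos(2 * math.pi * k / 3), math.sin(2 * math.pi * k / 3), 0.0)
            for k in range(3)]

tetra_angle = angle_deg(tetrahedral[0], tetrahedral[1])
trig_angle = angle_deg(trigonal[0], trigonal[1])
print(f"tetrahedral: {tetra_angle:.1f} deg, trigonal planar: {trig_angle:.1f} deg")
```

The tetrahedral angle is arccos(−1/3), which rounds to 109.5°; the planar arrangement gives the 120° quoted for three equivalent bonds.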





FIGURE 9.15 Probability distributions in sp, sp², and sp³ hybrids.

FIGURE 9.16 Molecular bonding in C₂H₄. For clarity the p_z overlap is not shown;
the dashed arrows indicate the bond formed by the unhybridized p_z states.

TABLE 9.4 Bond Angles of sp³ Hybrids

Molecule       Bond Angle
CH₄            109.5°
C₂H₆           109.3°
C₂Cl₆          109.3°
[illegible]    [illegible]
CH₃Cl          110.5°
SiHF₃          108.2°
SiH₃Cl         110.2°
GeHCl₃         108.3°
GeH₃Cl         110.9°

It is also possible for only two 2p electrons in the 2s¹2p³ configuration of
carbon to become involved in hybrid states; the third 2p electron then is available,
for example, to form ordinary pp molecular states. The ethylene molecule C₂H₄ is
an example of such a structure. The three sp² hybrids form bond angles of 120°;
each carbon atom has two of its three sp² hybrid states joined to a H atom, and
the third is joined to the other carbon. The unhybridized p electron also forms a
bond between the two carbons, in a manner similar to that shown in Figure 9.9b
for the “off-axis” pp molecular states. Figure 9.16 shows a representation of the
bonding in C₂H₄. There are two bonds between the pair of carbons, one from the
sp² hybrid and the other from the unhybridized p_z state. Another example of sp²
hybrids in carbon is benzene (C₆H₆), in which each carbon is joined to one H
and two other carbons by the sp² hybrids, with again one unhybridized p orbital
available to bond the carbons. The basic structure of benzene is a ring of carbon
atoms, as shown in Figure 9.17, with the expected angle of 120° between the
hybrid states, which gives the molecule its characteristic hexagonal shape.




















FIGURE 9.17 (a) Molecular bonding in benzene. As in Figure 9.16, the overlap of the unhybridized p_z states is not shown,
but the dashed arrows indicate the p_z bonds. (b) Scanning electron microscope image showing electron distribution of the
benzene molecular structure on a copper surface.

FIGURE 9.18 Molecular bonding in C₂H₂. The carbons are joined by three
bonds, one from the p_x hybrid and two from the unhybridized p_y and p_z states
(dashed arrows).


Finally, it is also possible for carbon to form sp hybrids, leaving two unhybridized
p states. Acetylene (C₂H₂) is an example of such a molecule, with the
two carbons now joined by three bonds, one from the sp hybrid and two from the
unhybridized p states. Figure 9.18 shows the bonding arrangement in C₂H₂.

This variety of bonds entered into by carbon is the basis for the varied properties 
of organic molecules, from the simple ones we have studied here, to the complex 
ones that form the basis of living things. However, it is not only carbon that shows 
sp hybridization, but other atoms as well. (Indeed, the failure of NH₃ to show the
expected 90° bond angle could be blamed on sp hybridization rather than on the
repulsion of the H atoms.) It is also possible to have 3s-3p hybrids (silicon) and
4s-4p hybrids (germanium). It is this hybridization that gives these materials,
like carbon, a valence of 4 and a symmetrical bonding arrangement, which are 
partly responsible for the usefulness of Si and Ge as semiconducting materials, as 
we will discuss in Chapter 11. It is also interesting to speculate on the possibility 
of a new type of organic chemistry, including new life forms, based on Si or Ge, 
rather than C (Figure 9.19). 





FIGURE 9.19 Artist’s image of an alien landscape populated by life forms
based on silicon. Because silicon is similar to carbon in its chemical behavior,
it can bond with hydrogen and oxygen to form complex molecules. Creatures
based on silicon would look crystalline in appearance and be able to survive
in high-temperature environments.


9.3 IONIC BONDING


In covalent bonding, as we have seen, the bonding electrons do not belong to
any particular atom in the molecule, but rather are shared among the atoms. It
is also possible to form a molecule that results from the extreme opposite case
in which valence electrons are not shared but instead spend all of their time in
the neighborhood of only one of the atoms of the molecule. Consider the ionic
molecule NaCl. Suppose we have a neutral sodium atom (1s²2s²2p⁶3s¹), with one
3s electron outside of a filled shell, and a neutral chlorine atom (1s²2s²2p⁶3s²3p⁵),
which lacks one electron from a filled 3p shell. To remove the outer electron
from Na requires 5.14 eV, the ionization energy of Na, and we are left with a
positively charged Na⁺ ion. If we then attach that electron to the Cl atom, creating
a negatively charged Cl⁻ ion, the energy released is 3.61 eV, the electron affinity
of Cl. The energy is released because the filled 3p shell is an especially stable
configuration, which is energetically very favorable. Thus, if we borrow 5.14 eV
to ionize the Na, we get back immediately 3.61 eV by attaching the electron to
the Cl. We can get back the remaining 1.53 eV (= 5.14 eV − 3.61 eV) by moving
the Na⁺ and Cl⁻ close enough together that their Coulomb potential energy is
−1.53 eV. The separation distance corresponding to this potential energy is found
from the potential energy equation, U = q₁q₂/4πε₀R:


$$R = \frac{e^2}{4\pi\varepsilon_0}\,\frac{1}{|U|} = \frac{1.44\ \mathrm{eV\cdot nm}}{1.53\ \mathrm{eV}} = 0.941\ \mathrm{nm}$$





That is, as long as the Cl⁻ and Na⁺ are closer together than 0.941 nm, the Coulomb
attraction will supply enough energy to overcome the difference between the
ionization energy of Na and the electron affinity of Cl. Put another way, Na⁺ and
Cl⁻ ions separated by less than 0.941 nm have a more stable configuration than
neutral Na and Cl atoms.
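
The energy bookkeeping in this argument can be reproduced with a few lines of arithmetic. The sketch below uses the values quoted in the text; the sample separation of 0.5 nm is an arbitrary illustrative choice, and Pauli repulsion is deliberately left out, so the result applies only where the ions are not yet overlapping.

```python
E2_OVER_4PI_EPS0 = 1.44       # eV*nm, the combination e^2 / (4 pi epsilon_0)
IONIZATION_ENERGY_NA = 5.14   # eV, energy needed to remove the 3s electron of Na
ELECTRON_AFFINITY_CL = 3.61   # eV, energy released when Cl captures an electron

def coulomb_energy(r_nm):
    """Potential energy in eV of two opposite unit charges separated by r_nm."""
    return -E2_OVER_4PI_EPS0 / r_nm

delta_e = IONIZATION_ENERGY_NA - ELECTRON_AFFINITY_CL  # net cost of forming the ion pair
r_max = E2_OVER_4PI_EPS0 / delta_e                     # separation where |U| equals delta_e

# Energy of the ion pair relative to neutral atoms at a sample separation (no Pauli term).
sample_r_nm = 0.5
relative_energy = delta_e + coulomb_energy(sample_r_nm)

print(f"delta E = {delta_e:.2f} eV, R = {r_max:.3f} nm")
print(f"at {sample_r_nm} nm the ion pair lies {relative_energy:.2f} eV relative to neutral atoms")
```

A negative relative energy at separations below 0.941 nm is what makes the ionic configuration favorable.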

However, we cannot push the Nat and Cl” ions too close together, because 
eventually the filled p shells begin to overlap, which causes the ions to repel one 
another. The Pauli principle forbids additional electrons in either filled p shell. 












































Forcing the two ions closer together therefore requires that some of the electrons 
be pushed from the 2p or 3p shells into a higher shell, in order to “make room” 
for the overlapping electrons. Because this takes energy, we must add 
energy to the Na⁺ + Cl⁻ system in order to reduce the separation of the ions 
beyond a certain point. We can imagine this energy as a sort of “potential 
energy” of repulsion, which increases rapidly as we try to force the ions close 
together. 

In summary, when the ions are far apart, they attract one another, and when 
they are too close together, they repel one another; in between there must be an 
equilibrium position where the attractive and repulsive forces are balanced. It is 
this equilibrium position that determines the size of an ionic molecule. 

FIGURE 9.20 Molecular energy of NaCl. The zero of the energy scale 
represents neutral Na and Cl atoms. The solid curve is the sum of the three 
contributions to the molecular energy. (Plot of energy in eV versus separation R 
in nm, showing the Pauli repulsion $U_R$ and the Coulomb potential energy $U_C$; 
the separations 0.236 nm and 0.941 nm are marked.) 

Figure 9.20 shows the energy of the NaCl molecule as a function of the 
separation distance between the nuclei. Taking the energy zero point to refer to 
the neutral atoms, we can separate the molecular energy into three terms: the 
constant ΔE = 1.53 eV, which represents the difference in energy between the 
ions and the neutral atoms; the Coulomb attraction $U_C$ between the ions; and the 
Pauli “repulsion” $U_R$, which is represented in Figure 9.20 as a potential energy 
that rises rapidly for small R and falls to 0 for values of R that exceed the sum of 
the ionic radii, at which point the electron shells no longer overlap and the Pauli 
repulsion does not occur. The sum of these three terms gives the molecular energy 
of NaCl, which shows a minimum at an equilibrium separation $R_{\rm eq}$ of 0.236 nm. 
At that distance the energy is −4.26 eV, so the binding energy or dissociation 
energy (the energy needed to split the molecule into neutral atoms) is 4.26 eV. 
Table 9.5 shows the equilibrium separation and dissociation energy of several 
ionic molecules. 

TABLE 9.5 Properties of Some Ionic Diatomic Molecules 

Molecule   Dissociation Energy (eV)   Equilibrium Separation (nm) 
NaCl       4.26                       0.236 
NaF        4.99                       0.193 
NaH        2.08                       0.189 
LiCl       4.86                       0.202 
LiH        2.47                       0.239 
KCl        4.43                       0.267 
KBr        3.97                       0.282 
RbF        5.12                       0.227 
RbCl       4.64                       0.279 


Remember that we are concerned here with isolated molecules and not with 
collections of molecules in solids. When we speak of NaCl in this section, we 
mean not the solid salt but rather a gas of NaCl molecules. The spacing between 
atoms in a solid can be very different from the spacing in a molecule. 


Example 9.2 

(a) What is the value of the Pauli repulsion energy of NaCl 
at the equilibrium separation? (b) Estimate the value of the 
Pauli repulsion energy at a separation of 0.1 nm. 

Solution 
(a) At the equilibrium separation distance, the Coulomb 
energy is 

$$U_C = -\frac{1}{4\pi\varepsilon_0}\,\frac{e^2}{R_{\rm eq}} = -\frac{1.44\ \text{eV}\cdot\text{nm}}{0.236\ \text{nm}} = -6.10\ \text{eV}$$

and the Pauli repulsion energy can be found from the 
molecular energy E: 

$$U_R = E - U_C - \Delta E = -4.26\ \text{eV} - (-6.10\ \text{eV}) - 1.53\ \text{eV} = 0.31\ \text{eV}$$

(b) From Figure 9.20 we estimate E = +4.0 eV at R = 
0.1 nm. The Coulomb energy at R = 0.1 nm is −14.4 eV, 
and the repulsion energy is 

$$U_R = E - U_C - \Delta E = +4.0\ \text{eV} - (-14.4\ \text{eV}) - 1.53\ \text{eV} = 16.9\ \text{eV}$$


Note how rapidly the repulsion energy increases at 
small R. 
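
The two parts of this example follow the same recipe, which can be sketched in Python as below. The function names are illustrative, and the total energies passed in are the values used in the example (the second one is only an estimate read from Figure 9.20).

```python
COULOMB_CONSTANT_EV_NM = 1.44   # e^2 / (4 pi epsilon_0), in eV*nm
DELTA_E_EV = 1.53               # energy of the separated ions relative to the neutral atoms


def coulomb_energy_ev(r_nm):
    """Coulomb potential energy of the Na+ and Cl- ions at separation r_nm."""
    return -COULOMB_CONSTANT_EV_NM / r_nm


def pauli_repulsion_ev(total_energy_ev, r_nm):
    """Repulsion energy left over once the Coulomb term and the constant
    Delta E are subtracted from the total molecular energy."""
    return total_energy_ev - coulomb_energy_ev(r_nm) - DELTA_E_EV


print(f"U_R at 0.236 nm: {pauli_repulsion_ev(-4.26, 0.236):.2f} eV")
print(f"U_R at 0.1 nm:   {pauli_repulsion_ev(4.0, 0.1):.1f} eV")
```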





Covalent and ionic bonding represent two opposite extreme cases, one in which 
electrons are shared between two atoms and the other in which the electrons always 
are associated with one of the atoms. How can we decide whether two atoms 
will join by covalent or ionic bonding? The answer depends on the willingness of 
the atoms to share their electrons, or equivalently the degree to which one atom 
dominates the other in its desire to have the valence electrons all to itself. 

For homonuclear diatomic molecules, we expect a purely covalent bond; 
because the two atoms are exactly alike, neither can dominate and the electron 
is completely shared between them. For heteronuclear molecules, however, the 
situation is quite different. There can be no purely ionic or purely covalent 
bond. The two atoms have different atomic numbers and different electronic 
configurations, so the wave function of a valence or bonding electron, even if 
shared, will not be exactly the same near one atom as it is near the other. The 
electron must therefore spend more time near one atom than near the other; this 
means that one atom may have a slight excess of negative charge and the bond, 
even if we think of it as being covalent, will also have a small ionic character. 
Conversely, a “pure” ionic bond has a small covalent character; we learned in 
Chapter 7 that electron wave functions do not suddenly drop off to zero amplitude, 
but instead fall exponentially to zero. Therefore, even in an ionic molecule like 
NaCl, the wave function of the electron that was transferred to the Cl⁻ ion is not 
zero at the location of the Na⁺ ion; the wave function may indeed have a very 
small amplitude at the Na⁺ ion, but it is not zero. The electron thus spends some 
of its time, even if very little, being shared between the atoms, and the bonding, 
while mostly ionic, has a small covalent character as well. 

An empirical way to characterize the relative ionic character of a molecule 
is based on the electric dipole moment of the molecule. An electric dipole, as 
illustrated in Figure 9.21, consists of two charges +q and −q separated by a 
distance r. The electric dipole moment of this arrangement is defined as 

$$p = qr \tag{9.4}$$

In a purely covalent molecule, there is no excess charge on either atom, so in 
effect q = 0 and the dipole moment is expected to be zero. In an ionic molecule, 
on the other hand, a net positive charge resides on one atom and a net negative 
charge on the other, so the dipole moment is not zero. 

If NaCl were a purely ionic molecule, we would expect a dipole moment of 


$$p = qR_{\rm eq} = (1.60 \times 10^{-19}\ \text{C})(0.236 \times 10^{-9}\ \text{m}) = 3.78 \times 10^{-29}\ \text{C}\cdot\text{m}$$

The measured electric dipole moment of NaCl is 3.00 × 10⁻²⁹ C·m, which is 
0.79 or 79% of the maximum value corresponding to a purely ionic bond. Because 
NaCl is only partially ionic, the measured dipole moment is smaller than the full 
ionic value. By taking the ratio between the measured electric dipole moment and 
the maximum value calculated from Eq. 9.4, we can determine a measure of the 
fractional ionic character of the molecular bond: 


$$\text{fractional ionic character} = \frac{p_{\rm measured}}{qR_{\rm eq}} \tag{9.5}$$
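
As an illustration, the NaCl numbers quoted above can be run through Eq. 9.5 in a short Python sketch. The function names are illustrative; the charge, separation, and measured dipole moment are the values given in the text.

```python
ELEMENTARY_CHARGE_C = 1.60e-19   # magnitude of the charge on each ion, in coulombs


def ionic_dipole_moment_c_m(separation_nm):
    """Dipole moment p = q * R_eq expected for a purely ionic bond (Eq. 9.4)."""
    return ELEMENTARY_CHARGE_C * separation_nm * 1e-9


def fractional_ionic_character(measured_dipole_c_m, separation_nm):
    """Ratio of the measured dipole moment to the purely ionic value (Eq. 9.5)."""
    return measured_dipole_c_m / ionic_dipole_moment_c_m(separation_nm)


p_max = ionic_dipole_moment_c_m(0.236)
fraction = fractional_ionic_character(3.00e-29, 0.236)
print(f"Purely ionic dipole moment: {p_max:.2e} C*m")
print(f"Fractional ionic character of NaCl: {fraction:.2f}")
```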
Linus Pauling (1901-1994, United States). Known mostly as a chemist, 
his work on molecular bonds transcends the traditional chemistry-physics 
boundary. He is the only recipient of two unshared Nobel Prizes, the 
chemistry prize in 1954 for his work on molecular bonds and the peace 
prize in 1963 for his efforts toward a ban on the testing of nuclear weapons. 





FIGURE 9.21 An electric dipole. 
TABLE 9.6 Electronegativities of Some Elements 

H  2.20 
Li 0.98   Be 1.57   C  2.55   N  3.04   O  3.44   F  3.98 
Na 0.93   Mg 1.31   Si 1.90   P  2.19   S  2.58   Cl 3.16 
K  0.82   Ca 1.00   Ge 2.01   As 2.18   Se 2.55   Br 2.96 
Rb 0.82   Sr 0.95   Sn 1.96   Sb 2.05   Te 2.10   I  2.66 
Cs 0.79   Ba 0.89 
































Is there a property of atoms that allows us to predict whether they will join with 
one another in ionic or covalent bonds? One property that has some predictive 
value is the electronegativity, which can be roughly defined as the capability of an 
atom to attract electrons when it forms chemical bonds. The electronegativity of 
an atom can be computed from the sum of the energy cost of removing an electron 
(the ionization energy) and the energy gain in adding an electron (the electron 
affinity). Table 9.6 shows the electronegativity values for some elements. 

If the two atoms in a molecule have equal electronegativities, they have equal 
tendencies to attract electrons and should therefore form covalent bonds. If the 
two electronegativities are very different, one atom will have a greater tendency 
to attract electrons; as a result, there will be a net negative charge on one atom 
and a net positive charge of equal magnitude on the other. The molecular bonding 
will then have at least a partial ionic character. 

We therefore expect a rough relationship between the fractional ionic character 
(determined by comparing the measured electric dipole moment with its maximum 
expected value) and the magnitude of the difference between the electronegativity 
values. Figure 9.22 shows the fractional ionic characters plotted against the 
electronegativity differences. Although the points are scattered, you can see that 
there is indeed a direct relationship between the quantities: molecules with small 
electronegativity differences have only a small ionic character and therefore form 
covalent bonds, while molecules with large electronegativity differences form 
mostly ionic bonds. 

FIGURE 9.22 The fractional ionic character of bonds in diatomic molecules. 
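
A quick way to see which molecules should sit toward the ionic end of this trend is to tabulate the electronegativity differences. The sketch below uses a handful of values from Table 9.6; the dictionary and function names are illustrative, and the code only computes the differences, without attempting to predict a fractional ionic character.

```python
# A few electronegativity values taken from Table 9.6
ELECTRONEGATIVITY = {
    "H": 2.20, "Na": 0.93, "K": 0.82, "Rb": 0.82,
    "F": 3.98, "Cl": 3.16, "Br": 2.96,
}


def electronegativity_difference(atom_a, atom_b):
    """Magnitude of the electronegativity difference between two atoms."""
    return abs(ELECTRONEGATIVITY[atom_a] - ELECTRONEGATIVITY[atom_b])


for a, b in [("H", "H"), ("H", "Cl"), ("K", "Br"), ("Na", "Cl"), ("Rb", "F")]:
    print(f"{a}-{b}: difference = {electronegativity_difference(a, b):.2f}")
```

The homonuclear pair gives a difference of zero, as expected for a purely covalent bond, while the alkali halides give the largest differences.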


9.4 MOLECULAR VIBRATIONS 


So far we have considered the properties of molecules based on the configurations 
of their electrons. The electrons largely determine the strength of the molecular 
bonds and their geometry. Molecules can absorb or emit energy by changing 
the configuration of their electrons, just as atoms can do. However, molecules 
can also absorb or emit energy, generally in much smaller quantities, in other 
ways. One way is for the atoms in a molecule to vibrate about their equilibrium 
positions, just like a mass on a spring. Another way is for the atoms to rotate 
about the center of mass of the molecule. We consider molecular vibrations in this 
section and rotations in the next section. The vibrational and rotational energies 
are often unique to particular molecules and can serve as a kind of “fingerprint” 
in identifying simple molecules. 


Vibrations in Quantum Mechanics 


A classical vibrating system such as a mass m on a spring of force constant k 
has an oscillation frequency $\omega = \sqrt{k/m}$. (Here we are dealing with the angular 
or circular frequency $\omega = 2\pi f$, measured in radians/second.) The oscillator has 
potential energy $U = \tfrac{1}{2}kx^2$, and with maximum amplitude $x_m$ its (constant) total 
energy is $E = \tfrac{1}{2}kx_m^2$. For the classical oscillator, there are no restrictions on the 
total energy or the oscillator frequency: any value of the energy is allowed, and 
the frequency and energy can be varied independently. 


As we learned in Chapter 5, the quantum oscillator behaves very differently. 
Only certain values of the energy are allowed. The allowed energy values, for an 
oscillator that can move in only one dimension, are 

$$E_N = \left(N + \tfrac{1}{2}\right)\hbar\omega = \left(N + \tfrac{1}{2}\right)hf \qquad N = 0, 1, 2, 3, \ldots \tag{9.6}$$

where ω is the classical oscillator frequency. The ground-state energy is 
$\tfrac{1}{2}\hbar\omega = \tfrac{1}{2}hf$, and the excited states are equally spaced with energy differences of $\hbar\omega = hf$. 
Figure 9.23 shows the excited states of a one-dimensional quantum oscillator. 
Note that the ground state does not have E = 0; this is a consequence of the 
uncertainty principle: an oscillator with E = 0 would have a displacement of 
exactly zero and also a momentum of exactly zero, which would violate the 
position-momentum form of the uncertainty principle. The minimum energy of 
$\tfrac{1}{2}hf$ is often called the zero-point energy, and it can have observable consequences 
(see Problem 31). 

FIGURE 9.23 Some energy levels and transitions in a one-dimensional 
quantum oscillator. (Levels run from the ground state, N = 0 with $E = \tfrac{1}{2}hf$, 
up to N = 5 with $E = \tfrac{11}{2}hf$.) 

FIGURE 9.24 The two atoms in a vibrating diatomic molecule. The center 
of mass of the molecule remains fixed at the origin of the coordinate system. 

The oscillator can emit electromagnetic radiation in jumps to lower states, or it 
can absorb radiation in jumps to higher states. However, the jumps cannot occur 
in arbitrary steps. The allowed transitions are those that change N by one unit. 
This restriction is called a selection rule and applies only to transitions that occur 
through the emission or absorption of electromagnetic radiation. 


Vibrational selection rule: $\Delta N = \pm 1$ (9.7) 


Other causes of jumps, such as collisions, are not subject to this restriction. 
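
The level scheme and the selection rule can be put into a few lines of Python to make the consequence explicit: every allowed radiative jump involves a photon of the same energy hf. This is a sketch with illustrative function names; energies are expressed in whatever unit is used for hf.

```python
def oscillator_energy(n, hf):
    """Energy of level N of a one-dimensional quantum oscillator (Eq. 9.6)."""
    return (n + 0.5) * hf


def is_allowed_radiative_jump(n_initial, n_final):
    """Vibrational selection rule (Eq. 9.7): N must change by exactly one unit."""
    return abs(n_final - n_initial) == 1


def photon_energy(n_initial, n_final, hf):
    """Photon energy for an allowed emission or absorption between two levels."""
    if not is_allowed_radiative_jump(n_initial, n_final):
        raise ValueError("jump violates the selection rule for radiation")
    return abs(oscillator_energy(n_final, hf) - oscillator_energy(n_initial, hf))


hf = 1.0  # work in units of hf
for n in range(1, 5):
    print(f"N = {n} -> N = {n - 1}: photon energy = {photon_energy(n, n - 1, hf)} hf")
```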

Although this model of a quantum oscillator is oversimplified, we do find in 
nature many systems that do exhibit this type of behavior. In a molecule, the 
individual atoms can vibrate relative to the center of mass. A nucleus, as we 
discuss in Chapter 12, is sometimes modeled as a vibrating liquid drop. Both 
of these systems have some properties in common with the quantum oscillator, 
although the effective force constant k is not quite constant in those systems, 
which can lose stiffness as the excitation energy increases. In a solid, the atoms 
can often behave like three-dimensional oscillators, the properties of which are 
different from the one-dimensional oscillator we are considering here. We will 
discuss the oscillations of the atoms in a three-dimensional solid from a statistical 
viewpoint in Chapter 11. 


Vibrating Diatomic Molecules 


A molecule with more than two atoms can vibrate in many different ways, so for 
simplicity we’ll consider only molecules with two atoms. Figure 9.24 shows a 
representation of the two atoms in a diatomic molecule, which oscillate so that 
the center of mass remains fixed. 

To calculate the frequencies of the emitted transitions, we must know the mass 
m and the effective force constant k. Because both atoms in the molecule are 
participating in the vibration, the vibrating mass is not the mass of either atom 
alone but instead is a combination of both masses. Let the masses of the two 
atoms be $m_1$ and $m_2$. As they both pass through their equilibrium positions, the 
total energy of the molecule is only kinetic energy, so 

$$E_T = \tfrac{1}{2}m_1v_1^2 + \tfrac{1}{2}m_2v_2^2 = \frac{p_1^2}{2m_1} + \frac{p_2^2}{2m_2} \tag{9.8}$$

In a frame of reference in which the center of mass of the molecule is fixed, 
the total momentum is zero, so the two momenta have equal magnitudes, 
$p_1 = p_2$, and the energy can be written (with $p = p_1 = p_2$) as 

$$E_T = \frac{1}{2}p^2\left(\frac{1}{m_1} + \frac{1}{m_2}\right) = \frac{1}{2}p^2\left(\frac{m_1 + m_2}{m_1m_2}\right) = \frac{p^2}{2m} \tag{9.9}$$

where 

$$m = \frac{m_1m_2}{m_1 + m_2} \tag{9.10}$$

That is, the energy of the system is the same as if it were a single mass m, moving 
with momentum p. This m is a sort of effective mass of the whole molecule and 
is known as the reduced mass. It is the mass we should use in calculating the 
vibrational frequency. (Previously we used the reduced mass of the electron and 
proton in our analysis of the hydrogen atom—see Section 6.8.) 
Notice that $m = m_1/2$ when $m_1 = m_2$, as in a homonuclear molecule; the 
effective mass is half the mass of an individual atom. Whenever one mass is much 
greater than the other, the reduced mass has a value nearly equal to that of the 
lighter mass. This is consistent with our expectation, because the inertia of the 
heavier mass reduces its tendency to move, and most of the vibrational motion is 
done by the lighter mass. 
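
Both limiting cases are easy to confirm numerically. In the sketch below the function name is illustrative, the hydrogen mass of 1.008 u is the value used later in this chapter, and the chlorine mass of 34.97 u is an assumed value for the isotope ³⁵Cl that does not come from the text.

```python
def reduced_mass(m1, m2):
    """Reduced mass m = m1*m2/(m1 + m2) of a two-body system (Eq. 9.10)."""
    return m1 * m2 / (m1 + m2)


M_H_U = 1.008      # mass of a hydrogen atom in u
M_CL35_U = 34.97   # assumed mass of a chlorine-35 atom in u (not from the text)

print(f"H2:  m = {reduced_mass(M_H_U, M_H_U):.3f} u (half the atomic mass)")
print(f"HCl: m = {reduced_mass(M_H_U, M_CL35_U):.3f} u (close to the hydrogen mass)")
```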


The calculation of the effective force constant for vibrational motion is 


illustrated in the following example. 





Example 9.3 

Find the vibrational frequency and photon energy for H₂. 

Solution 
To find the frequency, we must know the force constant k. 
To estimate k, we consider the molecule to behave like a 
simple harmonic oscillator in the vicinity of its equilibrium 
separation $R_{\rm eq}$, and we fit a parabolic oscillator potential 
energy $U = \tfrac{1}{2}kx^2$ to the molecular energy in that region. 
Figure 9.25 shows the region of the energy minimum and 
a parabola that approximates the curve near the minimum. 
The equation of the parabola is 

$$E - E_{\rm min} = \tfrac{1}{2}k(R - R_{\rm eq})^2 \tag{9.11}$$

The constant k can be estimated from the graph by finding 
the value of $R - R_{\rm eq}$ that is necessary for a certain value 
of $E - E_{\rm min}$. As shown in the figure, when $E - E_{\rm min} = 0.50$ eV, 
the value of $R - R_{\rm eq}$ is ½(0.034 nm) = 0.017 nm. 
Solving for k, we obtain 

$$k = \frac{2(E - E_{\rm min})}{(R - R_{\rm eq})^2} = \frac{2(0.50\ \text{eV})}{(0.017\ \text{nm})^2} = 3.5 \times 10^{3}\ \text{eV/nm}^2 = 3.5 \times 10^{21}\ \text{eV/m}^2$$

The reduced mass of molecular hydrogen is half the mass 
of a hydrogen atom. We can now calculate the vibrational 
frequency: 

$$f = \frac{1}{2\pi}\sqrt{\frac{k}{m}} = \frac{1}{2\pi}\sqrt{\frac{kc^2}{mc^2}} = \frac{1}{2\pi}\sqrt{\frac{(3.5 \times 10^{21}\ \text{eV/m}^2)(3.00 \times 10^{8}\ \text{m/s})^2}{(0.5)(1.008\ \text{u})(931.5 \times 10^{6}\ \text{eV/u})}} = 1.3 \times 10^{14}\ \text{Hz}$$

The corresponding photon energy is 

$$E = hf = (4.14 \times 10^{-15}\ \text{eV}\cdot\text{s})(1.3 \times 10^{14}\ \text{Hz}) = 0.54\ \text{eV}$$

The wavelength of this radiation is 2.3 μm, which is in 
the infrared region of the spectrum. Molecular vibrations 
typically give photons in the infrared. 

Note that the parabola gives a reasonable approximation 
to the actual energy curve only for energies up to about 
1 eV above the minimum, which corresponds to the excited 
state with N = 2. If we were to excite the H₂ molecule to 
energies greater than 1 eV above its ground state, we would 
expect to see deviations from the behavior predicted by 
the simple harmonic oscillator. In particular, all the transitions 
would no longer have the same energy, and changes 
other than ΔN = ±1 may occur. In other molecules, the 
harmonic oscillator approximation may remain valid for 
larger vibrational quantum numbers, but eventually it will 
fail in all cases at sufficiently large N. 

Vibrational energies can often be found in tabulations 
of molecular properties. Often these tabulations give the 
energy in units of cm⁻¹. To convert to eV, multiply by the 
conversion factor hc = 1.24 × 10⁻⁴ eV·cm. 
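
The chain of calculations in this example (parabola fit, force constant, frequency, photon energy, wavelength) can be sketched in Python as follows. The function names are illustrative, and the inputs are the values read from Figure 9.25 and the constants quoted in the solution. Because the script carries unrounded intermediate values, its results may differ from the rounded ones above in the last digit.

```python
import math

C_M_PER_S = 3.00e8         # speed of light
H_EV_S = 4.14e-15          # Planck's constant in eV*s
U_EV = 931.5e6             # rest energy of one atomic mass unit in eV


def force_constant_ev_per_m2(energy_above_min_ev, displacement_nm):
    """Solve E - E_min = (1/2) k (R - R_eq)^2 for k and convert nm^-2 to m^-2."""
    k_ev_per_nm2 = 2.0 * energy_above_min_ev / displacement_nm**2
    return k_ev_per_nm2 * 1e18


def vibrational_frequency_hz(k_ev_per_m2, reduced_mass_u):
    """f = (1/2 pi) sqrt(k c^2 / m c^2), with m c^2 expressed in eV."""
    mc2_ev = reduced_mass_u * U_EV
    return math.sqrt(k_ev_per_m2 * C_M_PER_S**2 / mc2_ev) / (2.0 * math.pi)


k = force_constant_ev_per_m2(0.50, 0.017)          # point read from Figure 9.25
f = vibrational_frequency_hz(k, 0.5 * 1.008)       # reduced mass of H2 in u
photon_ev = H_EV_S * f
wavelength_um = C_M_PER_S / f * 1e6

print(f"k = {k:.2e} eV/m^2")
print(f"f = {f:.2e} Hz")
print(f"photon energy = {photon_ev:.2f} eV, wavelength = {wavelength_um:.1f} um")
```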
FIGURE 9.25 Fitting a parabola (dashed line) to the energy 
minimum of H₂. 

Summarizing our conclusions from this section for the vibrational motion 
of molecules, we expect the simple harmonic oscillator to give a reasonable 
description of the motion near the energy minimum, with a sequence of emission 
or absorption photons all of energy hf corresponding to changes of one unit in the 
vibrational quantum number. The emitted or absorbed radiations are generally in 
the infrared region of the spectrum. 

The result of Example 9.3 shows that in a gas of hydrogen molecules at 
room temperature, where the mean translational kinetic energy is about 0.025 eV, 
it is highly unlikely that a collision between molecules can deliver 0.54eV to 
one molecule and set it vibrating. This explains why the vibrational degrees of 
freedom are “frozen out” at room temperature and thus why the heat capacity 
of H₂ does not agree with the value expected for 7 degrees of freedom, as we 
discussed in Chapter 1. 

The vibrational energy of F₂ is 0.11 eV, so even at room temperature (see 
Table 1.1) the heat capacity of fluorine gas is starting to increase above the 
value that is characteristic of 5 degrees of freedom (which would include only 
translational and rotational motions). By 1000 K, where the mean thermal energy 
is around 0.075 eV, the vibrational states of F₂ can be easily excited and so the 
heat capacity of F₂ at 1000 K is characteristic of 7 degrees of freedom. Oxygen 
has a larger vibrational energy (0.20 eV); it is difficult to add this much energy 
in a gas in which the mean energy is only 0.025 eV, and so at 300 K its heat 
capacity remains at the value characteristic of 5 degrees of freedom (translation 
+ rotation); the heat capacity for oxygen increases at 1000 K but not quite to the 
value for 7 degrees of freedom. Nitrogen has an even greater vibrational energy 
(0.29 eV), and so its heat capacity is smaller than that of F₂ or O₂ at 1000 K. 
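
One rough way to make this "freezing out" quantitative is to compare each vibrational energy with the thermal energy scale kT. The sketch below is an illustration only: it assumes the standard value of the Boltzmann constant (not quoted in this section) and uses the simple Boltzmann factor exp(−E/kT) as a crude indicator of how readily the first vibrational state is excited, not as a heat capacity calculation. The vibrational energies are those given in the text.

```python
import math

K_BOLTZMANN_EV_PER_K = 8.617e-5   # Boltzmann constant (standard value, assumed here)

# Vibrational energies quoted in the text, in eV
VIBRATIONAL_ENERGY_EV = {"H2": 0.54, "N2": 0.29, "O2": 0.20, "F2": 0.11}


def thermal_energy_ev(temperature_k):
    """Thermal energy scale kT in eV."""
    return K_BOLTZMANN_EV_PER_K * temperature_k


def boltzmann_factor(energy_ev, temperature_k):
    """exp(-E/kT): small values mean the excitation is effectively frozen out."""
    return math.exp(-energy_ev / thermal_energy_ev(temperature_k))


for temperature in (300.0, 1000.0):
    print(f"T = {temperature:.0f} K, kT = {thermal_energy_ev(temperature):.3f} eV")
    for molecule, energy in VIBRATIONAL_ENERGY_EV.items():
        print(f"  {molecule}: exp(-E/kT) = {boltzmann_factor(energy, temperature):.2e}")
```

The ordering of the factors at 1000 K (F₂ largest, then O₂, N₂, and H₂) matches the ordering of the heat capacities described above.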


9.5 MOLECULAR ROTATIONS 


A second way that a molecule can absorb or emit energy is through rotations 
about its center of mass. The state of the rotational motion of the molecule is 
described by specifying its angular momentum. In this section we discuss the 
special treatment of angular momentum in quantum mechanics and the resulting 
rotational states in diatomic molecules. 


Rotations in Quantum Mechanics 


A classical rigid rotor with rotational inertia I has a rotational kinetic energy 
given by $K = \tfrac{1}{2}I\omega^2 = L^2/2I$, where $L = I\omega$ is the rotational angular momentum. 
(Compare this formula with the more familiar expression for translational kinetic 
energy, $K = \tfrac{1}{2}mv^2 = p^2/2m$, and note that we can get from the translational 
expression to the rotational expression by replacing the linear momentum p with 
the angular momentum L and the mass m with the rotational inertia I.) 

There is no general result for the quantization of linear momentum or translational 
kinetic energy in quantum mechanics, so we have no obvious clues about 
how to quantize the rotational motion. We can imagine a rotor to be a particle of 
mass m rotating on the end of a “massless” rod of length r, but we know that in 
quantum mechanics it is not possible to have rotations in a fixed plane (see the 
discussion in Section 7.2 about the rotational form of the uncertainty relationship). 
The quantum version of the rigid rotor is thus more properly described as a particle 
confined to move in three dimensions on a sphere of radius r rather than in two 
dimensions in a circle of radius r. 


The general mathematical solutions for the wave functions of a three-dimensional 
rotor are beyond the level of this text (some of the solutions 
can be obtained from the angular parts of the wave functions of the hydrogen 
atom), but the resulting energy levels have a particularly simple form: 

$$E_L = \frac{L(L+1)\hbar^2}{2I} \qquad L = 0, 1, 2, 3, \ldots \tag{9.12}$$

where L is the angular momentum quantum number. The rotational levels 
have energies of 0, 2(ħ²/2I), 6(ħ²/2I), 12(ħ²/2I), 20(ħ²/2I), ..., as shown in 
Figure 9.26. Note that, in contrast to the vibrational energy levels, the rotational 
energy levels are not equally spaced: the spacing increases as the energy 
increases. 

We can obtain Eq. 9.12 from the classical expression for the rotational kinetic 
energy by replacing the classical angular momentum with the quantum result 
given by Eq. 7.5, $|\vec{L}| = \sqrt{L(L+1)}\,\hbar$. (Here we are using an upper-case L to 
represent the angular momentum quantum number of a system such as a molecule 
instead of the lower-case l, which represents the angular momentum quantum 
number of a particle.) 

Molecules and nuclei are again good examples of quantum rotors. Because 
molecules and nuclei don’t have rigidly fixed bond lengths, they don’t quite 
follow the rigid rotor spacing exactly. As the angular momentum increases and 
the system rotates at a greater rate, the bonds can stretch somewhat, increasing 
the rotational inertia and slightly decreasing the rotational spacing. Nevertheless, 
the rotational spacing is followed very closely. 

The quantum state of the rotor can be changed through the emission or absorption 
of electromagnetic radiation. As with the vibrational states, there is a selection 
rule that limits the jumps that can occur when radiation is emitted or absorbed: 

Rotational selection rule: $\Delta L = \pm 1$ (9.13) 


The permitted rotational jumps can occur only through a change of one unit in 
the angular momentum quantum number. In practice, we find that this selection 
rule is not absolute, but that electromagnetic transitions that change the angular 
momentum quantum number by one unit are often several orders of magnitude 
more probable than those that change L by other amounts. Some of the transitions 
that change L by one unit are shown in Figure 9.26. Note that the photons emitted 
in these transitions all have different energies, in contrast to the vibrational 
excitations in which all emitted photons have the same energy. 
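
The unequal spacing is easy to see by listing the levels and the allowed emission energies in units of ħ²/2I. The following Python sketch uses illustrative function names and assumes an ideal rigid rotor, so it ignores the bond stretching mentioned above.

```python
def rotor_level(l):
    """Energy of rotational level L in units of hbar^2 / 2I (Eq. 9.12)."""
    return l * (l + 1)


def emitted_photon(l_upper):
    """Photon energy, in the same units, for the allowed jump L -> L - 1."""
    if l_upper < 1:
        raise ValueError("the L = 0 level has no lower level to decay to")
    return rotor_level(l_upper) - rotor_level(l_upper - 1)


levels = [rotor_level(l) for l in range(5)]
photons = [emitted_photon(l) for l in range(1, 5)]
print("levels (units of hbar^2/2I): ", levels)
print("photons (units of hbar^2/2I):", photons)
```

The photon energies grow in equal steps even though the levels themselves are not equally spaced.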


Rotating Diatomic Molecules 


Consider the diatomic molecule shown in Figure 9.27. The origin of the 
coordinate system is at the center of mass of the molecule, so that $x_1m_1 = x_2m_2$. 
The rotational inertia of the molecule is $I = m_1x_1^2 + m_2x_2^2$. We can write this in 
terms of the reduced mass $m = m_1m_2/(m_1 + m_2)$ and the equilibrium separation 
$R_{\rm eq} = x_1 + x_2$ as 

$$I = mR_{\rm eq}^2 \tag{9.14}$$

and the energies of the rotational states are 

$$E_L = \frac{L(L+1)\hbar^2}{2mR_{\rm eq}^2} = BL(L+1) \tag{9.15}$$

where the rotational parameter B is defined for diatomic molecules as 

$$B = \frac{\hbar^2}{2mR_{\rm eq}^2} \tag{9.16}$$

FIGURE 9.26 Some energy levels and transitions of a quantum rotor. 

FIGURE 9.27 A rotating diatomic molecule. 

The emitted or absorbed photons must follow the rotational selection rule ΔL = 
±1, so the energy of an emitted photon is the energy difference between two 
adjacent levels: 

$$\Delta E = E_{L+1} - E_L = B(L+1)(L+2) - BL(L+1) = 2B(L+1) \tag{9.17}$$

In contrast to the transitions among the vibrational excitations, which all had the 
same energy, the energies of the rotational transitions depend on L, as you can 
see in Figure 9.26. The emitted photons have energies 2B, 4B, 6B, .... 


Example 9.4 

Calculate the energies and wavelengths of the three lowest 
radiations emitted by molecular H₂. 

Solution 
The photon energy equals the energy difference between 
the levels, which is given by Eq. 9.17. To evaluate this 
energy, we must first determine the rotational quantity B. 
For H₂, the reduced mass m is half the mass of a hydrogen 
atom. Then 

$$B = \frac{\hbar^2}{2mR_{\rm eq}^2} = \frac{\hbar^2c^2}{2mc^2R_{\rm eq}^2} = \frac{(197.3\ \text{eV}\cdot\text{nm})^2}{2(0.5 \times 1.008\ \text{u} \times 931.5\ \text{MeV/u})(0.074\ \text{nm})^2} = 0.0076\ \text{eV}$$

Equation 9.17 now gives the energies directly, and the 
corresponding wavelengths are found from λ = hc/ΔE. 

L = 1 to L = 0:   ΔE = 2B = 0.0152 eV   λ = 81.6 μm 
L = 2 to L = 1:   ΔE = 4B = 0.0304 eV   λ = 40.8 μm 
L = 3 to L = 2:   ΔE = 6B = 0.0456 eV   λ = 27.2 μm 


The energies of the emitted photons form an arithmetic 
sequence with relative values 1, 2, 3,..., and the emitted 
wavelengths have the inverse sequence 1, 1/2, 1/3,.... 
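
The same calculation can be sketched in Python. The function names are illustrative; the constants are those used in the solution, with hc taken as 2πħc for consistency. The script keeps the unrounded value of B, so its wavelengths come out slightly longer (by about 0.3 μm for the first line) than those obtained above from B rounded to 0.0076 eV.

```python
import math

HBAR_C_EV_NM = 197.3                     # hbar * c in eV*nm
HC_EV_NM = 2.0 * math.pi * HBAR_C_EV_NM  # h * c in eV*nm
U_EV = 931.5e6                           # rest energy of one atomic mass unit in eV


def rotational_parameter_ev(reduced_mass_u, separation_nm):
    """B = (hbar c)^2 / (2 m c^2 R_eq^2), Eq. 9.16 written with m c^2 in eV."""
    mc2_ev = reduced_mass_u * U_EV
    return HBAR_C_EV_NM**2 / (2.0 * mc2_ev * separation_nm**2)


def emission_line(b_ev, l_upper):
    """Photon energy (eV) and wavelength (um) for the jump L -> L - 1 (Eq. 9.17)."""
    delta_e_ev = 2.0 * b_ev * l_upper
    wavelength_um = HC_EV_NM / delta_e_ev / 1000.0
    return delta_e_ev, wavelength_um


b_h2 = rotational_parameter_ev(0.5 * 1.008, 0.074)
print(f"B = {b_h2:.4f} eV")
for l_upper in (1, 2, 3):
    energy, wavelength = emission_line(b_h2, l_upper)
    print(f"L = {l_upper} to L = {l_upper - 1}: {energy:.4f} eV, {wavelength:.1f} um")
```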


The emitted rotational transitions are (like the vibrational transitions) in the 
infrared region, but the wavelengths are one to two orders of magnitude larger than 
those of the vibrational transitions. This region of the spectrum corresponds to 
the far infrared and microwave radiations. 

In contrast to the vibrational motion, for which the minimum energy was 
0.54 eV, the minimum energy for the rotational motion in H₂ is only 0.015 eV. In 
hydrogen gas at room temperature, where the mean translational kinetic energy is 
0.025 eV, it is easily possible for a collision between two molecules to cause one 
of them to start rotating. Thus the rotational motion can occur at room temperature, 
and H₂ behaves as if it has 2 rotational degrees of freedom (see Section 1.3). 
Gases whose molecules have greater mass than hydrogen have smaller rotational 
energies and their rotational states are even more easily excited in collisions, so for 
most gases the rotational degrees of freedom are present at ordinary temperatures. 


Example 9.5 

Figure 9.28 shows a portion of the rotational absorption 
spectrum of a molecule. Determine the rotational inertia of 
the molecule. 

Solution 
Each peak in the absorption spectrum corresponds to the 
molecule moving from one level of Figure 9.26 to the 
next higher level after absorbing a photon of the proper 
energy or frequency. The frequencies can be found from 
Eq. 9.17: 

$$f = \frac{\Delta E}{h} = (L+1)\frac{\hbar}{2\pi I}$$

where Eq. 9.14 has been used to replace $mR_{\rm eq}^2$ with the 
rotational inertia I. This gives the frequency of each peak 
in Figure 9.28, but doesn’t allow us to find I because 
we don’t know the L value for the peaks. We can avoid 
this problem by calculating the difference in frequency Δf 
between adjacent peaks L and L + 1: 

$$\Delta f = (L+2)\frac{\hbar}{2\pi I} - (L+1)\frac{\hbar}{2\pi I} = \frac{\hbar}{2\pi I}$$

or, estimating the spacing Δf between the peaks of 
Figure 9.28 as 6.2 × 10¹¹ Hz, 

$$I = \frac{\hbar}{2\pi\,\Delta f} = \frac{1.05 \times 10^{-34}\ \text{J}\cdot\text{s}}{2\pi(6.2 \times 10^{11}\ \text{Hz})} = 2.7 \times 10^{-47}\ \text{kg}\cdot\text{m}^2 = 0.016\ \text{u}\cdot\text{nm}^2$$

This value would correspond to, for example, a molecule 
with a reduced mass of 1 u (such as one consisting of 
hydrogen combined with a much heavier atom) and an 
equilibrium separation of 0.13 nm. 
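
As a check on the unit conversions in this example, the steps can be sketched in Python. The function names are illustrative, and the value of the atomic mass unit in kilograms is the standard constant, which is not quoted in the example itself.

```python
import math

HBAR_J_S = 1.05e-34      # hbar in J*s, as used in the example
U_KG = 1.6605e-27        # one atomic mass unit in kg (standard value, assumed here)


def rotational_inertia_kg_m2(peak_spacing_hz):
    """I = hbar / (2 pi * delta f), from the spacing of adjacent absorption peaks."""
    return HBAR_J_S / (2.0 * math.pi * peak_spacing_hz)


def kg_m2_to_u_nm2(inertia_kg_m2):
    """Convert a rotational inertia from kg*m^2 to u*nm^2."""
    return inertia_kg_m2 / U_KG * 1e18


def separation_nm(inertia_u_nm2, reduced_mass_u):
    """Equilibrium separation from I = m * R_eq^2."""
    return math.sqrt(inertia_u_nm2 / reduced_mass_u)


inertia = rotational_inertia_kg_m2(6.2e11)
inertia_u = kg_m2_to_u_nm2(inertia)
print(f"I = {inertia:.1e} kg*m^2 = {inertia_u:.3f} u*nm^2")
print(f"R_eq for a reduced mass of 1 u: {separation_nm(inertia_u, 1.0):.2f} nm")
```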





FIGURE 9.28 A molecular absorption spectrum. (Plot of intensity versus 
frequency, in units of 10¹² Hz.) 





Summarizing this section, we have seen that the rotational motion of a 
molecule results in a sequence of energy levels that are not equally spaced. 
The rotational energy levels are spaced one to two orders of magnitude closer than the 
vibrational levels. The emitted radiations form an ascending series of energies (or 
a descending series of wavelengths) in the far infrared or microwave region of 
the spectrum. The radiations are restricted by the selection rule that permits the 
rotational quantum number L to change by only one unit. 


9.6 MOLECULAR SPECTRA 


A complex molecule can absorb or emit energy in a variety of ways, as indicated 
schematically by the energy-level diagram of Figure 9.29. The emitted or absorbed 
energy can change the electronic state (in analogy with the change of energy 
levels of electrons in atoms). The energy necessary to make this change is of the 
order of eV, corresponding to photons in the visible range of the spectrum. Within 
the energy minimum of any electronic state, there are vibrational and rotational 
states. The vibrational states are equally spaced and have energy separations of 
typically 0.1 to 1 eV. The rotational states are not equally spaced, and they have 
a smaller energy separation of typically 0.01 to 0.1 eV. Because the rotational 
spacing is much smaller than the vibrational spacing, it is convenient to consider 
each vibrational state as providing the basis on which a sequence of rotational 
states is built. We discuss here only the vibrational and rotational structure, not 
the electronic excited states. 

FIGURE 9.29 Electronic, vibrational, and rotational energy levels of a 
molecule. 

FIGURE 9.30 Combined rotational and vibrational energy levels of a 
molecule. Transitions marked with an X violate the selection rules and so are 
forbidden or strongly suppressed. 

Figure 9.30 shows a detail of the rotational and vibrational structure. The states 
are labeled with the vibrational quantum number N and the rotational quantum 
number L. Depending on their properties, some molecules can show a purely 
vibrational structure (such as Figure 9.23) and others can show a purely rotational 
structure (such as in Figure 9.26). In many molecules, however, the transitions 
between the levels must simultaneously satisfy both the rotational and vibrational 
selection rules: 

$$|\Delta N| = 1 \quad \text{and} \quad |\Delta L| = 1 \tag{9.18}$$

Consider the state with quantum numbers N = 1 and L = 4, as shown in 
Figure 9.30. The molecule cannot make a transition to the next lowest rotational 
state (N = 1, L = 3), because it would violate the vibrational selection rule 
|ΔN| = 1. All transitions must simultaneously satisfy both selection rules. The 
molecule can make a transition to the state with N = 0 and L = 3 or to the state with 
N = 0 and L = 5. Absorption transitions must also satisfy these selection rules. 

Let’s now obtain a general expression for the energies of the transitions that 
satisfy both selection rules. The energy of the state with quantum numbers NV and 
L can be written as the sum of the vibrational and rotational terms: 


$$E_{NL} = \left(N + \tfrac{1}{2}\right)hf + BL(L + 1) \tag{9.19}$$

where N = 0, 1, 2, ... and L = 0, 1, 2, ... 


The vibrational term is usually much larger than the rotational term, so the 
emission wavelengths in the spectrum will usually correspond to $N \to N - 1$ 
with $L \to L \pm 1$; the absorption wavelengths will be those for transitions in 
which N increases by one unit. 

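For absorption from an initial state N, L to a final state N + 1, L ± 1, the possible 
photon energies are 

$$\Delta E = E_{N+1,\,L\pm 1} - E_{NL} = \left[\left(N + \tfrac{3}{2}\right)hf + B(L \pm 1)(L \pm 1 + 1)\right] - \left[\left(N + \tfrac{1}{2}\right)hf + BL(L + 1)\right]$$

$$\Delta E = hf + 2B(L + 1) \quad \text{for } L \to L + 1 \tag{9.20}$$

$$\Delta E = hf - 2BL \quad \text{for } L \to L - 1 \tag{9.21}$$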


Figure 9.31 shows the expected spectrum of absorption photons. Starting from 
the center and increasing to the right is a series of photons whose energies are 
given by Eq. 9.20: 


hf + 2B, hf + 4B, hf + 6B,... 





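[Figure 9.31: vertical absorption lines along a photon-energy axis, spaced 2B apart 
and centered on hf, where no line appears. To the right of hf the lines are labeled 
L = 0 to L = 1, L = 1 to L = 2, and L = 2 to L = 3; to the left of hf they are labeled 
L = 1 to L = 0, L = 2 to L = 1, and L = 3 to L = 2.] 

FIGURE 9.31 Expected sequences of absorption transitions between combined 
rotational and vibrational states. Each vertical line represents the 
absorption of a photon with a sharply defined energy. 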


Also starting from the center but decreasing to the left is a series of photons whose 
energies are given by Eq. 9.21: 


hf - 2B, hf - 4B, hf - 6B,... 


Note that the photon of energy hf is missing at the center of the spectrum; such a 
photon would correspond to a “pure” vibrational transition, which would violate 
the rotational selection rule. 
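
The two series of Eqs. 9.20 and 9.21 can be generated numerically to confirm that no line falls at hf. The following is a minimal Python sketch; the function name `absorption_lines` and the values hf = 100 and B = 1 (arbitrary energy units) are illustrative assumptions, not values taken from the text.

```python
def absorption_lines(hf, B, L_max):
    """Photon energies for absorption N -> N + 1 with |Delta L| = 1.

    L is the rotational quantum number of the initial state.
    """
    # L -> L + 1 branch, Eq. 9.20: hf + 2B(L + 1), allowed for L = 0, 1, 2, ...
    right = [hf + 2 * B * (L + 1) for L in range(0, L_max + 1)]
    # L -> L - 1 branch, Eq. 9.21: hf - 2BL, allowed only for L = 1, 2, ...
    left = [hf - 2 * B * L for L in range(1, L_max + 1)]
    return sorted(left + right)


hf, B = 100.0, 1.0          # assumed illustrative values, arbitrary units
lines = absorption_lines(hf, B, L_max=3)
print(lines)                # [94.0, 96.0, 98.0, 102.0, 104.0, 106.0, 108.0]
print(hf in lines)          # False: the pure vibrational line is absent
```

Every neighboring pair of lines is separated by 2B except the central pair, which is separated by 4B because the line at hf is missing.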

We can compare this ideal spectrum with the real spectrum of Figure 9.32, 
which shows the absorption transitions of the molecule HCl. Although the basic 
structure of Figure 9.31 is present, including the missing pure vibrational transition 
at the center of the spectrum, there are a number of differences, which we can 
explain based on the structure of the HCl molecule. 


1. The transitions are not equally spaced. We expect all of the transitions of 
the spectrum to be separated by a constant energy 2B but, as you can see, this is 
not the case. The spacing of the lines appears to decrease as we move away from 
the center to the right and to increase as we move to the left. The explanation 
for this effect lies with our assumption that the molecule is a rigid structure with 
a constant rotational inertia. As the angular velocity and angular momentum of 
the rotating molecule increase, the somewhat nonrigid bond can stretch slightly, 
increasing the value of $R_{\rm eq}$ and therefore decreasing B, which is inversely 
proportional to $R_{\rm eq}^2$. As Eq. 9.20 shows, the photon energies increase less 
rapidly if $R_{\rm eq}$ grows, as the $L \to L + 1$ branch in Figure 9.32 illustrates. On the 
other hand, an increasing $R_{\rm eq}$ causes the $L \to L - 1$ energies to increase more 
rapidly, as required by Eq. 9.21 and as shown by the increasing spacing of the 
lines of the left-hand branch in Figure 9.32. Both effects grow with increasing L. 

FIGURE 9.32 The absorption spectrum of molecular HCl. 

2. The heights of the peaks are quite different. The heights of the peaks give 
the intensity of the transitions, and the intensity of any transition is proportional 
to the population of the particular level from which that transition originates. 
The populations of the levels decrease as the energy increases, according to the 
Maxwell-Boltzmann distribution factor $e^{-E/kT}$. The populations also increase with 
increasing L according to the factor 2L + 1, which gives the angular momentum 
degeneracy of each level; in effect, the more substates there are in each level, the 
greater its population. We can therefore write the population of a level of energy 
$E_{NL}$ as 


$$p(E_{NL}) = (2L + 1)\,e^{-E_{NL}/kT} = (2L + 1)\,e^{-(N + 1/2)hf/kT}\,e^{-BL(L + 1)/kT} \tag{9.22}$$


where we have used $E_{NL}$ from Eq. 9.19. 

For the first few levels, the populations increase as L increases, because the 
exponential factor does not differ much from 1. However, as the energy continues 
to increase, the rapidly decreasing exponential factor begins to dominate and the 
populations decrease accordingly, becoming negligible for L > 10. 

The level with maximum population, which corresponds to the most intense 
peak in the spectrum, can be found from Eq. 9.22 by locating its maximum, where 
dp/dL = 0. The result is 

$$2L + 1 = \sqrt{\frac{2kT}{B}} \tag{9.23}$$


For a measurement at room temperature (kT = 0.025 eV) and for 2B = 0.0026 eV 
(estimated from the peak spacing in Figure 9.32), we obtain L = 3 for the peak of 
maximum intensity, in agreement with Figure 9.32. 
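
The estimate can be checked by tabulating the rotational factor of Eq. 9.22 directly. The sketch below is only illustrative: the helper name `relative_population` is an assumption, and the vibrational factor is omitted because it is the same for every L within a given vibrational state. The values of kT and 2B are the ones quoted above.

```python
import math


def relative_population(L, B, kT):
    """Rotational part of Eq. 9.22: (2L + 1) exp(-B L (L + 1) / kT)."""
    return (2 * L + 1) * math.exp(-B * L * (L + 1) / kT)


kT = 0.025          # eV, room temperature (value quoted in the text)
B = 0.0026 / 2      # eV, from the quoted peak spacing 2B = 0.0026 eV

# Eq. 9.23 solved for L, treating L as a continuous variable.
L_continuous = (math.sqrt(2 * kT / B) - 1) / 2

# Populations of the integer levels L = 0 ... 10.
populations = {L: relative_population(L, B, kT) for L in range(11)}
L_peak = max(populations, key=populations.get)

print(f"Eq. 9.23 gives L = {L_continuous:.2f}")
print(f"most populated integer level: L = {L_peak}")
```

The continuous result lies between 2 and 3, and the tabulated populations peak at L = 3, consistent with the value read from the spectrum.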

3. Each peak appears to be two very closely spaced peaks. Chlorine consists 
of two types of atoms (isotopes) whose masses are approximately 35 u and 
37 u. The differing masses result in slightly different vibrational and rotational 
energies for the two isotopes, with the heavier atom having the smaller energies. 
Thus the peaks for the Cl atoms with mass 37 u appear at slightly lower energies 
than the peaks for the Cl atoms with mass 35 u. The lighter atom occurs with 
about three times the abundance of the heavier atom, so the peaks for the lighter 
atom have the greater intensity. 
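
The size of the isotope splitting can be estimated from the reduced masses. The sketch below assumes a hydrogen mass of 1 u and assumes that the force constant and the equilibrium separation are the same for both isotopes; neither assumption is stated in the text. It uses the scalings $f \propto 1/\sqrt{m}$ and $B \propto 1/m$, where m is the reduced mass.

```python
def reduced_mass(m1, m2):
    """Reduced mass of a two-body system, in the same units as the inputs."""
    return m1 * m2 / (m1 + m2)


m_H = 1.0                           # u, assumed mass of hydrogen
mu_35 = reduced_mass(m_H, 35.0)     # u, H with the lighter Cl isotope
mu_37 = reduced_mass(m_H, 37.0)     # u, H with the heavier Cl isotope

# Ratios (heavier isotope) / (lighter isotope), assuming the same force
# constant and the same equilibrium separation for both molecules.
vib_ratio = (mu_35 / mu_37) ** 0.5  # vibrational energy scales as 1/sqrt(m)
rot_ratio = mu_35 / mu_37           # rotational parameter B scales as 1/m

print(f"reduced masses: {mu_35:.5f} u and {mu_37:.5f} u")
print(f"vibrational energy ratio: {vib_ratio:.5f}")
print(f"rotational energy ratio:  {rot_ratio:.5f}")
```

Both ratios are slightly less than 1, so each line of the heavier isotope sits just below the corresponding line of the lighter isotope.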

Just as atomic spectroscopy gives us a way to identify atoms from their 
characteristic emission or absorption spectrum, molecular spectroscopy enables 
us to identify molecules by the radiation they absorb or emit. Each molecule has its 
own characteristic “fingerprint” that can be easily recognized. It is important that 
this technique tells us exactly the composition of the molecule—the number of 
atoms of each type, the isotopic ratios, even the state of ionization of the molecule. 
Thus we could easily distinguish CO from CO$_2$, H$^{35}$Cl from H$^{37}$Cl, H$_2$ from H$_2^+$. 

As you might imagine, such a precise identification technique has many 
applications to areas where it is necessary to identify trace amounts of molecules. 
Two applications in particular are of interest. The absorption spectra of our 
atmosphere can be used to identify trace amounts of various pollutants, and 
thus molecular spectroscopy helps to measure the purity of our air. Similarly, 
the absorption spectra of interstellar dust are used to identify the molecules that 
are present in interstellar space. This is the only technique we have (so far) for 
learning about the formation of complex molecules in our galaxy, because stars 
are too hot to permit molecules to exist. 


Unfortunately, our atmosphere absorbs much of the infrared and microwave 
radiation that characterizes the spectra of these molecules, but spectrometers carried 
on satellites beyond the atmosphere permit the observation of those radiations 
and the identification of many varieties of molecules, including some relatively 
complex organic molecules. As an added benefit, when those spectrometers are 
aimed toward the Earth, they can measure how the infrared radiation emitted by 
the Earth is absorbed in its atmosphere, and thus detect the presence of various 
atmospheric pollutants. 





Example 9.6 

(a) From the absorption spectrum of HCl (Figure 9.32), 
determine the vibrational force constant. (b) Determine the 
rotational energy spacing of the peaks and compare with 
the expected value for HCl. 


Solution 

(a) The vibrational frequency can be found from the energy 
of the “missing” central transition in Figure 9.32. This 
energy can be read from the figure to be about 0.358 eV, 
and so 

$$f = \frac{\Delta E}{h} = \frac{0.358\ \text{eV}}{4.14 \times 10^{-15}\ \text{eV} \cdot \text{s}} = 8.65 \times 10^{13}\ \text{Hz}$$

The reduced mass of HCl can be found from Eq. 9.10 to be 
m = 0.98 u. We can now find the force constant: 

$$k = 4\pi^2 m f^2 = 4\pi^2 (0.98\ \text{u})(8.65 \times 10^{13}\ \text{Hz})^2 = 2.89 \times 10^{29}\ \text{u} \cdot \text{Hz}^2 = 2.99 \times 10^{21}\ \text{eV/m}^2$$


(b) From the figure, the energy spacing of the peaks is 
estimated to be 0.0026 eV. The expected spacing (see 
Figure 9.31) is 2B, or 

$$2B = \frac{\hbar^2}{mR_{\rm eq}^2} = \frac{(hc)^2/4\pi^2}{(mc^2)R_{\rm eq}^2} = \frac{(1240\ \text{eV} \cdot \text{nm})^2/4\pi^2}{(0.98\ \text{u} \times 931.5\ \text{MeV/u})(0.127\ \text{nm})^2} = 0.00265\ \text{eV}$$





This calculation gives excellent agreement with the value 
estimated from the spectrum. 
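
The arithmetic of this example can be reproduced in a few lines. The sketch below uses the constants quoted in the solution; the speed of light, needed to convert u·Hz² to eV/m², is an added assumption (2.998 × 10⁸ m/s), and the variable names are illustrative. Small differences in the last digit relative to the printed values come from intermediate rounding.

```python
import math

h = 4.14e-15        # eV s, Planck constant as used in the solution
hc = 1240.0         # eV nm
u_c2 = 931.5e6      # eV, rest energy of 1 u
c = 2.998e8         # m/s, assumed value for the unit conversion

dE = 0.358          # eV, energy of the missing central transition
m_u = 0.98          # u, reduced mass of HCl
R_eq = 0.127        # nm, equilibrium separation used in the solution

# (a) vibrational frequency and force constant k = 4 pi^2 m f^2
f = dE / h                                   # Hz
m_c2 = m_u * u_c2                            # eV, reduced mass times c^2
k = 4 * math.pi**2 * (m_c2 / c**2) * f**2    # eV/m^2

# (b) expected rotational spacing 2B = (hc)^2 / (4 pi^2 (m c^2) R_eq^2)
two_B = (hc**2 / (4 * math.pi**2)) / (m_c2 * R_eq**2)   # eV

print(f"f  = {f:.3e} Hz")
print(f"k  = {k:.3e} eV/m^2")
print(f"2B = {two_B:.5f} eV")
```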
Questions 

1. Why does H$_2$ have a smaller radius and a greater binding 
energy than H$_2^+$? 

2. The molecule LiH has a simple electronic structure. The H 
atom would like to gain an electron to fill its 1s subshell, but 
the Li atom would similarly like to fill its 2s subshell. Based 
on the atomic structure of H and Li, which would you expect 
to dominate in the desire for an additional electron? Are the 
electronegativity values of Table 9.6 consistent with this? 

3. Consider molecules in which an alkali element X (X = Li, 
Na, K, Rb) is paired with either F or Cl to make XF or XCl. 
The molecule XF always has a greater dissociation energy 
and a smaller separation distance than the molecule XCl. 
Explain. 

4. H$_2^+$ has a greater bond length (equilibrium separation) than 
H$_2$, but O$_2^+$ has a smaller bond length than O$_2$. Explain. 

5. In general, would you expect ss bonds or pp bonds to be 
stronger? Why? 

6. Explain why the bond angles of the sp directed bonds 
(Table 9.3) approach 90° as the atomic number of the 
central atom increases. 

7. How do the molecular force constants k compare with 
those of ordinary springs? What do you conclude from this 
comparison? 

8. Why is it unnecessary to consider rotations of the diatomic 
molecule of Figure 9.27 about the x axis? Estimate the 
rotational inertia for rotations about the x axis; compare 
the typical rotational energies with corresponding values for 
rotations about the y or z axis. 

9. Explain how the equilibrium separation in a molecule can be 
determined by measuring the absorption or emission lines 
for rotational states. 

10. How would the rotational energy spacing of D$_2$ (a form of 
“heavy hydrogen”; the mass of D is twice the mass of H) 
compare with the rotational spacing of H$_2$? How would their 
vibrational energy spacings compare? Their equilibrium 
separation distances? 

11. For a molecule like HCl, estimate the number of rotational 
levels between the first two vibrational levels. 

12. Why does an atom generally absorb radiation only from 
the ground state, while a molecule can absorb from many 
excited rotational or vibrational states? 

13. If a collection of molecules were all in the N = 0, L = 0 
ground state, how many lines would there be in the absorption 
spectrum? 


Problems 

9.1 The Hydrogen Molecule 

1. Calculate the ionization energy of H$_2$. 

2. Suppose a spherically symmetric negative charge of diameter 
$R_{\rm eq}$ were centered between two protons at the H$_2^+$ 
equilibrium configuration, so that the two protons just touch 
the surface of the sphere. What charge on the sphere is 
required to give the system a binding energy of 2.7 eV? 


9.2 Covalent Bonding in Molecules 

3. Bond strengths are frequently given in units of kilojoules 
per mole. Find the molecular dissociation energy (in eV) 
from the following bond strengths: (a) NaCl, 410 kJ/mole; 
(b) Li$_2$, 106 kJ/mole; (c) N$_2$, 945 kJ/mole. 

4. Following the method of Example 9.1, predict the relative 
dissociation energies of the molecules NO, NF, and OF. 
Check your prediction by obtaining the bond strengths from 
tabulations. 

5. For each of the following pairs of molecules, predict which 
will have the larger dissociation energy: (a) Li$_2$ and Be$_2$; 
(b) B$_2$ and C$_2$; (c) CO and O$_2$. Check your prediction 
against tabulated values of the bond strengths or dissociation 
energies. 


6. (a) Which molecule would you expect to have the greater 
dissociation energy, F$_2$ or F$_2^+$? (b) N$_2$ or N$_2^+$? (c) NO or 
NO$^+$? (d) CN or CN$^+$? Check your predictions against 
tabulated dissociation energies. 


9.3 Ionic Bonding 

7. Calculate the Coulomb energy of KBr at the equilibrium 
separation distance. 

8. The ionization energy of potassium is 4.34 eV; the electron 
affinity of iodine is 3.06 eV. At what separation distance will 
the KI molecule gain enough Coulomb energy to overcome 
the energy needed to form the K$^+$ and I$^-$ ions? 

9. (a) Assuming a separation of 0.193 nm, compute the 
expected electric dipole moment of NaF. (b) The measured 
dipole moment is $27.2 \times 10^{-30}$ C·m. What is the 
fractional ionic character of NaF? 

10. The equilibrium separation of HI is 0.160 nm and its measured 
electric dipole moment is $1.47 \times 10^{-30}$ C·m. Find the 
fractional ionic character of HI. 

11. The equilibrium separation of BaO is 0.194 nm and the 
measured electric dipole moment is $26.5 \times 10^{-30}$ C·m. Calculate 
the fractional ionic character, assuming two valence 
electrons. 


9.4 Molecular Vibrations 


12. The effective vibrational force constant of CN is 
$1.017 \times 10^4$ eV/nm$^2$. (a) Relative to the ground state, find the energies 
of the first and second excited vibrational states. (b) In 
ionized CN$^+$ the force constant is weaker by 3.42%. What 
would be the energies of the first and second excited states 
in CN$^+$? 

13. The vibrational transition in BeO is observed at a wavelength 
of 6.724 μm. What is the effective force constant of 
BeO? 

14. Figure 9.33 shows the energy minimum of molecular NaCl, 
through which a parabola has been drawn. Following the 
methods of Example 9.3, find the effective vibrational force 
constant, the vibrational frequency and wavelength, and 
the vibrational photon energy for NaCl. In what range 
of the electromagnetic spectrum would such radiations be 
found? What is the maximum vibrational quantum number 
for which the parabolic approximation remains valid 
for NaCl? 

FIGURE 9.33 Problem 14. 

15. The vibrational energy of CO is 0.2691 eV when the 
constituents of the molecule are the most abundant isotopes 
of carbon (m = 12.00 u) and oxygen (m = 16.00 u). 
(a) What would be the vibrational energy if the oxygen were 
replaced by the less abundant isotope with m = 18.00 u? 
(b) What would be the vibrational energy if the carbon in 
the original CO were replaced with radioactive carbon (used 
in radiocarbon dating) with mass 14.00 u? 


9.5 Molecular Rotations 


16. Derive Eq. 9.14 from the definition of the rotational inertia 
of a particle. 

17. Following the method of Example 9.4, calculate the energies 
and wavelengths of the three lowest rotational transitions 
emitted by molecular NaCl. In what region of the electromagnetic 
spectrum would these radiations be found? 

18. The atoms of a carbon dioxide molecule (CO$_2$) are arranged 
in a line, with the carbon in the center and the oxygens 
on either side at a distance of 0.116 nm. In the absence of 
vibrations, calculate the energies of the first five rotational 
states of CO$_2$ for rotations about an axis through the center 
of mass of the molecule. 

19. Find the difference in the rotational parameter B of HCl for 
the two different masses of Cl (35 u and 37 u). 

20. In observing the rotational transitions of a certain molecule, 
two photons are observed at wavelengths of 526.6 μm and 
658.3 μm. There are no photons emitted with wavelengths 
between these two values. (a) Determine the rotational 
parameter B for this molecule, and find where these wavelengths 
occur in the sequence of rotational transitions. 
(b) From other experiments with this molecule, it is known 
that the equilibrium separation is 0.1172 nm. It is also known 
that this molecule contains carbon. Determine the identity 
of the other atom. 


9.6 Molecular Spectra 


21. Figure 9.30 illustrates the combined rotational-vibrational 
structure when the vibrational energy is much larger than 
the rotational energy. Make a sketch showing the reverse 
situation, in which the rotational energy is much larger than 
the vibrational energy. Use a scale in which B = 10 units 
and hf = 2 units; show rotational levels up to L = 3 and 
vibrational levels to N = 3. 

22. (a) Sketch a diagram, similar to Figure 9.30, showing all 
possible absorption transitions from the N = 0 to the N = 1 
states. Include rotational states up to L = 5. (b) Using the 
values hf = 10 units and B = 0.25 unit, show the energy 
spectrum of the absorption, including all transitions from 
part (a). Label each transition with the initial and final 
quantum numbers. 

23. Based on the energy levels given by Eq. 9.19, calculate the 
energies of the emitted photons. 

24. (a) What is the reduced mass of the KCl molecule? 
(b) With an equilibrium separation of 0.267 nm, find the 
spacing of the transitions in the combined rotational-vibrational 
spectrum. 

25. Derive Eq. 9.23 from Eq. 9.22. 

26. In a collection of HCl molecules, at what temperature would 
the number in the first excited vibrational state be 1/3 the 
number in the ground state? (Ignore the rotational structure.) 

27. The most intense absorption line in the rotational-vibrational 
spectrum of CO at room temperature occurs for L = 7. Justify 
this value with a calculation. (The equilibrium separation 
of CO is 0.113 nm.) 


General Problems 


28. The energy of a diatomic molecule can be represented by 
the empirical function 

$$E = \frac{A}{R^9} - \frac{B}{R}$$

where A and B are positive constants. Find A and B in terms 
of the equilibrium separation $R_{\rm eq}$ and the dissociation energy 
$E_0$. Sketch the resulting function, choosing values for the 
constants appropriate for H$_2$. 

29. Show that the angle between bonds in the tetrahedral carbon 
structure (Figure 9.14) is 109.5°. 

30. Complete the following table, which compares properties of 
H$_2$ molecules when one or both of the H is replaced with an 
atom of deuterium (D), which is a “heavy hydrogen” atom 
with twice the mass of ordinary hydrogen. 

Molecule | Vibrational frequency     | $R_{\rm eq}$ | B 
H$_2$    | $1.32 \times 10^{14}$ Hz  | 0.074 nm     | 0.0076 eV 
HD       |                           |              | 
D$_2$    |                           |              | 

31. Because of the “zero-point” energy (see Figures 9.23 and 
9.29), the ground state of a molecule is not at the minimum 
of the molecular energy curve but at an energy of hf/2 above 
the minimum. The dissociation energy is the energy needed 
from that state to break apart the molecule. (a) Given that 
the vibrational energy of H$_2$ is 0.54 eV, find the “zero-point” 
energies of H$_2$, HD, and D$_2$. Assume the mass of deuterium 
(“heavy hydrogen”) is twice the mass of hydrogen. (b) Given 
that the dissociation energy of H$_2$ is 4.52 eV, find the 
dissociation energies of HD and D$_2$. Assume that the added 
neutron does not change the molecular energy curve or the 
equilibrium separation. 

32. Estimate the number of rotational states between the vibrational 
states for (a) H$_2$ (see Example 9.3); (b) HCl (see 
Example 9.6); (c) NaCl (see Problems 14 and 17). 

33. Figure 9.34 shows the absorption spectrum of the molecule 
HBr. Following the basic procedures of Section 9.6, find: 
(a) the energy of the “missing” transition; (b) the effective 
force constant k; (c) the rotational spacing 2B. Estimate 
the value of the rotational spacing expected for HBr and 
compare with the value deduced from the spectrum. Why 
are there only single lines and not double lines as in the case 
of HCl? 

FIGURE 9.34 Problem 33. 

34. (a) In a collection of H$_2$ molecules at room temperature, 
what is the ratio of the number of molecules in the N = 1 
vibrational state to the number in the N = 0 vibrational 
state? (Ignore the rotational structure for this problem.) 
(b) What is the ratio of the number in the N = 2 state to the 
number in the ground state? 

35. Repeat Problem 34 for NaCl molecules. 

36. (a) In a collection of H$_2$ molecules at room temperature 
in the N = 0 vibrational state, find the relative number of 
molecules in the first three rotational excited states compared 
with the number in the ground state. (b) Repeat part (a) if 
the temperature is 30 K. (Hint: Don’t forget the degeneracy 
of the levels.) 

37. Repeat Problem 36 for NaCl. 

38. (a) Predict the fractional ionic content of KBr based on 
Figure 9.22. (b) Calculate the fractional ionic content and 
compare with your prediction. 

39. In a 1987 study of emissions from molecules in the interstellar 
medium, the following frequencies were reported: 
93.98 GHz, 140.97 GHz, and 234.94 GHz. (a) Assuming 
these radiations to be emitted in transitions between rotational 
states of a diatomic molecule (but not necessarily to 
be consecutive in sequence), what is the largest possible 
value of the rotational spacing 2B that would be consistent 
with these emissions? (b) Where do these transitions fit in 
the sequence of rotational emissions from this molecule? 
Give the corresponding angular momentum quantum numbers. 
(c) The paper that reported these emissions claimed 
them to be the first identification of phosphorus in the interstellar 
medium and that the molecule observed was PN. 
Verify that the observed rotational emissions are consistent 
with PN. 

40. The star known as IRC +10216 or CW Leonis is a red 
giant in the late stages of its evolution. It is surrounded by a 
dust cloud that is rich in a variety of chemical elements that 
have been produced in fusion reactions in the star. Among 
the molecules observed in the dust cloud is AlCl, which 
has been identified through two close-lying strong rotational 
emissions with frequencies 160.312 GHz and 156.547 GHz. 
(a) Locate these transitions among the rotational states of 
AlCl. (b) What temperature is necessary to excite AlCl in 
order to produce these rotations? 


Some processes in nature are determined by the statistical distribution of random events. 
When the number of particles in a system is very large, the range of likely outcomes 
becomes very narrow, and the processes may seem deterministic. Other processes may 
result from deterministic laws but seem chaotic in their outcomes. Such processes can be 
represented by apparently irregular geometric patterns called fractals. 

FIGURE 10.1 (top) The line spectrum of mercury in the visible region as it 
might be analyzed using a diffraction grating. (bottom) The intensity spectrum 
of the transitions. 

FIGURE 10.2 The continuous spectrum of an incandescent source in the 
visible region. 


Many physics experiments are analyzed as if the interactions take place in single, 
isolated events. The emission of light from a gas at low density, Rutherford 
scattering, and Compton scattering are examples of experiments that can be 
analyzed in this way. On the other hand, consider the addition of energy to a gas 
in a container by raising its temperature. We know how the average energy of the 
atoms will change, but we can’t analyze the behavior of individual atoms. 

This sharing of energy among the many parts of a system cannot be simply 
discussed in terms of single isolated events. The analysis of such cooperative 
phenomena requires the techniques of statistical physics, in which we are con- 
cerned not with calculating the exact outcome of single, isolated events, but 
with predicting the average outcome of many cooperative events, based on the 
statistical distribution of the possible outcomes. 

In this chapter we discuss the laws of statistical physics, and we illustrate some 
systems that are governed by classical statistics and some others that require 
quantum statistics. These statistical concepts are necessary to understand the 
bulk properties of matter, which are discussed briefly in this chapter and more 
extensively in Chapter 11. 


When we pass an electric current through a tube containing a low-density gas, 
such as mercury vapor, light is emitted. An electron pushed to an excited state of 
an atom returns to the ground state with the emission of one or more photons. 
In the case of mercury vapor, we see individual photons corresponding to green 
light, blue light, orange light, and so forth. Each photon has a definite wavelength 
and corresponds to a transition between two levels of definite energies. Aside 
from the effect of the uncertainty principle, the wavelengths are “sharp.” If we 
analyze the light with a high-resolution device such as a diffraction grating, the 
resulting spectrum (Figure 10.1) shows the sharpness of the spectral lines. We can 
understand this spectrum based on our knowledge of the excited states of a single 
mercury atom; as long as the density of the gas is low, the number of atoms in 
the tube does not affect the observed spectrum. We treat the light emitted by this 
collection of atoms as if the individual emissions occur singly and in isolation. 

Consider now the contrasting case of the tungsten filament of an ordinary 
incandescent light bulb. Figure 10.2 shows the spectrum in this case, which has the 
continuous distribution of wavelengths that we call “white” light. All wavelengths 
are present, not just a finite number. Isolated tungsten atoms, like those of mercury, 
emit light at a finite number of discrete, well-defined wavelengths, but in a solid 
tungsten filament the cooperative effect of other nearby atoms changes the energy 
levels and makes the spectrum continuous. Even though the mercury vapor and 
the tungsten filament may contain roughly the same number of atoms, in one case 
we can ignore the presence of the other atoms, while in the other case we must 
consider the mutual influence of many or all of the atoms in the sample. 

There are two ways to approach the analysis of a complex system. The first 
is to specify a set of microscopic properties, such as the position and velocity 
of each atom. For even a small system containing perhaps $10^{15}$ atoms, this is 
obviously a hopeless task. The second way is to recognize that such a description 
is not only impossibly complex, it is also unnecessary because it provides far 
more detail than is useful. We can understand and predict the behavior of systems 
containing many particles in terms of a few macroscopic properties, such as the 
temperature or the pressure of a gas. The development of relationships between 
microscopic and macroscopic properties was one of the great triumphs of 19th- 
century physics. In the case of a gas confined to a container, for example, kinetic 
theory gives the relationship between the microscopic motion of the molecules 
and the macroscopic temperature and pressure. 

More generally, we can make a statistical analysis by counting the number of 
different arrangements of the microscopic properties of a system. For example, 
consider the distribution of 2 units of energy to a “gas” of four identical but 
distinguishable particles. Each particle can acquire energy only in integral units 
(in analogy with the simple harmonic oscillator). How can these four particles 
share the 2 units of energy? One way is for one particle to have the entire 2 units. 
There are four different ways to accomplish this distribution, corresponding 
to choosing each of the four particles to take the 2 units of energy. Another 
way to distribute the energy is to give two different particles 1 unit each. There 
are six ways to accomplish this distribution (Table 10.1). Each possible energy 
distribution is called a macrostate—a state of the system that can be observed 
through the measurement of a macroscopic property such as the temperature. In 
our simple system, there are two macrostates: macrostate A in which one particle 
has 2 units of energy, and macrostate B in which two particles each have 1 unit of 
energy. The different arrangements of microscopic variables corresponding to a 
single macrostate are called microstates. In our system, there are four microstates 
corresponding to macrostate A and six microstates corresponding to macrostate 
B. The number of microstates corresponding to a given macrostate is called the 
multiplicity W. For our system, $W_A = 4$ and $W_B = 6$. 
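
The counting can be verified by brute force. The following Python sketch lists every way to assign integer energies to four distinguishable particles, keeps the assignments whose energies sum to 2 units, and groups them by macrostate; the function name `macrostates` and the choice to label a macrostate by its sorted energies are illustrative assumptions.

```python
from collections import Counter
from itertools import product


def macrostates(n_particles, total_energy):
    """Map each macrostate (energies sorted high to low) to its multiplicity W."""
    counts = Counter()
    # Each tuple is one microstate: the energy of particle 1, 2, 3, ...
    for micro in product(range(total_energy + 1), repeat=n_particles):
        if sum(micro) == total_energy:
            counts[tuple(sorted(micro, reverse=True))] += 1
    return counts


W = macrostates(n_particles=4, total_energy=2)
for macro, multiplicity in W.items():
    print(macro, multiplicity)
print("total microstates:", sum(W.values()))
```

The macrostate (2, 0, 0, 0) corresponds to A and (1, 1, 0, 0) corresponds to B.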

One application of these statistical principles is to determine the direction of the 
natural evolution of a system. The second law of thermodynamics says, in effect, 
that isolated systems evolve in a direction such that the multiplicity increases.* 


TABLE 10.1 Macrostates of a Simple System 

           | Microstate (Energy of Particle) 
Macrostate | Particle 1 | Particle 2 | Particle 3 | Particle 4 
A          | 2          | 0          | 0          | 0 
           | 0          | 2          | 0          | 0 
           | 0          | 0          | 2          | 0 
           | 0          | 0          | 0          | 2 
B          | 1          | 1          | 0          | 0 
           | 1          | 0          | 1          | 0 
           | 1          | 0          | 0          | 1 
           | 0          | 1          | 1          | 0 
           | 0          | 1          | 0          | 1 
           | 0          | 0          | 1          | 1 

*The second law is often expressed in terms of the entropy S by saying that an isolated system must 
evolve such that $\Delta S > 0$. The Austrian physicist Ludwig Boltzmann (1844-1906) developed the 
relationship between the entropy of a system and the multiplicity of its macrostate: S = k ln W, where 
k is the Boltzmann constant. The increase in the entropy of a system as it changes from one macrostate 
to another is thus equivalent to an increase in the multiplicity. 

That is, if we started our system in macrostate A and allowed the four particles 
to interact with one another, later we might find it in macrostate B; however, 
if the system began in macrostate B, we would be less likely to find it later in 
macrostate A, because a change from B to A involves a less probable decrease 
in multiplicity. As the number of particles in the system increases, differences 
in multiplicity become greater, and changes involving decreases in multiplicity 
become so improbable as to be unobservable. 
Implicit in this statistical analysis is the following postulate: 


All microstates of a system are equally probable. 


Our system can be found with equal probability in any of the 10 microstates 
listed in Table 10.1. It is this postulate that allows us to assign a greater statistical 
weight to macrostate B; the system can be found with equal probability in any of 
the six microstates of B or the four microstates of A, so the relative weight of B 
is 6/4 = 3/2. In a large number of identical systems, we expect to find 40% in 
macrostate A and 60% in macrostate B. 

For another example, in the card game of poker a royal flush consisting of the 
A♥, K♥, Q♥, J♥, 10♥ is just as probable as one particular worthless hand, such as a 
specific 10, 8, 5, 4, and 2 of mixed suits. What makes a royal flush so special is that there are only four 
possible royal flushes among the 2,598,960 possible poker hands, most of which 
are worthless. Even though a royal flush is just as likely as a particular worthless 
hand, it is much less likely than any worthless hand. In the language of statistical 
physics, the macrostate “royal flush” has fewer microstates (and is therefore less 
probable) than the macrostate “worthless hand” or the macrostate “one pair.” 
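
The numbers in this example follow from a single binomial coefficient. The short sketch below is an illustration only; the variable names are assumptions, and the count of four royal flushes (one per suit) is taken from the text.

```python
import math

n_hands = math.comb(52, 5)          # number of distinct 5-card hands
n_royal_flushes = 4                 # one per suit, as stated in the text

p_one_specific_hand = 1 / n_hands   # any single named hand (one microstate)
p_royal_flush = n_royal_flushes / n_hands   # the macrostate "royal flush"

print(n_hands)                      # 2598960
print(f"{p_one_specific_hand:.3e}")
print(f"{p_royal_flush:.3e}")
```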

The assumption of equal probabilities of the microstates allows us to do 
calculations on the system. For example, suppose we have a large number 
of identical systems like that of Table 10.1 whose microstates are distributed 
randomly. Let us reach into each system and measure the energy of a particle. 
What is the probability that the particle will have 2 units of energy? Among the 
10 equally probable microstates consisting of a total of 40 particles, there are four 
particles with 2 units of energy. Thus p(2) = 4/40 = 0.10, and we expect that 10% 
of our measurements would find E = 2. Similarly, p(1) = 0.30 and p(0) = 0.60. 
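
These probabilities can be reproduced by pooling every particle from every equally probable microstate. The sketch below assumes the same system as Table 10.1 (four distinguishable particles sharing 2 units); the variable names are illustrative.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

n_particles, total_energy = 4, 2

# All microstates: energy assignments whose total is 2 units.
microstates = [
    m for m in product(range(total_energy + 1), repeat=n_particles)
    if sum(m) == total_energy
]

# Pool the particles of all microstates and count how often each energy occurs.
tally = Counter(energy for m in microstates for energy in m)
n_samples = len(microstates) * n_particles      # 10 microstates x 4 particles

p = {E: Fraction(tally[E], n_samples) for E in sorted(tally)}
for E, prob in p.items():
    print(E, prob, float(prob))
```

Exact fractions are used so that the results can be compared directly with 4/40, 12/40, and 24/40.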

The statistical analysis of a complex system gives us a way of describing the 
state of the system, its average properties, and its evolution in time. Physics is 
often concerned with exactly these details, so the statistical analysis is very useful 
indeed. It can be applied to systems in which the number of particles is very small, 
such as a nucleus with the order of $10^2$ particles, or very large, such as a star 
like the Sun with $10^{57}$ particles. Our next task is to determine whether there are 
differences between the statistical behaviors of classical and quantum systems. 


10.2 CLASSICAL AND QUANTUM STATISTICS 


To illustrate the differences between classical and quantum statistics, we first 
consider another example similar to the one of the previous section: the distribution 
of a total of 6 energy units to a collection of five identical but distinguishable 
oscillator-like particles, each of which can absorb energy in equal increments of 
1 unit. Instead of a tabulation similar to Table 10.1, the energy distribution is 
illustrated in Figure 10.3. There are ten macrostates, labeled A through J. Each 
dot indicates a particle with a certain energy; for instance, in macrostate B there 
are three particles with energy E = 0, one with E = 1, and one with E = 5. The 
multiplicity of each macrostate (the number of microstates) can be worked out 
in tabular form, or it can be calculated directly using standard methods from 
permutation theory: 

$$ W = \frac{N!}{N_0!\,N_1!\,N_2!\,N_3!\,N_4!\,N_5!\,N_6!} \tag{10.1} $$

where N is the total number of particles and $N_E$ is the number with energy E. 

FIGURE 10.3 The macrostates of a system in which five identical particles share 6 units of energy. 

The results of this calculation are an example of a more general approach that 
gives the total number of microstates when N distinguishable particles share Q 
integral units of energy: 

$$ W_{\text{total}} = \frac{(N + Q - 1)!}{Q!\,(N - 1)!} \tag{10.2} $$

so for our case we expect the total number of microstates to be 10!/6!4! = 210, 
in agreement with the total of the values of W given in Figure 10.3. 
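As a check on Eqs. 10.1 and 10.2, the following sketch lists the macrostates of five particles sharing 6 units of energy and evaluates the multiplicity of each. The helper names (`partitions`, `multiplicity`) are ours, and the order in which the macrostates are generated need not match the labels A through J of Figure 10.3.

```python
from math import factorial, comb
from collections import Counter

def partitions(total, max_parts, largest=None):
    """Yield partitions of `total` into at most `max_parts` positive parts."""
    if largest is None:
        largest = total
    if total == 0:
        yield ()
        return
    if max_parts == 0:
        return
    for first in range(min(total, largest), 0, -1):
        for rest in partitions(total - first, max_parts - 1, first):
            yield (first,) + rest

def multiplicity(energies):
    """Eq. 10.1: W = N! divided by the product of N_E! over occupied energies."""
    w = factorial(len(energies))
    for n_e in Counter(energies).values():
        w //= factorial(n_e)
    return w

N, Q = 5, 6
# Pad each partition with zeros so every macrostate lists all N particles.
macrostates = [p + (0,) * (N - len(p)) for p in partitions(Q, N)]
W = [multiplicity(m) for m in macrostates]
W_total = comb(N + Q - 1, Q)  # Eq. 10.2

for m, w in zip(macrostates, W):
    print(m, w)
print(sum(W), W_total)
```

The individual multiplicities sum to the same total that Eq. 10.2 gives directly.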

As we did previously, let us calculate the probability to measure any particular 
value of the energy of a particle. This can be done by considering a collection of 
all 210 possible microstates and counting the number of times each value of the 
energy appears. Symbolically, 


$$ p(E) = \frac{\sum N_E W}{N \sum W} \tag{10.3} $$

where $N_E$ represents the number of particles with energy E in each particular 
macrostate. The sums are carried out over all of the macrostates (10 in the case of 
Figure 10.3). 

Table 10.2 gives the probabilities associated with measuring each of the 
possible energies. Note that the probability decreases with increasing energy. The 
probabilities are plotted in Figure 10.4. The smooth curve is an exponential of the 
form $p \propto e^{-\beta E}$, where β is a constant chosen to fit the data. You can see that the 
decrease of p with increasing E is approximately exponential. 

TABLE 10.2 Energy Probabilities for the System of Figure 10.3 

Energy   Probability 
0        0.400 
1        0.267 
2        0.167 
3        0.095 
4        0.048 
5        0.019 
6        0.005 

FIGURE 10.4 The probabilities from Table 10.2. The curve is an exponential function that closely fits the points. 

TABLE 10.3 Energy Probabilities for Quantum Particles 

         Probability 
Energy   Integral Spin   Spin 1/2 
0        0.420           0.333 
1        0.260           0.333 
2        0.160           0.200 
3        0.080           0.067 
4        0.040           0.067 
5        0.020           0.000 
6        0.020           0.000 

FIGURE 10.5 The probabilities from Table 10.3 for particles with integral spin. The curve that characterizes the points rises more steeply than the exponential at low energy. 

FIGURE 10.6 The probabilities from Table 10.3 for spin 1/2. The curve that characterizes the points becomes approximately flat at low energies. 

This example represents the application of classical statistics. Although real 
systems composed of many particles have too many macrostates and microstates to 
tabulate, we can analyze their properties by determining the distribution function 
that describes how the energy is shared among the parts of a system. Later in 
this chapter we discuss the application of classical statistics to the distribution 
of energies of the molecules of a gas. Our experience with this simple example 
serves as a useful guide: The true classical distribution function turns out to be an 
exponential of the form $e^{-E/kT}$. However, there are other phenomena, such as the 
electrical conductivity and heat capacity of metals, the behavior of liquid helium, 
and thermal radiation, which cannot be successfully analyzed using classical 
statistics. For these phenomena, we must use the methods of quantum statistics. 

Why should classical statistics differ from quantum statistics? Two of the 
assumptions made in the example of this section are inconsistent with basic 
principles of quantum physics: 


1. In quantum physics, identical particles must be treated as indistinguishable. 
In calculating the multiplicity of the macrostates, we assumed the classical 
particles to be identical but distinguishable. That is, the particles are numbered 1 
through 5 (or carry other distinguishing marks). In macrostate A, for instance, it 
is possible to distinguish the microstate in which particle 1 has E = 6 from the 
microstate in which particle 2 has E = 6, so we count each of these as separate 
microstates in determining the multiplicity of 5. If we regard the particles as 
indistinguishable quantum particles (such as electrons or photons), we can’t tell 
the difference between these microstates. If we can’t observe the five microstates 
of A as separate arrangements, we can’t count them as separate arrangements. 
For identical quantum particles, the multiplicity of each macrostate becomes 
exactly 1. 

2. Quantum mechanics can impose limits on the maximum number of particles 
that can be assigned to any particular state. For example, suppose the particles in 
our example are electrons. The Pauli principle forbids two electrons in a system 
from being in the same state of motion (or having the same set of quantum 
numbers). Electrons can have spin up or spin down, so if there are no other 
quantum numbers associated with these energy states, there can be no more than 
two electrons in any energy state. For electrons, macrostates A, B, C, F, G, H, and 
J are forbidden, because in each one there are more than two particles in the same 
energy state. 


We can repeat the calculation of the probability p(E) for two cases of quantum 
particles: photons or alpha particles, which have integral spins and are not 
restricted by the Pauli principle, and electrons or protons, which have a spin of 1/2 
and are restricted by the Pauli principle to no more than two per energy state. In 
the first case, we have ten macrostates with equal multiplicities of 1, and in the 
second case we have three macrostates (D, E, I) again with equal multiplicities 
of 1. Table 10.3 gives the resulting values of p(E). The probabilities for the 
particles with integral spins are plotted in Figure 10.5; the curve is approximately 
exponential but rises a bit more steeply at low energies. The values for the 
particles with s = 1/2 are plotted in Figure 10.6; the behavior is no longer even 
approximately exponential; instead, it is rather flat near E = 0 and then drops 
rapidly to 0 for the higher energies. 
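The three sets of probabilities can be generated from Eq. 10.3 by changing only the weight assigned to each macrostate. The sketch below is one way to do this; the function names are ours, and the three weighting rules are simply the ones described in the text (Eq. 10.1 for classical particles, a weight of 1 for indistinguishable particles, and a weight of 0 for any macrostate with more than two spin-1/2 particles at the same energy).

```python
from itertools import combinations_with_replacement
from math import factorial
from collections import Counter

N, Q = 5, 6
# A macrostate is a sorted list of single-particle energies that sum to Q.
macrostates = [m for m in combinations_with_replacement(range(Q + 1), N)
               if sum(m) == Q]

def classical_weight(m):
    """Distinguishable particles: multiplicity from Eq. 10.1."""
    w = factorial(N)
    for n_e in Counter(m).values():
        w //= factorial(n_e)
    return w

def integral_spin_weight(m):
    """Indistinguishable particles, no Pauli restriction: weight 1."""
    return 1

def spin_half_weight(m):
    """Indistinguishable spin-1/2 particles: at most two per energy state."""
    return 1 if max(Counter(m).values()) <= 2 else 0

def probabilities(weight):
    """Eq. 10.3 evaluated for E = 0, 1, ..., Q."""
    total = sum(weight(m) for m in macrostates)
    return [sum(m.count(E) * weight(m) for m in macrostates) / (N * total)
            for E in range(Q + 1)]

classical = probabilities(classical_weight)
integral_spin = probabilities(integral_spin_weight)
spin_half = probabilities(spin_half_weight)

for name, p in [("classical", classical), ("integral spin", integral_spin),
                ("spin 1/2", spin_half)]:
    print(name, [round(x, 3) for x in p])
```

Rounded to three places, the three lists reproduce the entries of Tables 10.2 and 10.3.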
The example we have analyzed in this section should be regarded only as 
indicating some differences between classical and quantum statistics. The true 
energy distribution functions for many-particle systems look different from the 
ones obtained for this simple five-particle system. 


Particles with integral spin are described by a quantum distribution function 
that is approximately exponential but that rises more sharply at low energies; 
this difference is responsible for a number of interesting cooperative effects. 
Particles that obey the Pauli principle are described by a very different quantum 
distribution that becomes flat at low energies, like Figure 10.6. The quantum 
distribution functions for these two cases are discussed later in this chapter. 


Example 10.1 





For the case in which five particles share 6 units of 
energy, calculate the probability to observe a particle with 
2 units of energy for (a) distinguishable classical particles, 
(b) indistinguishable quantum particles with integral spin, 
and (c) indistinguishable quantum particles with half- 
integral spin. 


Solution 

(a) For distinguishable classical particles, we can use 
Equation 10.3 with the multiplicities for the 10 macrostates 
given in Figure 10.3 to sum the probabilities to find 2 
units of energy in all of the macrostates. The numerator of 
Eq. 10.3 evaluates to: 

$$ \sum N_2 W = 0 \times 5 + 0 \times 20 + 1 \times 20 + 0 \times 30 + 1 \times 60 + 0 \times 10 + 3 \times 10 + 0 \times 20 + 2 \times 30 + 1 \times 5 = 175 $$

and so 

$$ p(2) = \frac{\sum N_2 W}{N \sum W} = \frac{175}{5 \times 210} = 0.167 $$


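(b) For indistinguishable quantum particles with integral 
spin, each of the macrostates has multiplicity 1, so the 
numerator of Eq. 10.3 becomes 

$$ \sum N_2 W = 0 \times 1 + 0 \times 1 + 1 \times 1 + 0 \times 1 + 1 \times 1 + 0 \times 1 + 3 \times 1 + 0 \times 1 + 2 \times 1 + 1 \times 1 = 8 $$

and 

$$ p(2) = \frac{\sum N_2 W}{N \sum W} = \frac{8}{5 \times 10} = 0.160 $$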





(c) For indistinguishable quantum particles with half-integral 
spin, the states with more than 2 particles having 
the same value of the energy are not permitted and so 
have multiplicity 0 (states A, B, C, F, G, H, and J). The 
remaining allowed states have multiplicity 1. The sum is 
now 

$$ p(2) = \frac{0 \times 1 + 1 \times 1 + 2 \times 1}{5 \times 3} = 0.200 $$





Example 10.2 





A system consists of two particles, each of which has a spin 
of 1. What are the possible values of the z component of 
the total spin and what is the multiplicity associated with 
each value, assuming the particles to be (a) distinguishable 
and (b) indistinguishable? 


Solution 

For each particle, we can have $m_s = +1, 0, -1$, and for 
their combination we have $M_S = m_{s1} + m_{s2}$, where $M_S\hbar$ 
gives the z component of the total spin S. Clearly $M_S$ runs 
from a maximum of +2 to a minimum of −2. (a) For 
distinguishable particles, there are 3 possible $m_s$ values for 
each particle and so a total of 3 × 3 = 9 combinations. 
None of these combinations is prohibited. (b) For indistinguishable 
particles, we cannot count $m_{s1} = +1$, $m_{s2} = 0$ 
and $m_{s1} = 0$, $m_{s2} = +1$ as separate microstates, because we 
cannot distinguish particle 1 from particle 2. So there are 3 
fewer microstates when the particles are indistinguishable. 
The multiplicities are: 
Macrostate   Microstates 
M_S          (m_s1, m_s2)                 Distinguishable   Indistinguishable 

+2           (+1,+1)                      1                 1 
+1           (+1,0), (0,+1)               2                 1 
 0           (+1,−1), (−1,+1), (0,0)      3                 2 
−1           (0,−1), (−1,0)               2                 1 
−2           (−1,−1)                      1                 1 

For distinguishable particles, the possible total spins 
are S = 2 ($M_S$ = +2, +1, 0, −1, −2 for a multiplicity of 
5), S = 1 ($M_S$ = +1, 0, −1 for multiplicity 3), and S = 0 
($M_S$ = 0, multiplicity 1). The total multiplicity is 5 + 3 + 
1 = 9. 

For indistinguishable particles, because we have states 
with $M_S$ = ±2 we clearly must have S = 2. Assigning 
5 of the microstates ($M_S$ = +2, +1, 0, −1, −2) to S = 2, 
there is only 1 microstate left, with $M_S$ = 0, which must be 
assigned to S = 0. Note that for distinguishable particles we 
can have S = 2, 1, or 0, while the indistinguishable quantum 
particles with integral spin can combine only to S = 2 or 
0. This suggests another important difference between the 
statistics of classical and quantum systems: for quantum 
systems, the statistics will allow some spin combinations 
and forbid others. 
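The two columns of multiplicities in Example 10.2 can be generated by counting ordered pairs for distinguishable particles and unordered pairs for indistinguishable ones. The following is a minimal sketch of that counting; the variable names are ours.

```python
from itertools import product, combinations_with_replacement
from collections import Counter

m_values = (+1, 0, -1)  # allowed m_s for a spin-1 particle

# Distinguishable: (m_s1, m_s2) and (m_s2, m_s1) count separately.
distinguishable = Counter(m1 + m2 for m1, m2 in product(m_values, repeat=2))

# Indistinguishable: only the unordered pair {m_s1, m_s2} matters.
indistinguishable = Counter(
    m1 + m2 for m1, m2 in combinations_with_replacement(m_values, 2))

for M in (2, 1, 0, -1, -2):
    print(f"{M:+d}  {distinguishable[M]}  {indistinguishable[M]}")
```

The totals are 9 and 6 microstates, so indistinguishability removes exactly the 3 microstates noted in the solution.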





10.3 THE DENSITY OF STATES 


We have seen in the previous sections how the multiplicity of a state determines 
the probability to find a system in that state. We are now going to extend that 
concept to more complicated systems. 

Let’s consider a system (classical or quantum) composed of a large number of 
particles that can exist in many different energy states. The relative probability 
for the particles to have energy E is given by the distribution function f(E). This 
function might tell us, for example, that it is less likely to find a particle in the 
system with larger amounts of energy than with smaller amounts. For example, 
if our system is a gas at temperature T, it is increasingly less probable to find 
molecules with energy far above kT (where k is the Boltzmann constant). 

Our ultimate goal is to calculate the number of particles with any given value of 
the energy E. This number is in part determined by the distribution function f(E). 
However, the calculation must also take into account that there may be many states 
available at the energy E. This additional factor is related to the multiplicity of 
microstates that we have considered in systems with small numbers of particles. In 
effect, to find the number of occupied states we combine the number of available 
states with the probability that each state is occupied. 

There are two different ways of calculating the number of available states at 
the energy E. If the energy states are discrete and individually observable, the 
number of states available at the energy E is just the degeneracy of the state at that 
energy. For example, if our system is a dilute gas of hydrogen atoms, the number 
of states available at the energy $E_n$ for a state with principal quantum number n 
is the degeneracy of that state, which is $2n^2$. As we go to higher excited states, 
the number of atoms in each state might decrease due to the distribution function 
f(E) and might increase due to the degeneracy factor $2n^2$. The actual number of 
atoms that might be found in the excited state with a particular energy represents 
the combined effect of both factors. 

For another example, consider the rotational excited states in a molecule such 
as HCl. The degeneracy of each level of energy $E_L$ with angular momentum 
quantum number L is 2L + 1. As we go to higher energies corresponding to larger 
values of the angular momentum, the distribution function decreases and the 
degeneracy factor increases, so that the net population of any level is determined 
by the combined effect of both factors. In Section 9.6, we saw an example of how 
these two factors produced a maximum intensity in the absorption spectrum for 
the state with a certain value of L. 

The degeneracy factor and the distribution function can be combined to enable 
us to calculate the number of particles in the system having a certain value of the 
discrete energy $E_n$: 

$$ N_n = d_n f(E_n) \tag{10.4} $$

where $d_n$ is the degeneracy of the level at energy $E_n$. The distribution function 
$f(E_n)$ is properly normalized to ensure that the total number of particles in our 
system is N: 

$$ N = \sum_n N_n = \sum_n d_n f(E_n) \tag{10.5} $$

where the sum is carried out over all the states (although often in practice the 
distribution function f decreases so rapidly for increasing E that it is sufficiently 
accurate to include only a few terms in the summation). 
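To see how the two factors in Eq. 10.4 compete, consider the rotational levels mentioned above. The sketch below makes two assumptions that are not taken from this section: the level energies follow the rigid-rotor form $E_L = BL(L+1)$, and the distribution function is a simple exponential, $f(E) \propto e^{-E/kT}$. The ratio B/kT = 0.05 is a purely illustrative value, not a property of HCl.

```python
from math import exp

B_OVER_KT = 0.05  # hypothetical ratio of rotational constant to kT
L_MAX = 40        # enough terms for the sum in Eq. 10.5 to converge

def relative_population(L, b=B_OVER_KT):
    """Unnormalized N_L = d_L f(E_L) with d_L = 2L + 1 (Eq. 10.4)."""
    degeneracy = 2 * L + 1
    return degeneracy * exp(-b * L * (L + 1))

weights = [relative_population(L) for L in range(L_MAX + 1)]
norm = sum(weights)                    # normalization as in Eq. 10.5
fractions = [w / norm for w in weights]

most_populated = max(range(L_MAX + 1), key=lambda L: fractions[L])
print(most_populated, [round(f, 3) for f in fractions[:8]])
```

With these assumed numbers the population rises with L while the degeneracy dominates and then falls once the exponential takes over, giving a maximum at an intermediate value of L.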

On the other hand, we might be analyzing a system in which the energy states 
are so numerous and spaced so closely together that we cannot observe them as 
separate and individual states. As a result, we may consider only the number of 
states in a small interval dE at the energy E, or equivalently the number of states 
between energies E and E + dE. This method brings to mind the analysis of the 
kinetic energy of the molecules in a gas (see Section 1.3) and the analysis of the 
probability distribution corresponding to solutions of the Schrödinger equation 
(see Section 5.3). 

For this analysis we treat E as a continuous variable. (Of course, we know the 
states are discrete, but there are so many of them and they are so close together 
that this is really a very good approximation.) We define the density of states g(E) 
so that g(E)dE is the number of available states per unit volume in the interval dE 
at energy E (or between E and E + dE). The number of populated states in that 
interval is determined by the density of states factor and the distribution function: 

$$ dN = N(E)\,dE = V g(E) f(E)\,dE \tag{10.6} $$

where V is the volume of the system (which must be included because the density 
of states is the number of states per unit volume). Again we'll assume that the 
distribution function is properly normalized so that the total number of particles 
in the system is fixed at N: 

$$ N = \int dN = \int_0^\infty N(E)\,dE = V \int_0^\infty g(E) f(E)\,dE \tag{10.7} $$

Be sure to remember the difference between N and N(E). The symbol N represents 
the total number of particles in the system, while N(E) represents the number per 
unit energy interval at energy E. That is, N is a pure number, while N(E) is a 
function of E and has units of $(\text{energy})^{-1}$. 

FIGURE 10.7 One-eighth of a spherical shell of radius n and thickness dn in the $n_x, n_y, n_z$ coordinate system. 

In this chapter we discuss the application of classical statistics to the molecules 
of a gas and the application of quantum statistics to the electrons in a metal, which 
we can consider to be a “gas” of electrons. We also discuss the application of 
quantum statistics to the problem of cavity radiation that we first introduced in 
Chapter 3, so we’ll need to understand the density of states of a gas of photons. 


Density of States in a Gas of Particles 


We’ll first discuss the density of states in a “gas” of particles, such as electrons 
or molecules. This turns out to be a useful concept in numerous applications. For 
example, the space around a hot metal filament is filled with a cloud of emitted 
electrons. In a solid metal conductor such as copper, each atom contributes one 
loosely bound electron to the electric current; because these electrons belong to 
the material as a whole rather than to individual atoms, it is appropriate to treat 
the electrons as a gas in the metal. The same calculation applies, as we shall see, 
to an ordinary molecular gas such as nitrogen. 

The basis of our calculation is the particle trapped in a three-dimensional region 
of space, which we assume to be a cubical box in which each side has length L. 
The cubical box represents a three-dimensional potential energy well. (See, for 
example, the analysis of the two-dimensional potential energy well in Section 
5.4.) By an obvious extension of Eqs. 5.41 and 5.45, the wave function and energy 
levels for the infinite three-dimensional potential energy well can be written as 

$$ \psi(x,y,z) = A \sin\frac{n_x \pi x}{L}\,\sin\frac{n_y \pi y}{L}\,\sin\frac{n_z \pi z}{L} \tag{10.8} $$

and 

$$ E = \frac{\hbar^2 \pi^2}{2mL^2}\left(n_x^2 + n_y^2 + n_z^2\right) = \frac{h^2}{8mL^2}\,n^2 \tag{10.9} $$

where $n^2 = n_x^2 + n_y^2 + n_z^2$. 

So now the problem that confronts us in finding the density of states is: how 
many different combinations of the quantum numbers $n_x$, $n_y$, and $n_z$ give us a 
particular value of the energy E? If the energy levels were well separated and 
could be observed individually, we could find their degeneracy and thus obtain 
the density of states. However, in our case the cube is so large compared with 
the typical de Broglie wavelength of the electrons that there is a huge number of 
energy levels in any interval dE, certainly far too many for us to attempt to count. 

One way to find the number of energy states is to imagine a coordinate system 
in which the axes are labeled $n_x$, $n_y$, and $n_z$ (Figure 10.7). The points with a 
common value of E lie on a spherical surface of radius $n = \sqrt{n_x^2 + n_y^2 + n_z^2}$, and 
the points with energies between E and E + dE lie within a spherical shell between 
radius n and radius n + dn. Only positive values of $n_x$, $n_y$, and $n_z$ may occur, and 
thus we have only one-eighth of the complete shell. The entire shell has surface 
area $4\pi n^2$ and thickness dn, so the number of points (each point representing a 
different combination of $n_x$, $n_y$, $n_z$ leading to the same value of E) in this portion 
of the shell is $\frac{1}{8} \times 4\pi n^2\,dn$. 



Finally, if the particles have spin s, each point in this coordinate system 
is really 2s + 1 points, which represent each of the different possible spin 
orientations (because, in the absence of a magnetic field, each of these different 
spin orientations represents a different state having the same energy E). For 
example, if the particles are electrons, there are two spin orientations (one with 
$m_s = +1/2$ and one with $m_s = -1/2$), and for electrons 2s + 1 = 2. The density of 
states must include an additional factor of 2s + 1. Thus the number of states per 
unit volume is 

$$ g(n)\,dn = \frac{1}{V}\,(2s+1)\,\frac{1}{8}\,4\pi n^2\,dn \tag{10.10} $$


The number of states contained in this interval is the same whether we count the 
states in terms of n or E. That is, g(n)dn = g(E)dE. With $E = h^2n^2/8mL^2$, we 
have $dE = 2h^2 n\,dn/8mL^2$ and so 

$$ g(E)\,dE = g(n)\,dn = \frac{4\pi(2s+1)\sqrt{2}\,m^{3/2}}{h^3}\,E^{1/2}\,dE \tag{10.11} $$

using $V = L^3$ for the volume of the cube. The density of states for the gas of 
particles is then 

$$ g(E) = \frac{4\pi(2s+1)\sqrt{2}\,m^{3/2}}{h^3}\,E^{1/2} = \frac{4\pi(2s+1)\sqrt{2}\,(mc^2)^{3/2}}{(hc)^3}\,E^{1/2} \tag{10.12} $$

Density of States in a Gas of Photons 


Let’s consider now a gas of photons. Imagine a cavity such as we discussed in 
Section 3.3: a hollow metal box of volume V with walls at temperature T. The 
box (in the form of a cube with each side of length L) is filled with photons 
having a range of energies from 0 to ∞ (with an equivalent range of frequencies 
or wavelengths). 

Let’s imagine the box to be filled with electromagnetic standing waves. 
Any particular wave has energy $E = pc = c\sqrt{p_x^2 + p_y^2 + p_z^2}$. Each momentum 
component must obey the standing wave condition for that particular wavelength, 
and applying the boundary conditions that the electric field must vanish at the 
locations of the walls gives exactly the same restrictions on the wavelengths or 
wave numbers as the analysis of the three-dimensional infinite potential energy 
well (for which ψ = 0 at x = 0 and x = L, and similarly for y and z). Thus, for 
example, for the x direction we have the wave number $k_x$ subject to the condition 
$k_x L = n_x \pi$, and so $p_x = \hbar k_x = \hbar\pi n_x/L$, with analogous results for the y and z 
directions. The indices $n_x$, $n_y$, and $n_z$ are independent of one another and take 
positive values from 1 to ∞. We can therefore write the energy as 

$$ E = c\sqrt{p_x^2 + p_y^2 + p_z^2} = c\sqrt{\left(\frac{\hbar\pi n_x}{L}\right)^2 + \left(\frac{\hbar\pi n_y}{L}\right)^2 + \left(\frac{\hbar\pi n_z}{L}\right)^2} = \frac{\hbar\pi c}{L}\sqrt{n_x^2 + n_y^2 + n_z^2} \tag{10.13} $$

We now have exactly the same situation that we encountered for the electron gas 
shown in Figure 10.7. The number of points in the portion of the spherical shell 
is again proportional to its volume $\frac{1}{8} \times 4\pi n^2\,dn$. 

Finally, we must account for the possible spin states of the photon. Because 
a photon has s = 1, we might expect that the density of states should include 
the factor 2s + 1 = 3. However, the $s_z = 0$ state is not permitted for photons. 
This is equivalent to requiring that an electromagnetic wave have only one of 
two possible polarizations, either right circular or left circular. The factor that 
accounts for the spin multiplicity in the density of states is thus 2, and the number 
of states per unit volume is given by Eq. 10.10 with 2s + 1 replaced by 2. With 
$E = \hbar\pi c n/L$, we obtain $dE = \hbar\pi c\,dn/L$ and finally 

$$ g(E)\,dE = g(n)\,dn = \frac{1}{\pi^2(\hbar c)^3}\,E^2\,dE \tag{10.14} $$

with again $V = L^3$ for the volume of the cavity. The density of states for the 
photon gas is then 

$$ g(E) = \frac{8\pi E^2}{(hc)^3} = \frac{1}{\pi^2(\hbar c)^3}\,E^2 \tag{10.15} $$

Note that, in contrast to the gas of particles, in which the density of states depends 
on $E^{1/2}$, the density of states for a photon gas depends on $E^2$. 


Example 10.3 





What is the number of available states per cubic meter 
in helium gas within a narrow interval of 0.0002 eV at 
the most probable molecular energy of 0.0086eV at a 
temperature of 200 K? 


Solution 
The density of available states is g(E)dE, with g(E) for particles 
given by Eq. 10.12, using s = 0 and $mc^2$ = 3727 MeV 
for helium atoms and dE = 0.0002 eV for the energy 
interval: 

$$ g(E)\,dE = \frac{4\pi\sqrt{2}\,(mc^2)^{3/2}}{(hc)^3}\,E^{1/2}\,dE $$

$$ = \frac{4\pi\sqrt{2}\,(3727 \times 10^6\ \text{eV})^{3/2}}{(1240\ \text{eV}\cdot\text{nm})^3\,(1\ \text{m}/10^9\ \text{nm})^3}\,(0.0086\ \text{eV})^{1/2}\,(0.0002\ \text{eV}) $$

$$ = 3.9 \times 10^{28}\ \text{m}^{-3} $$

There are about $10^{25}$ gas atoms per m³, which suggests 
that only a very small fraction of the available states is 
populated in an ordinary gas. 
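The arithmetic of Example 10.3 is easy to get wrong because of the unit conversion from nanometers to meters, so a short numerical check is useful. The sketch below evaluates Eq. 10.12 with hc = 1240 eV·nm, the value used in the solution; the function name and argument names are ours.

```python
from math import pi, sqrt

HC_EV_NM = 1240.0  # hc in eV*nm, as used in the text
NM_PER_M = 1.0e9

def g_particles(E_eV, mc2_eV, spin):
    """Eq. 10.12: density of states in states per eV per cubic meter."""
    hc_eV_m = HC_EV_NM / NM_PER_M  # convert hc to eV*m
    prefactor = 4 * pi * (2 * spin + 1) * sqrt(2) * mc2_eV ** 1.5 / hc_eV_m ** 3
    return prefactor * sqrt(E_eV)

# Helium atoms (s = 0, mc^2 = 3727 MeV) at E = 0.0086 eV, interval 0.0002 eV.
states_per_m3 = g_particles(0.0086, 3727e6, 0) * 0.0002
print(f"{states_per_m3:.2e}")
```

The result agrees with the value of about 3.9 × 10^28 states per cubic meter found in the solution.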





Example 10.4 





The interior of a star can be considered to be filled with 
photons of energies in the range of 1 MeV. What is the 
density of available states for a container of photons of 
energies in an interval of 10 keV at 1 MeV? 


Solution 

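From Eq. 10.15 we have 

$$ g(E)\,dE = \frac{8\pi E^2}{(hc)^3}\,dE = \frac{8\pi\,(10^6\ \text{eV})^2\,(10^4\ \text{eV})}{(1240\ \text{eV}\cdot\text{nm})^3\,(1\ \text{m}/10^9\ \text{nm})^3} = 1.3 \times 10^{35}\ \text{m}^{-3} $$
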


Such a large number of available states can accommodate 
a large number of photons. Note that this result depends 
critically on the energy of the photons. For visible light 
(photons of order | eV), a similarly narrow interval could 
accommodate about 10!° energy states per m?. 
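A similar check can be made for the photon gas. The sketch below evaluates Eq. 10.15 for the stellar photons of Example 10.4 and also confirms numerically that the two forms of Eq. 10.15 (one written with h, the other with ħ) are the same function. The numerical value of ħc is derived here from hc = 1240 eV·nm rather than taken from the text.

```python
from math import pi

HC_EV_M = 1240.0e-9            # hc in eV*m (1240 eV*nm)
HBARC_EV_M = HC_EV_M / (2 * pi)  # hbar*c, derived from hc

def g_photons(E_eV):
    """Eq. 10.15 (first form): states per eV per cubic meter."""
    return 8 * pi * E_eV ** 2 / HC_EV_M ** 3

def g_photons_hbar(E_eV):
    """Eq. 10.15 (second form), written with hbar instead of h."""
    return E_eV ** 2 / (pi ** 2 * HBARC_EV_M ** 3)

# Example 10.4: photons near 1 MeV in an interval of 10 keV.
stellar_states = g_photons(1.0e6) * 1.0e4
print(f"{stellar_states:.2e}")
```

Because g(E) grows as E², shifting to lower photon energies at a fixed fractional interval width reduces the count as E³, which is why the result depends so strongly on the photon energy.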

10.4 THE MAXWELL-BOLTZMANN DISTRIBUTION 


Next we must consider the possible forms of the distribution function f(E), which 
describes how the available states at energy E are populated. We’ll begin by 
considering a classical system, in which the density is relatively low. In practice 
this means that the average spacing between the particles is large compared with 
their de Broglie wavelength. The individual particles may have quantized energy 
levels, such as the excited states of atoms or molecules, but the overall system 
does not show quantum behavior. This limit works perfectly well to describe 
gases under ordinary conditions of temperature and pressure. 

The distribution function that applies in this case is the Maxwell-Boltzmann 
distribution, which is given by 

$$ f_{MB}(E) = A^{-1} e^{-E/kT} \tag{10.16} $$

for a system in thermal equilibrium at temperature T. (Here k is the Boltzmann 
constant.) As the energy increases, the occupation probability drops off exponentially. 
We have included a normalization constant $A^{-1}$, which helps to ensure that 
the total number of particles in the system is fixed at N. (The reason for writing 
the normalization constant as $A^{-1}$ rather than simply A will become apparent later 
in this chapter.) 

Let’s apply the Maxwell-Boltzmann distribution to find the number of 
molecules having energy between E and E + dE in a gas at temperature T 
that fills a container of volume V with a total of N molecules. From Eq. 10.6 we 
have (using the density of states from Eq. 10.12) 

$$ dN = N(E)\,dE = V g(E) f_{MB}(E)\,dE = A^{-1}\,\frac{4\pi V(2s+1)\sqrt{2}\,(mc^2)^{3/2}}{(hc)^3}\,E^{1/2} e^{-E/kT}\,dE \tag{10.17} $$

The normalization constant is determined by the condition that the total number 
of molecules, obtained as in Eq. 10.7 by integrating over all energies, must be N: 

$$ N = \int dN = \int_0^\infty N(E)\,dE = A^{-1}\,\frac{4\pi V(2s+1)\sqrt{2}\,(mc^2)^{3/2}}{(hc)^3} \int_0^\infty E^{1/2} e^{-E/kT}\,dE \tag{10.18} $$

The definite integral is a standard form, and carrying out the integration gives 
$A^{-1} = N(hc)^3 / V(2s+1)(2\pi mc^2 kT)^{3/2}$. We obtain for the energy distribution of 
gas molecules 

$$ N(E) = V g(E) f_{MB}(E) = \frac{2N}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT} \tag{10.19} $$

which is identical to Eq. 1.22. This equation is the Maxwell-Boltzmann energy 
distribution. 

This distribution is shown in Figure 10.8. Note that it rises to a maximum 
at the most probable energy $E_p = \frac{1}{2}kT$ and then falls gradually to zero as the 
energy increases, which suggests that it is increasingly rare to find a molecule 
with energy much larger than kT. The shaded strip of width dE represents the 
number dN = N(E)dE given by Eq. 10.17, the number of molecules between E 
and E + dE. The mean or average energy is $E_m = \frac{3}{2}kT$, as we showed in Eq. 1.26. 

FIGURE 10.8 The Maxwell-Boltzmann energy distribution for molecules, showing the most probable energy $E_p = \frac{1}{2}kT$ and the mean energy $E_m = \frac{3}{2}kT$. The shaded strip represents the number of molecules dN with energies between E and E + dE. 

In normalizing Eq. 10.19, Planck’s constant has disappeared. We can therefore 
regard the Maxwell-Boltzmann energy distribution as describing the behavior of 
classical particles, such as an ordinary molecular gas, in which quantum effects 
do not contribute. 
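The properties claimed for Eq. 10.19 (normalization to N, most probable energy kT/2, and mean energy 3kT/2) can be confirmed numerically. The sketch below uses a simple midpoint rule in place of the analytic integrals; the value of kT, the cutoff at 40 kT, and the grid spacing are arbitrary choices of ours and do not affect the conclusions.

```python
from math import exp, pi, sqrt

def mb_energy_density(E, kT):
    """N(E)/N from Eq. 10.19: fraction of molecules per unit energy."""
    return 2 / sqrt(pi) * kT ** -1.5 * sqrt(E) * exp(-E / kT)

def integrate(f, a, b, steps=100_000):
    """Midpoint-rule integral of f from a to b."""
    h = (b - a) / steps
    return h * sum(f(a + (i + 0.5) * h) for i in range(steps))

kT = 0.0252       # eV, roughly room temperature (illustrative)
upper = 40 * kT   # the integrand is negligible beyond this energy

norm = integrate(lambda E: mb_energy_density(E, kT), 0.0, upper)
mean_E = integrate(lambda E: E * mb_energy_density(E, kT), 0.0, upper)

# Locate the maximum of the distribution on a fine grid of energies.
grid = [i * kT / 1000 for i in range(1, 5001)]
E_p = max(grid, key=lambda E: mb_energy_density(E, kT))

print(norm, mean_E / kT, E_p / kT)
```

Note that kT enters only as an overall energy scale, consistent with the observation that Planck's constant plays no role in the normalized distribution.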


Example 10.5 





A container holds one mole of argon gas at a temperature 
of 375 K. Calculate the fraction of the molecules in the 
container with energies between 0.025 eV and 0.026 eV. 


Solution 

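We'll assume that N(E) is varying sufficiently smoothly 
between 0.025 and 0.026 eV that we can use dN = N(E)dE 
to find dN. This avoids having to use an integral to 
find the number. From Eq. 10.19 we then have, with 
$kT = (8.617 \times 10^{-5}\ \text{eV/K})(375\ \text{K}) = 0.0323\ \text{eV}$, 

$$ \frac{dN}{N} = \frac{N(E)}{N}\,dE = \frac{2}{\sqrt{\pi}}\,\frac{1}{(kT)^{3/2}}\,E^{1/2} e^{-E/kT}\,dE $$

$$ = \frac{2}{\sqrt{\pi}}\,\frac{1}{(0.0323\ \text{eV})^{3/2}}\,(0.0255\ \text{eV})^{1/2}\,e^{-0.0255\ \text{eV}/0.0323\ \text{eV}}\,(0.001\ \text{eV}) $$

$$ = 0.014 = 1.4\% $$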
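The assumption that N(E) varies smoothly enough over the interval can itself be tested. The sketch below compares the single-strip estimate used in the solution with a numerical integral of Eq. 10.19 over the same interval; the midpoint rule and the number of steps are our choices.

```python
from math import exp, pi, sqrt

K_EV = 8.617e-5  # Boltzmann constant in eV/K

def fraction_density(E, kT):
    """N(E)/N from Eq. 10.19."""
    return 2 / sqrt(pi) * kT ** -1.5 * sqrt(E) * exp(-E / kT)

kT = K_EV * 375.0
E1, E2 = 0.025, 0.026  # eV

# Single-strip estimate: evaluate at the center of the interval.
strip = fraction_density(0.5 * (E1 + E2), kT) * (E2 - E1)

# Midpoint-rule integral over the same interval for comparison.
steps = 1000
h = (E2 - E1) / steps
integral = h * sum(fraction_density(E1 + (i + 0.5) * h, kT) for i in range(steps))

print(round(strip, 4), round(integral, 4))
```

The two results agree to well within the precision quoted in the solution, so the strip approximation is adequate here.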





Example 10.6 





(a) In a gas of atomic hydrogen at room temperature, 
what is the number of atoms in the first excited state at 
E = 10.2 eV, expressed as a ratio to the number in the 
ground state? (b) At what temperature would we expect to 
find 1/10 as many atoms in the first excited state as in the 
ground state? 


Solution 

(a) For this case we use the discrete form of the expression 
for the number of atoms given by Eq. 10.4. The degeneracy 
factor $d_n$ is $2n^2$ for the state with principal quantum number 
n. At room temperature (T = 293 K), kT = 0.0252 eV. 
The ratio of the number in the excited state (n = 2) to the 
number in the ground state (n = 1) is then 

$$ \frac{N_2}{N_1} = \frac{d_2 e^{-E_2/kT}}{d_1 e^{-E_1/kT}} = \frac{8}{2}\,e^{-(E_2 - E_1)/kT} = 4e^{-10.2\ \text{eV}/0.0252\ \text{eV}} = 4e^{-405} = 0.6 \times 10^{-175} $$

In order to have one atom in the excited state, we therefore 
require about $1.7 \times 10^{175}$ atoms of hydrogen, about 
$3 \times 10^{148}$ kg, a quantity greater than the mass of the universe! 

(b) We now require $N_2/N_1 = 0.1$ and solve for T: 

$$ 0.1 = 4e^{-10.2\ \text{eV}/kT} $$

This gives kT = 2.77 eV or $T = 3.21 \times 10^{4}$ K. 
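Both parts of Example 10.6 follow from Eq. 10.4 with the Maxwell-Boltzmann factor and can be evaluated directly. The sketch below uses kT = 0.0252 eV, the rounded value quoted in the solution; because the exponent is so large, the ratio is very sensitive to this rounding, and a slightly different kT changes the leading digits of the result. The function name is ours.

```python
from math import exp, log

K_EV = 8.617e-5  # Boltzmann constant in eV/K

def population_ratio(n_upper, n_lower, delta_E_eV, kT_eV):
    """Ratio N_upper/N_lower with degeneracy 2n^2 and a Boltzmann factor."""
    degeneracy_ratio = (2 * n_upper ** 2) / (2 * n_lower ** 2)
    return degeneracy_ratio * exp(-delta_E_eV / kT_eV)

# (a) Room temperature, using the rounded kT quoted in the solution.
ratio_room = population_ratio(2, 1, 10.2, 0.0252)

# (b) Solve 0.1 = 4 exp(-10.2 eV / kT) for the temperature.
kT_tenth = 10.2 / log(4 / 0.1)
T_tenth = kT_tenth / K_EV

print(f"{ratio_room:.1e}", round(kT_tenth, 2), round(T_tenth))
```

The ratio in part (a) is small but still representable as an ordinary floating-point number, so no special handling of the exponential is needed.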





Example 10.7 

A certain atom with total atomic spin 1/2 has a magnetic 
moment $\mu = 1.2\,\mu_B$. A collection of such atoms is placed 
in a magnetic field of strength B = 7.5 T. What is the ratio, 
at T = 77 K (the temperature of liquid nitrogen), of the 
number of atoms with their spins aligned parallel to the 
field to those with their spins aligned opposite to the field? 


Solution 

The energy of interaction with the magnetic field is 
$E = -\vec{\mu} \cdot \vec{B}$, so the energy of those aligned parallel to 
the field is $E_1 = -\mu B$ while those aligned antiparallel 
have energy $E_2 = +\mu B$. The degeneracies of the 
states with $m_s = +1/2$ and $m_s = -1/2$ are identical, 
so $d_1 = d_2$. With $\mu B = 1.2(9.27 \times 10^{-24}\ \text{J/T})(7.5\ \text{T}) = 
8.34 \times 10^{-23}\ \text{J} = 5.21 \times 10^{-4}\ \text{eV}$ and kT = 0.00664 eV, 
we have: 

$$ \frac{N_1}{N_2} = \frac{d_1 e^{-E_1/kT}}{d_2 e^{-E_2/kT}} = e^{(E_2 - E_1)/kT} = e^{2\mu B/kT} = e^{2(5.21 \times 10^{-4}\ \text{eV})/0.00664\ \text{eV}} = 1.17 $$

That is, about 54% of the atoms are aligned parallel to 
the magnetic field and 46% are aligned antiparallel to the 
field. 
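The numbers in Example 10.7 can be reproduced in a few lines, which also makes it easy to see how weakly the populations differ even in a strong field at low temperature. The constants below are standard values rounded to four figures; the variable names are ours.

```python
from math import exp

MU_B = 9.274e-24  # Bohr magneton in J/T
K_B = 1.381e-23   # Boltzmann constant in J/K

mu = 1.2 * MU_B   # magnetic moment of the atom
B = 7.5           # field strength in T
T = 77.0          # temperature in K

# Energy splitting E_2 - E_1 = 2*mu*B, measured in units of kT.
x = 2 * mu * B / (K_B * T)

ratio = exp(x)                        # N_1/N_2, parallel to antiparallel
frac_parallel = ratio / (1 + ratio)   # fraction aligned parallel
frac_antiparallel = 1 / (1 + ratio)

print(round(ratio, 3), round(frac_parallel, 3), round(frac_antiparallel, 3))
```

Because the splitting is only about 0.16 kT, the exponential is close to 1 and the two orientations are nearly equally populated.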

The Distribution of Molecular Speeds 


From an experimental standpoint, it is often easier to measure the distribution 
of speeds than the distribution of energies. So let’s use the Maxwell-Boltzmann 
distribution for the distribution of kinetic energies in a gas to obtain the distribution 
of molecular speeds, which then can be tested in the laboratory. That is, we wish 
to obtain an expression for the number of molecules with speeds in the interval 
dv at v (between v and v + dv), represented by dN = N(v)dv. The number of 
molecules dN in the interval is the same, whether we count them in terms of 
energy or speed, and so dN = N(E)dE = N(v)dv, or N(v) = N(E)dE/dv. With 
$E = \frac{1}{2}mv^2$ and dE/dv = mv, we have 

$$ N(v) = N(E)\,\frac{dE}{dv} = \frac{2N}{\sqrt{\pi}\,(kT)^{3/2}} \left(\tfrac{1}{2}mv^2\right)^{1/2} e^{-mv^2/2kT}\,(mv) = N\sqrt{\frac{2}{\pi}}\left(\frac{m}{kT}\right)^{3/2} v^2 e^{-mv^2/2kT} \tag{10.20} $$

This equation, which is known as the Maxwell speed distribution, is graphed in 
Figure 10.9. The shaded strip shows the number dN in the interval dv at v. The 
most probable speed occurs at the value $v_p = (2kT/m)^{1/2}$. 

An example of an experiment to measure the distribution of molecular speeds 
is shown in Figure 10.10. A small hole in the side of an oven allows a stream 
of molecules to escape; we assume the hole to be small enough so that the 
distribution of speeds inside the oven is not changed. The beam of molecules 
is made to pass through a slot in a disc attached to an axle rotating at angular 
velocity ω. At the other end of the axle is a second slotted disc, but the slot is
displaced from the first by an angle θ. In order for a molecule to pass through both
slots and strike the detector, it must travel the length L of the axle in the same
time that it takes the axle to rotate by the angle θ, and thus L/v = θ/ω. Keeping
L and θ fixed, we can vary ω; measuring the number of molecules striking the
detector for each different value of ω enables us to measure the Maxwell speed
distribution. A set of such experimental results is shown in Figure 10.11, and the 
agreement between the measured speed distribution and that predicted according 
to Eq. 10.20 is very impressive. 
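
The selection condition L/v = θ/ω can be turned into a short calculation of which speed reaches the detector at each rotation rate. The apparatus dimensions below are hypothetical values chosen only to give speeds of a few hundred m/s; they are not taken from the experiment of Figure 10.10.

```python
import math


def transmitted_speed(length, theta, omega):
    """Speed v that satisfies L / v = theta / omega (SI units, theta in rad)."""
    return length * omega / theta


# Hypothetical apparatus: 0.20 m axle, slots offset by 5 degrees.
L_axle = 0.20
theta_offset = math.radians(5.0)

for rpm in (3000, 6000, 12000):
    omega = rpm * 2.0 * math.pi / 60.0  # convert rev/min to rad/s
    v = transmitted_speed(L_axle, theta_offset, omega)
    print(f"{rpm:6d} rev/min -> v = {v:7.1f} m/s")
```

Because v is proportional to ω, sweeping the rotation rate sweeps the selected speed linearly across the distribution.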

From this example you can also see the importance of the interval dv. What we
measure is always the product N(v) dv, and in this case the range of velocities is
determined primarily by the width of the slots in the discs. To make dv very small,
and thereby measure v “exactly,” we would need to make the slots very small,
so that very few molecules could get through. For the “perfect” experiment, we
make dv → 0 by making the slot width equal to 0, and no molecules get through
the apparatus!

FIGURE 10.9 The Maxwell speed distribution for gas molecules. The shaded strip
represents the number of molecules with speeds between v and v + dv.

FIGURE 10.10 Apparatus to measure the distribution of molecular speeds.

FIGURE 10.11 Result of measurement of the distribution of atomic speeds in
thallium vapor. The solid line is obtained from the Maxwell speed distribution
for an oven temperature of 870 K.

FIGURE 10.12 In one dimension, the velocity component is restricted to the
interval between $v_x$ and $v_x + dv_x$, which corresponds to the interval $dn_x$
on the $n_x$ axis.


Doppler Broadening of Spectral Lines 


Atoms at rest are expected to emit very sharp, narrow spectral lines; this “natural 
line width” is usually determined by the uncertainty principle ΔE Δt ~ ħ, which
makes the energy of the atomic excited states somewhat uncertain due to their 
finite lifetimes. Because the atoms of a gas are in motion, the frequency or 
wavelength of the light emitted by the atoms often can be Doppler shifted (see 
Eq. 2.22). Even if the container of gas is at rest relative to the observer, some 
of the molecules are moving toward the observer and will show an increased 
frequency, while others are moving away and will show a decreased frequency. 
As a result of the thermal motion, the spectral lines are broadened and show a 
width that can be much larger than the natural width. 

For thermal motion, which involves velocities much smaller than the velocity 
of light, we can ignore motion transverse to the line of sight and consider only 
motion parallel to the line of sight from the observer to the source. Instead of the 
speed distribution, for this purpose we need to know how one component of the 
velocity of the gas atoms, let’s assume it to be $v_x$, is distributed as a function of the
temperature. 

To find the velocity distribution, let’s start with the density of states appropriate
for a one-dimensional gas of particles. In Figure 10.7 we constructed a
three-dimensional coordinate system with the axes labeled $n_x$, $n_y$, and $n_z$. To obtain the
distribution function for $v_x$, we need only a one-dimensional coordinate system
with the $n_x$ axis (Figure 10.12). The “volume” available in one dimension for
particles with energies in the range from E to E + dE is just the range $dn_x$ at $n_x$.
The density of states (per unit length) in one dimension is then

$$g(n_x)\,dn_x = \frac{2s+1}{L}\,dn_x \qquad (10.21)$$

With $E = p_x^2/2m = n_x^2h^2/8mL^2$ we can find $v_x = n_xh/2mL$ and
$dv_x = (h/2mL)\,dn_x$, so

$$g(v_x)\,dv_x = g(n_x)\,dn_x = \frac{2s+1}{L}\,\frac{2mL}{h}\,dv_x = \frac{2m(2s+1)}{h}\,dv_x \qquad (10.22)$$


The number of particles with velocities between $v_x$ and $v_x + dv_x$ is

$$dN = N(v_x)\,dv_x = L\,g(v_x)f(v_x)\,dv_x = \frac{2mL(2s+1)}{h}\,A^{-1}e^{-mv_x^2/2kT}\,dv_x \qquad (10.23)$$

where we have used the Maxwell-Boltzmann distribution function
$f(E) = A^{-1}e^{-E/kT}$ with $E = \tfrac{1}{2}mv_x^2$. To find the normalization constant $A^{-1}$, we again
require that the total number of atoms is N:

$$N = \int dN = \int_{-\infty}^{+\infty} N(v_x)\,dv_x = \frac{2mL(2s+1)}{h}\,A^{-1}\int_{-\infty}^{+\infty} e^{-mv_x^2/2kT}\,dv_x \qquad (10.24)$$

Note that the integration limits are now −∞ to +∞, in contrast to the speed
distribution in which the limits were 0 to ∞. We must include all atoms in the
integral—those that are moving toward the observer as well as those that are
moving away. Evaluating the integral, we find $A^{-1} = Nh\sqrt{m/2\pi kT}\,/\,2mL(2s+1)$
and thus we have the Maxwell velocity distribution

$$N(v_x) = N\left(\frac{m}{2\pi kT}\right)^{1/2} e^{-mv_x^2/2kT} \qquad (10.25)$$

This function is plotted in Figure 10.13. The shaded strip shows the number
$dN = N(v_x)\,dv_x$ having velocity components between $v_x$ and $v_x + dv_x$. This is a
familiar curve known as the Gaussian distribution or the normal distribution 
(also called the “bell-shaped curve”), which has applications in many areas of 
probability and statistics. 

To complete the analysis of the Doppler broadening, we must change the 
distribution function so it describes frequency or wavelength. That is, let’s find 
out how many atoms will emit frequencies in the range from f to f + df, which 
we can write as dN = N(f)df. The relationship between frequency and velocity 
is given by Eq. 2.22. We’ll change notation somewhat and let $f_0$ represent the
unshifted frequency while f represents the observed (Doppler-shifted) frequency.
We then have

$$f = f_0\,\frac{1 - v_x/c}{\sqrt{1 - v_x^2/c^2}} \cong f_0\,(1 - v_x/c) \qquad (10.26)$$

where we have replaced the square root in the denominator with 1 because
$v_x \ll c$ for thermal motions. Solving for $v_x$, we obtain $v_x = c(1 - f/f_0)$ and
taking the magnitude of the differentials we then have $|dv_x| = c\,df/f_0$. With these
substitutions the number of atoms in the small interval becomes

$$dN = N(f)\,df = N(v_x)\,dv_x = N\left(\frac{m}{2\pi kT}\right)^{1/2} e^{-mc^2(1 - f/f_0)^2/2kT}\,\frac{c\,df}{f_0} \qquad (10.27)$$

and so the frequency distribution function is

$$N(f) = \frac{Nc}{f_0}\left(\frac{m}{2\pi kT}\right)^{1/2} e^{-mc^2(1 - f/f_0)^2/2kT} \qquad (10.28)$$


FIGURE 10.13 The Maxwell velocity distribution for gas molecules. The
distribution is centered on $v_x = 0$. The shaded strip represents the number of
molecules with velocity components between $v_x$ and $v_x + dv_x$.

FIGURE 10.14 Doppler-broadened spectral line. The natural linewidth has been
exaggerated for the drawing; typically the natural linewidth is no more than
$10^{-3}$ to $10^{-4}$ of the broadened linewidth.

The frequency distribution function is shown in Figure 10.14. The broadening
is usually characterized by giving the width of the spectral line Δf, defined as
the range over which the intensity falls to half the maximum value on either
side. (This is known as the “full width at half maximum” or FWHM.) The only
factor in Eq. 10.28 that depends on frequency is the exponential, and so the width
is determined by the frequencies at which the exponential (which is equal to 1
at $f = f_0$) falls by half: $e^{-mc^2(1 - f/f_0)^2/2kT} = 1/2$ or, taking the logarithm of both
sides,

$$-\frac{mc^2}{2kT}\left(1 - \frac{f}{f_0}\right)^2 = \ln\frac{1}{2} \qquad (10.29)$$

which can be solved to give $f = f_0\left(1 \pm \sqrt{(2\ln 2)kT/mc^2}\right)$. The two solutions
(corresponding to the + and − signs) give the two points at which the distribution
falls to half its maximum. The interval between those two points is

$$\Delta f = 2f_0\sqrt{(2\ln 2)kT/mc^2} \qquad (10.30)$$

We can write a similar expression for the wavelength broadening because
$\Delta f/f_0 = \Delta\lambda/\lambda_0$. The Doppler broadening is directly related to the temperature,
and so a measurement of the width of spectral lines provides a way to determine
the temperature of the emitting atoms. This is a powerful means of determining
the temperatures of stars from observing the widths of their spectral lines.
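
To see the sizes involved, Eq. 10.30 can be evaluated directly and also inverted to recover a temperature from a measured width. The sketch below assumes, purely for illustration, hydrogen atoms emitting at 656.3 nm in a gas at 6000 K; these values and the function names are not taken from the text.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K
C = 2.99792458e8    # speed of light, m/s


def doppler_fwhm(f0, T, m):
    """Eq. 10.30: full width at half maximum of a Doppler-broadened line."""
    return 2.0 * f0 * math.sqrt(2.0 * math.log(2.0) * K_B * T / (m * C**2))


def temperature_from_width(delta_f, f0, m):
    """Eq. 10.30 solved for T, given a measured width delta_f."""
    return m * C**2 * (delta_f / (2.0 * f0))**2 / (2.0 * math.log(2.0) * K_B)


# Assumed example: hydrogen atoms, 656.3 nm line, 6000 K gas.
m_h = 1.6735e-27       # kg
lambda0 = 656.3e-9     # m
f0 = C / lambda0       # unshifted frequency, Hz
T_gas = 6000.0         # K

delta_f = doppler_fwhm(f0, T_gas, m_h)
relative_width = delta_f / f0              # equals delta_lambda / lambda0
delta_lambda = relative_width * lambda0

print(f"relative width:    {relative_width:.2e}")
print(f"wavelength width:  {delta_lambda * 1e9:.4f} nm")
print(f"recovered T:       {temperature_from_width(delta_f, f0, m_h):.0f} K")
```

Because the width scales as the square root of T/m, light atoms in hot gases give the broadest lines.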


10.5 QUANTUM STATISTICS 


As we discussed in Section 10.2, the distribution functions for the indistinguishable 
particles of quantum physics are different from those of classical physics. Because 
of the unusual behavior of quantum systems, we must have separate distribution 
functions for particles that obey the Pauli exclusion principle (such as electrons) 
and particles that do not obey the Pauli principle (such as photons). We will not 
derive these distribution functions, but merely state them and discuss some of 
their properties and their applications. 

Particles that do not obey the Pauli principle are those with integral spins 
(0, 1, 2, ..., in units of ħ). Their statistical properties are determined by the
Bose-Einstein distribution function:

$$f_{BE}(E) = \frac{1}{A_{BE}\,e^{E/kT} - 1} \qquad (10.31)$$


Particles described by this distribution are known collectively as bosons. The 
constant $A_{BE}$ serves as a kind of normalization constant, in analogy with the factor
A in the Maxwell-Boltzmann distribution (and the comparison shows why we
included this factor as $A^{-1}$ in Eq. 10.16).

Particles of half-integral spin (1/2, 3/2, ...) that obey the Pauli principle, such as
electrons or nucleons, are described by the Fermi-Dirac distribution function:

$$f_{FD}(E) = \frac{1}{A_{FD}\,e^{E/kT} + 1} \qquad (10.32)$$

These particles are known collectively as fermions.

How the minor change in sign in the denominator between $f_{BE}$ and $f_{FD}$
gives such a radical change in the form of the distribution function is not
immediately obvious, and to show the differences we need to know more about
the normalization coefficient, which is not a constant but depends on T.
For the Bose-Einstein distribution, in most cases of practical interest $A_{BE}$ is
either independent of T or depends so weakly on T that the exponential term
$e^{E/kT}$ dominates the temperature dependence. However, for the Fermi-Dirac
distribution, $A_{FD}$ is strongly dependent on T, and the dependence is usually
approximately exponential, so $A_{FD}$ is written as

$$A_{FD} = e^{-E_F/kT} \qquad (10.33)$$

and the Fermi-Dirac distribution becomes

$$f_{FD}(E) = \frac{1}{e^{(E - E_F)/kT} + 1} \qquad (10.34)$$

where $E_F$ is called the Fermi energy.

Let us look qualitatively at the differences between $f_{BE}$ and $f_{FD}$ at low
temperatures. For the Bose-Einstein distribution, assuming for the moment $A_{BE} = 1$,
in the limit of small T the exponential factor becomes large for large E, and so
$f_{BE} \to 0$ for states with large energies. The only energy levels that have any real
chance of being populated are those with E = 0, for which the exponential factor
approaches 1, the denominator becomes very small, and $f_{BE} \to \infty$. Thus when T
is small, all of the particles in the system try to occupy the lowest energy state.
This effect is known as “Bose-Einstein condensation,” and we will see that it has
some rather startling consequences.

This effect is not possible for fermions, such as electrons. We know that the
electrons in an atom, for example, do not all occupy the lowest energy state, no
matter what the temperature. Let us see how the Fermi-Dirac distribution function
prevents this. The exponential factor in the denominator of $f_{FD}$ is $e^{(E - E_F)/kT}$. For
values of $E > E_F$, when T is small the exponential factor becomes large and $f_{FD}$
goes to zero, just like $f_{BE}$. When $E < E_F$, however, the story is very different, for
then $E - E_F$ is negative, and $e^{(E - E_F)/kT}$ goes to zero for small T, so $f_{FD} = 1$. The
occupation probability is therefore only one per quantum state, just as required by
the Pauli principle. Even at very low temperatures, fermions do not “condense”
into the lowest energy level.

In Figures 10.15 to 10.17, the three distributions $f_{MB}$, $f_{BE}$, and $f_{FD}$ are plotted
as functions of the energy E. (Note the qualitative similarity with Figures 10.4 to
10.6.) You can see, by comparing these figures, that all of the distribution functions
fall to zero at large values of E; when $E \gg kT$, the occupation probability is
very small, as we calculated for $f_{MB}$ in the case of the first excited state of the
hydrogen atom in Example 10.3. Notice also that, even though $f_{MB}$ becomes large
for small E, it remains finite. The Bose-Einstein distribution, $f_{BE}$, on the other
hand, becomes infinite as E → 0; this is the “condensation” effect referred to
earlier, in which all of the particles try to occupy the lowest quantum state.

You can see that $f_{FD}$ never becomes larger than 1.0, just as we expect for
particles that obey the Pauli principle. The function $f_{FD}$ has the value 1.0 for states
with low energy (all states are filled), and it falls quickly to zero at high energy (all
states are empty). The Fermi energy $E_F$ gives the point at which the distribution
function has the value 1/2. At absolute zero, all states below $E_F$ are filled and all
states above $E_F$ are empty.
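
The qualitative behavior of the three distribution functions can be checked with a few lines of code. This is only a sketch: the values kT = 0.025 eV and a Fermi energy of 5 eV are assumed for illustration, and the normalization constants of the Maxwell-Boltzmann and Bose-Einstein functions are simply set to 1.

```python
import math


def f_mb(E, kT, A_inv=1.0):
    """Maxwell-Boltzmann distribution, A^-1 exp(-E/kT)."""
    return A_inv * math.exp(-E / kT)


def f_be(E, kT, A_be=1.0):
    """Bose-Einstein distribution, Eq. 10.31 (diverges as E -> 0 when A_be = 1)."""
    return 1.0 / (A_be * math.exp(E / kT) - 1.0)


def f_fd(E, kT, E_F):
    """Fermi-Dirac distribution, Eq. 10.34."""
    return 1.0 / (math.exp((E - E_F) / kT) + 1.0)


# Assumed illustrative values, all in eV.
kT = 0.025
E_F = 5.0

print(" E (eV)      f_MB        f_BE        f_FD")
for E in (0.001, 0.025, 0.25, 4.9, 5.0, 5.1):
    print(f"{E:7.3f}  {f_mb(E, kT):10.3e}  {f_be(E, kT):10.3e}  {f_fd(E, kT, E_F):10.3e}")
```

The table shows the behavior described above: the Bose-Einstein function grows without limit as E approaches 0, while the Fermi-Dirac function stays at 1 well below the Fermi energy, passes through 1/2 at the Fermi energy, and drops toward zero above it.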

The normalization constant ultimately depends on the number of particles
in the system, determined by integrating the distribution function f(E) after
multiplying it by the density of states g(E). Note how changing the number
of particles changes the normalizations of the different distributions. For the
Maxwell-Boltzmann distribution, increasing N increases the area under the curve
by raising the intercept, thus raising the entire curve. For the Fermi-Dirac
distribution, on the other hand, increasing N increases the area by widening the
curve to the right (increasing $E_F$) while keeping the intercept at 1.0.

FIGURE 10.15 The Maxwell-Boltzmann distribution function.

FIGURE 10.16 The Bose-Einstein distribution function.

FIGURE 10.17 The Fermi-Dirac distribution function.

FIGURE 10.18 The occupation probability of electrons in an electron gas
(a) at T = 0 and (b) at T > 0. The solid dots represent filled states and
the open dots represent empty states. Each energy level can hold a maximum
of 2 electrons (spin up and spin down).

Let’s consider a gas of electrons described by the density of states function 
g(E) given in Eq. 10.12 and thus with $N(E) = Vg(E)f_{FD}(E)$. Figure 10.18a shows
a hypothetical set of energy levels and how they would be populated at T = 0
(with 2 electrons in each energy level). As T increases, some levels above $E_F$
are partially occupied ($f_{FD} > 0$), while some levels below $E_F$ are partially empty
($f_{FD} < 1$). Figure 10.18b shows how the energy levels of a system might be
populated at T > 0. The higher the temperature, the more the distribution spreads,
but notice that only states in the vicinity of $E_F$ are affected. The states at much
lower energies remain filled, and those at much higher energies remain empty. 

The Fermi energy varies only slightly with temperature for most materials, and 
we can regard it as constant for many applications. As we will see in Section 10.7, 
for electrons in a metal $E_F$ depends on the electron density of the material,
which doesn’t change much with temperature. For some materials, notably
semiconductors, the density of conduction electrons can change significantly with
temperature, and thus $E_F$ in these materials is temperature dependent.


Limit of Classical Statistics 


Under what circumstances can we treat a system classically rather than according
to the laws of quantum mechanics? The quantum behavior can be neglected if the
de Broglie wavelength of a particle is much smaller than the physical separation
between the particles. That is, no particle lies within the wave packet of its
neighbors. If we take kT as a representative measure of the kinetic energy of a
particle in a collection of particles at temperature T, then with $p^2/2m = kT$ we
obtain the de Broglie wavelength as

$$\lambda = \frac{h}{p} = \frac{h}{\sqrt{2mkT}} = \frac{hc}{\sqrt{2mc^2kT}} \qquad (10.35)$$

The density N/V gives the number of particles per unit volume, and so the
average spacing d between particles is about $(N/V)^{-1/3}$. The condition for the
applicability of classical physics is then $\lambda \ll d$ or $\lambda/d \ll 1$, which gives

$$\frac{\lambda}{d} = \frac{hc/\sqrt{2mc^2kT}}{(N/V)^{-1/3}} = \frac{hc\,(N/V)^{1/3}}{\sqrt{2mc^2kT}} \ll 1 \qquad (10.36)$$
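
A quick estimate shows how differently the criterion of Eq. 10.36 comes out for an ordinary gas and for the conduction electrons of a metal. In this sketch the helium number density is taken as that of a gas near room temperature and atmospheric pressure, and the electron density (about 8.5 × 10^28 per m³, roughly that of copper) is an assumed illustrative value.

```python
import math

H = 6.62607015e-34  # Planck constant, J s
K_B = 1.380649e-23  # Boltzmann constant, J/K


def thermal_wavelength(m, T):
    """Eq. 10.35: de Broglie wavelength for kinetic energy p^2/2m = kT."""
    return H / math.sqrt(2.0 * m * K_B * T)


def lambda_over_d(m, T, number_density):
    """Eq. 10.36: wavelength divided by the mean spacing d = (N/V)^(-1/3)."""
    return thermal_wavelength(m, T) * number_density ** (1.0 / 3.0)


T_room = 300.0  # K

# Helium gas at roughly atmospheric pressure (assumed density).
helium_ratio = lambda_over_d(6.646e-27, T_room, 2.4e25)

# Conduction electrons in a metal (assumed density, roughly that of copper).
electron_ratio = lambda_over_d(9.109e-31, T_room, 8.5e28)

print(f"helium gas:      lambda/d = {helium_ratio:.3f}")
print(f"metal electrons: lambda/d = {electron_ratio:.1f}")
```

Under these assumptions the ratio is far below 1 for the helium gas, so classical statistics should suffice, while it is well above 1 for the electrons, which therefore require quantum statistics even at room temperature.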


The normalization constant we found from Eq. 10.18 for the Maxwell-Boltzmann
distribution can be written

$$A^{-1} = \frac{N(hc)^3}{V(2s+1)(2\pi mc^2kT)^{3/2}} = \frac{1}{(2s+1)\pi^{3/2}}\left[\frac{hc\,(N/V)^{1/3}}{\sqrt{2mc^2kT}}\right]^3 \qquad (10.37)$$

The quantity in brackets on the right is just λ/d from Eq. 10.36. Apart from
some small factors of order unity, Eq. 10.36 is equivalent to requiring that the
normalization constant of the Maxwell-Boltzmann distribution is small: $A^{-1} \ll 1$,
or that the number of occupied states in the gas is much smaller than the number
of available states.


10.6 APPLICATIONS OF BOSE-EINSTEIN STATISTICS





Thermal Radiation 


As we did in our discussion of thermal radiation in Chapter 3, we consider a 
cavity filled with electromagnetic radiation. For this calculation, we assume the 
box to be filled with a “gas” of photons. Photons have spin 1, so they are bosons 
and obey Bose-Einstein statistics. 

The normalization parameter $A_{BE}$ of the Bose-Einstein distribution, Eq. 10.31,
depends on the total number of particles described by the distribution. Because
photons are continuously created and destroyed as radiation is emitted or absorbed
by the walls of the cavity, the total number of particles is not constant, and the
parameter $A_{BE}$ loses its significance. We can eliminate this factor from the
Bose-Einstein distribution by setting $A_{BE} = 1$ in Eq. 10.31.

The number of photons having energy in the range E to E + dE is, according
to the Bose-Einstein distribution and using the density of states for the photon gas
from Eq. 10.15,

$$dN = N(E)\,dE = Vg(E)f_{BE}(E)\,dE = V\,\frac{8\pi E^2}{(hc)^3}\,\frac{1}{e^{E/kT} - 1}\,dE \qquad (10.38)$$

The radiant energy carried by photons with energy between E and E + dE is
E dN = EN(E) dE, and the contribution to the energy density (energy per unit
volume) of photons with energy E is

$$u(E)\,dE = \frac{EN(E)\,dE}{V} = \frac{8\pi E^3}{(hc)^3}\,\frac{1}{e^{E/kT} - 1}\,dE \qquad (10.39)$$

The total energy density over all photon energies is

$$u = \int_0^\infty u(E)\,dE = \frac{8\pi}{(hc)^3}\int_0^\infty \frac{E^3\,dE}{e^{E/kT} - 1} = \frac{8\pi (kT)^4}{(hc)^3}\int_0^\infty \frac{x^3\,dx}{e^x - 1} \qquad (10.40)$$

where x = E/kT. The integral is a standard form and has the value $\pi^4/15$, so

$$u = \frac{8\pi^5 k^4}{15(hc)^3}\,T^4 \qquad (10.41)$$

This is identical with Stefan’s law (Eq. 3.26) using the Stefan-Boltzmann constant
from Eq. 3.42 and accounting for the factor c/4 that takes us from energy density
of the radiation to radiant intensity I.

We can show that our expression for the energy density leads to Planck’s
equation for the intensity of cavity radiation by changing variables from E to λ.
Substituting $E = hc/\lambda$ and $|dE| = (hc/\lambda^2)\,d\lambda$, we find

$$u(\lambda)\,d\lambda = u(E)\,dE = \frac{8\pi hc}{\lambda^5}\,\frac{1}{e^{hc/\lambda kT} - 1}\,d\lambda \qquad (10.42)$$

Multiplying by c/4 to convert from the energy density of the radiation to the
intensity, we obtain the result that was given in Eq. 3.41.

FIGURE 10.19 The specific heat of liquid helium. The discontinuity at
2.17 K is called the lambda point.

FIGURE 10.20 Liquid helium can be seen dripping from the bottom of this
container, as a result of a thin film flowing up and over the walls from
the liquid inside the container.


Thus the Planck theory of blackbody radiation, which was so successful 
in accounting for experimental results, can be derived from the Bose-Einstein 
distribution for photons (but Planck’s original work was done two decades before 
the development of Bose-Einstein statistics). 
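
The chain from the Bose-Einstein distribution to Stefan’s law can be verified numerically. The sketch below integrates x³/(eˣ − 1) with a simple midpoint rule (a choice made for this example, with an assumed cutoff at x = 50) and then multiplies the energy density of Eq. 10.40 by c/4 to estimate the Stefan-Boltzmann constant.

```python
import math

H = 6.62607015e-34  # Planck constant, J s
C = 2.99792458e8    # speed of light, m/s
K_B = 1.380649e-23  # Boltzmann constant, J/K


def bose_integral(x_max=50.0, steps=50000):
    """Midpoint-rule estimate of the integral of x^3/(e^x - 1) from 0 to x_max."""
    h = x_max / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x**3 / math.expm1(x)  # expm1(x) = e^x - 1, accurate for small x
    return total * h


def energy_density(T):
    """Eq. 10.40: total energy density of the photon gas at temperature T."""
    return 8.0 * math.pi * (K_B * T)**4 / (H * C)**3 * bose_integral()


integral = bose_integral()
sigma = (C / 4.0) * energy_density(1.0)  # intensity per T^4, evaluated at T = 1 K

print(f"numerical integral: {integral:.6f}")
print(f"pi^4 / 15:          {math.pi**4 / 15.0:.6f}")
print(f"sigma estimate:     {sigma:.4e} W m^-2 K^-4")
```

The two values of the integral should agree closely, and the resulting constant can be compared with the accepted Stefan-Boltzmann constant of about 5.67 × 10⁻⁸ W/m²·K⁴.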


Liquid Helium 


One of the most remarkable substances we can study in the laboratory is liquid 
helium. Here are some of its properties: 


1. Helium gas is the most inert of the inert gases. Under normal conditions, 
it forms no compounds, and it has the lowest boiling point, 4.18 K, of any 
material. 

2. Just below its boiling point of 4.18 K, helium behaves much like an ordinary 
liquid. As the helium boils, the escaping gas forms bubbles, like a boiling 
pot of water. As the liquid is cooled further, a sudden transition occurs at 
a temperature of 2.17 K: the violent boiling stops, and the liquid becomes 
absolutely still. (Evaporation continues, but only from the surface.) 

3. As the liquid is cooled below 2.17 K, the specific heat and the thermal 
conductivity both increase suddenly and discontinuously. Figure 10.19 shows 
the specific heat as a function of temperature. The form of the figure looks 
rather like the Greek letter λ, and so the transition point at 2.17 K has become
known as the lambda point. The thermal conductivity rises at the λ point by a
factor of perhaps $10^6$.

4. Above 2.17 K, liquid helium can be held in a vessel with a porous plug in 
the bottom. As soon as the liquid is cooled below 2.17 K, the liquid begins to 
flow easily through the plug. 

5. Below the lambda point, liquid helium has the power to seemingly defy 
gravity, flowing up and over the walls of its container. The helium forms a 
thin film, which lines the walls of the container; the remaining liquid is then 
drawn up by the film like a siphon, and the helium can be seen dripping from 
the bottom of the container, as in the photograph of Figure 10.20. 


All of these strange properties occur because liquid helium obeys Bose-Einstein 
statistics. Ordinary helium has two electrons filling the 1s shell, so the total angular 
momentum of the electrons is zero. It happens that the helium nucleus (alpha 
particle) also has a spin of zero. Therefore the total spin of the atom (electron 
spin + nuclear spin) is zero, and a helium atom behaves like a boson. At 2.17 K, 
a change of phase occurs in the helium liquid. Above the lambda point, helium 
behaves like an ordinary liquid; below the lambda point, liquid helium begins 
to become a superfluid. As the temperature is decreased from the lambda point 
toward absolute zero, the relative concentration of the normal fluid decreases and 
that of the superfluid increases. The unusual properties of liquid helium are all 
caused by the superfluid component, which is also known as a quantum liquid. 
Because the helium atoms obey Bose-Einstein statistics, the Pauli principle does 
not prevent all of the atoms from being in the same quantum state. This begins 
to happen at the lambda point. We can think of the superfluid as being a single 
quantum state made up of a very large number of atoms; the atoms behave in a 
cooperative way, giving the superfluid its unusual properties. 

By way of comparison, if we try the same kinds of experiments with the 
rarer isotope of helium, ³He, the behavior is very different. Although ³He has
zero electronic spin, just like ⁴He, it has only three particles rather than four in
its nucleus, and its nuclear spin is 1/2. The total atomic (electronic + nuclear)
spin is therefore 1/2, and ³He behaves like a fermion and obeys the Fermi-Dirac
distribution. Because the Pauli principle prevents more than one fermion from
occupying any quantum state, no superfluidity is expected for ³He, and indeed
none is observed until ³He is cooled to about 0.002 K. At this point the weak
coupling of two ³He to form a boson occurs, and the ³He pairs can display
the effects of Bose-Einstein statistics. (A related effect involving the pairing of 
electrons is responsible for superconductivity; see Section 11.5.) 


Bose-Einstein Condensation 


Let’s consider the expression for the total number of particles of a system of 
bosons in a volume V. We can treat the bosons as a quantum system similar to 
the electron gas—particles with wave functions that vanish at the boundaries of 
the volume. The density of states is then given by Eq. 10.12 and the total number 
of particles in the volume V is then 


$$N = \int dN = \int_0^\infty N(E)\,dE = \int_0^\infty Vg(E)f_{BE}(E)\,dE = \frac{(2s+1)\,4\pi\sqrt{2}\,V m^{3/2}}{h^3}\int_0^\infty \frac{E^{1/2}}{A_{BE}\,e^{E/kT} - 1}\,dE \qquad (10.43)$$

Previously our approach to an equation of this type was to evaluate the integral
and solve the resulting equation for the constant A, which provides the
normalization to make the total number of particles equal to N. That procedure poses
some difficulties for this integral, so we’ll try a different approach: we’ll see
what Eq. 10.43 tells us about the maximum number of particles that can be
accommodated in the volume V. Because $A_{BE}$ is a pure number that is always
≥ +1 (otherwise the denominator of $f_{BE}$ could become negative, which makes no
sense for a distribution function), we can find this maximum value by making the
denominator in the integral as small as possible, that is, by putting $A_{BE} = 1$. The
resulting integral has the value $1.306\,\pi^{1/2}(kT)^{3/2}$, and the maximum number of
particles is then

$$N = 2.612\,V(2s+1)\left(\frac{2\pi mkT}{h^2}\right)^{3/2} \qquad (10.44)$$


It appears that we can violate this maximum limit by either (1) putting more
particles into the volume V than Eq. 10.44 permits, or (2) lowering the temperature (and
thus reducing the maximum N) so that the actual number of particles in the system
becomes greater than the maximum limit given by Eq. 10.44 for that temperature.
How can we still refer to a “maximum” number of particles in these cases?

To resolve this apparent difficulty, we must look more carefully at what happens
for E = 0. Clearly the Bose-Einstein distribution function $f_{BE}(E)$ becomes infinite
for E = 0 when $A_{BE} = 1$. The integral in Eq. 10.43 doesn’t blow up at E = 0
because the numerator $E^{1/2}$ is zero at E = 0. But there is something very wrong
with that restriction, which requires that the density of states (Eq. 10.12) be zero
at E = 0. Our system must have a ground state, so there is at least one state at
E = 0. This contradicts the calculation which puts g(0) = 0.

If we try to put more particles into the system than the maximum given by
Eq. 10.44 (or equivalently if we try to reduce the temperature below its limit for a
given N), the additional particles all can go into the E = 0 ground state, which is
not subject to the restriction on the maximum value of N. This is Bose-Einstein
condensation—all excess particles are “condensing” into the ground state. Note
that the use of the word “condensation” to describe this effect does not refer
to a gas condensing into a liquid. The particles are “condensed” into the same
quantum state, where they are all described by the same wave function, but they
are all still in the gaseous state.

What we are thus actually calculating in Eq. 10.44 is the number of particles
in all states except the ground state, that is, the number in all the excited states.
Let’s call that value $N_{ex}$. It is this value that is limited by the restriction on the
maximum number of particles that can be accommodated by the Bose-Einstein
distribution. The number of particles in the ground state, $N_0$, has no restriction.
The total number of particles is $N_{total} = N_0 + N_{ex}$.

Let’s solve Eq. 10.44 for the critical temperature at which we expect this
condensation to occur:

$$T_{BEC} = \left(\frac{N_{total}/V}{2.612\,(2s+1)}\right)^{2/3}\frac{h^2}{2\pi mk} \qquad (10.45)$$

Above this temperature, all of the particles can be in excited states of the system
without restriction. When the temperature is reduced to $T_{BEC}$, the excited states
are all fully populated, and any further reduction of the temperature below this
value means that particles must be transforming from excited states into the
ground state. With $N_{ex}$ as the number of particles calculated in Eq. 10.44, we can
combine that equation with Eq. 10.45 to give $N_{ex}/N_{total} = (T/T_{BEC})^{3/2}$, or

$$\frac{N_0}{N_{total}} = 1 - \left(\frac{T}{T_{BEC}}\right)^{3/2} \qquad (10.46)$$

This applies only to temperatures at or below $T_{BEC}$. At $T = T_{BEC}$, $N_0 = 0$—all
of the particles are in the excited states. As T is reduced below $T_{BEC}$, the fraction
$N_0/N_{total}$ increases, approaching 1 (all particles in the ground state) as T → 0.
This is the Bose-Einstein condensation.
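
Eq. 10.46 is easy to tabulate. The sketch below is a minimal illustration; the function name is an assumption of this example, and temperatures above the critical temperature are assigned a condensate fraction of zero because Eq. 10.46 applies only at or below it.

```python
def condensate_fraction(T, T_bec):
    """N0/N_total from Eq. 10.46; zero above the critical temperature."""
    if T >= T_bec:
        return 0.0
    return 1.0 - (T / T_bec) ** 1.5


# Temperatures expressed as fractions of the critical temperature.
for ratio in (1.0, 0.9, 0.5, 0.25, 0.0):
    print(f"T/T_BEC = {ratio:4.2f}  ->  N0/N_total = {condensate_fraction(ratio, 1.0):.3f}")
```

The fraction rises steeply just below the critical temperature: at half the critical temperature, roughly two-thirds of the particles are already in the ground state.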

Einstein first predicted this effect in dilute gases in 1925, but it took 70 years 
until the first experiments were done in 1995. The reason for this is apparent from 
considering the temperature necessary to observe the condensation. If we start 
with an ordinary gas at room temperature (with a density of around $2.4 \times 10^{25}$
molecules/m³), then Eq. 10.45 gives a temperature of around 0.001 K = 1 mK.
This is the temperature at which the condensate begins to form, and to have a 
significant number of particles in the condensate we must be well below this 
temperature. It is clear that very low temperatures are required to observe the 
condensate. However, even if the gas molecules were only weakly interacting, 
a gas at ordinary densities would become a liquid at these temperatures. It is 
therefore necessary to work with gases at extremely low densities, and from 
Eq. 10.45 you can see that as we reduce the density, the temperature necessary to 
observe the Bose-Einstein condensation becomes even smaller. 

To avoid the gas condensing into a liquid, we want the molecules to be very
far apart (corresponding to a very low density). How far apart must they be?
In an ordinary gas, the mean free path (average distance between collisions) is
of the order of 100 molecular diameters. To prevent molecules from colliding and
therefore sticking together (which might trigger the formation of the liquid), let’s
assume the spacing between molecules must be of the order of 100 times larger than
it is in an ordinary gas, which means the density must be smaller by a factor of
$(10^{-2})^3 = 10^{-6}$. From Eq. 10.45 we see that if the density is smaller by $10^{-6}$,
then $T_{BEC}$ will be smaller by a factor of $(10^{-6})^{2/3} = 10^{-4}$. Thus instead of a
temperature of 1 mK, the observation of a Bose-Einstein condensation requires
temperatures of the order of 100 nK.
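
These estimates can be reproduced from Eq. 10.45. The sketch below assumes, for illustration only, atoms with the mass of rubidium-87 and takes s = 0 in the factor (2s + 1); neither choice is specified in the estimate above.

```python
import math

H = 6.62607015e-34  # Planck constant, J s
K_B = 1.380649e-23  # Boltzmann constant, J/K


def t_bec(number_density, m, s=0):
    """Eq. 10.45: critical temperature for Bose-Einstein condensation."""
    reduced = number_density / (2.612 * (2 * s + 1))
    return reduced ** (2.0 / 3.0) * H**2 / (2.0 * math.pi * m * K_B)


# Assumed atom: mass of rubidium-87, treated as a spin-0 boson for simplicity.
m_rb = 87.0 * 1.66054e-27   # kg

n_ordinary = 2.4e25            # molecules per m^3, ordinary gas density
n_dilute = n_ordinary * 1e-6   # density reduced by a factor of 10^-6

T_ordinary = t_bec(n_ordinary, m_rb)
T_dilute = t_bec(n_dilute, m_rb)

print(f"ordinary density: T_BEC = {T_ordinary * 1e3:.2f} mK")
print(f"dilute gas:       T_BEC = {T_dilute * 1e9:.0f} nK")
```

With these assumed values the critical temperature at ordinary density comes out of the order of 1 mK, and reducing the density by 10⁻⁶ lowers it by the expected factor of 10⁻⁴, to the order of 100 nK.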

Such incredibly low temperatures require extraordinary means to produce, 
and that is why it took 70 years to observe the first Bose-Einstein condensate. 
Several successful experiments have been done since 1995, and the experiments 
generally use a combination of laser cooling and magnetic cooling to achieve
these temperatures. In laser cooling, a collection of gas atoms is illuminated with 
a laser beam that is tuned to one of the atomic absorption frequencies. A gas atom 
that happens to be moving toward the laser will absorb a photon and slow down. 
However, a gas atom that is moving away from the laser will absorb a photon and 
speed up. How can this result in an overall slowing of the gas atoms? 

The trick in laser cooling is to take advantage of the Doppler broadening of the 
absorption due to the distribution in the velocities of the gas atoms. The laser beam 
is tuned so that its frequency is a bit below the central frequency of the broadened 
peak. Frequencies smaller than the central frequency correspond to atoms moving 
toward the laser beam; such atoms can absorb at the frequency of the laser and 
thus slow down. Atoms moving in the opposite direction cannot absorb at that 
frequency (because their Doppler shifts are opposite) and so are not affected. If a 
second laser beam, also tuned below the central frequency, illuminates the atoms 
from the opposite side, then atoms moving in either direction will be slowed and 
therefore cooled. In practice, the gas is illuminated by lasers in all six directions 
to achieve cooling by slowing the atoms in three dimensions. The excited state 
formed by absorbing the photon will decay back to the ground state by emitting a 
photon, but the emission occurs in random directions and so doesn’t change the 
velocity distribution of the atoms. 

Laser cooling by itself is insufficient to achieve the temperatures necessary for 
Bose-Einstein condensation. It is also necessary to use some form of magnetic 
cooling. For example, suppose we confine the atoms in a region in which there 
is a magnetic field that is produced by a set of coils. The atoms are moving very 
slowly and do not have enough energy to escape the potential energy barrier that 
is established by the magnetic field. If the magnetic field strength is then reduced 
slightly, the more energetic atoms can escape. The remaining atoms still confined 
by the (weaker) magnetic field are the ones with smaller kinetic energies, and thus 
they have a lower temperature. With each lowering of the field strength, the more 
energetic atoms escape and the remaining gas cools. This is similar to evaporative 
cooling of a cup of hot coffee: the faster-moving molecules are the most likely to 
leave the liquid, and the remaining molecules have a smaller average kinetic energy 
and thus a lower temperature. 
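
The evaporation argument can be illustrated with a toy calculation: draw kinetic energies from a Maxwell-Boltzmann distribution, discard every atom above a cutoff, and compare the mean energy before and after. This is only a sketch. The cutoff of 3kT, the sample size, and the units (kT = 1) are arbitrary assumptions, and the model ignores the collisions that let the remaining atoms settle into a new thermal distribution.

```python
import random

def sample_energies(n, kT, rng):
    """Draw n kinetic energies for a 3D ideal gas.

    Each velocity component is Gaussian, so the energy is
    (kT/2) times the sum of three squared unit normals; the mean is 1.5 kT.
    """
    return [0.5 * kT * sum(rng.gauss(0.0, 1.0) ** 2 for _ in range(3))
            for _ in range(n)]

def mean(values):
    return sum(values) / len(values)

rng = random.Random(0)          # fixed seed so the run is repeatable
kT = 1.0                        # energies measured in units of kT
energies = sample_energies(20000, kT, rng)

cutoff = 3.0 * kT               # assumed depth of the magnetic trap
kept = [e for e in energies if e < cutoff]

mean_before = mean(energies)
mean_after = mean(kept)
print(f"mean energy before: {mean_before:.3f} kT")
print(f"mean energy after:  {mean_after:.3f} kT")
print(f"fraction of atoms kept: {len(kept) / len(energies):.3f}")
```

Lowering the cutoff in steps removes more atoms and lowers the mean energy further, which is the trade-off described above: a colder gas, but fewer atoms.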

The first observations of a Bose-Einstein condensation were reported in 1995 
by Eric Cornell and Carl Wieman using Rb vapor and by Wolfgang Ketterle using 
Na vapor. Figure 10.21 shows the velocity distribution illustrating the formation 
of the condensate in Rb vapor from the original work of Cornell and Wieman. 
At a temperature of 400 nK, the distribution shows a broad peak corresponding 
to a Maxwell velocity distribution centered at v = 0. At 200 nK, a narrow peak 
is superimposed on top of the Maxwell distribution at v = 0. This represents the 
atoms all moving with the same speed, as would be expected for a condensate with 
a large number of atoms in the same state of motion. At still lower temperatures 
(50 nK), the Maxwell distribution has disappeared, so that nearly all of the atoms 
are in the condensed ground state, consistent with Eq. 10.45. 

FIGURE 10.21 Bose-Einstein condensation in Rb atoms. The graphs show a 
representation of the velocity distribution at 400 nK (left), 200 nK (center), and 
50 nK (right). At 400 nK, there is a broad Maxwellian distribution, but as the 
temperature is reduced, the atoms condense into a single quantum state 
characterized by a much narrower velocity profile. 


For the experimental observations of the Bose-Einstein condensation, Cornell, 
Wieman, and Ketterle shared the 2001 Nobel Prize in physics. 


10.7 APPLICATIONS OF FERMI-DIRAC STATISTICS 


Now let’s consider some applications of Fermi-Dirac statistics. We'll discuss 
several different systems consisting of spin-1/2 particles: electrons in metals, 
electrons and neutrons in stars, and ³He in liquid ⁴He. 


The Free Electron Model of Metals 


In a metal, the valence electrons are not very strongly bound to individual atoms, 
and consequently they travel rather freely throughout the volume of the metal. 
We can treat these electrons as a “gas” that obeys the Fermi-Dirac distribution, 
with a density of states given by Eq. 10.12. The number of electrons with energies 
between E and E + dE is then 

$$dN = N(E)\,dE = V g(E) f_{\rm FD}(E)\,dE = V\,\frac{8\pi\sqrt{2}\,m^{3/2}}{h^3}\,E^{1/2}\,\frac{1}{e^{(E-E_{\rm F})/kT}+1}\,dE \tag{10.47}$$

Figure 10.22 shows a graph of N(E). Note that the energy kT is only a small 
interval compared with the range of occupied energy states. When the temperature 
of the metal is increased from 0 K to 300 K (room temperature), only a very small 
fraction of the electrons is affected: a small number move from filled states just 
below E_F to formerly empty states just above E_F. 

FIGURE 10.22 The number of occupied energy levels for electrons at T = 0 and 
T = 300 K, according to the Fermi-Dirac distribution. The Fermi energy E_F is 
chosen to be 3.0 eV. (Horizontal axis: E in eV.) 

We can find a numerical value for E_F at T = 0 by normalizing Eq. 10.47 so 
that the sample contains a total number N of these free electrons: 

$$N = \int dN = \int_0^\infty N(E)\,dE = \frac{8\pi\sqrt{2}\,V m^{3/2}}{h^3}\int_0^\infty \frac{E^{1/2}}{e^{(E-E_{\rm F})/kT}+1}\,dE \tag{10.48}$$

At T = 0, the Fermi-Dirac distribution function has the value 1 for E < E_F and 0 
for E > E_F, so the integral reduces to 

$$N = \frac{8\pi\sqrt{2}\,V m^{3/2}}{h^3}\int_0^{E_{\rm F}} E^{1/2}\,dE = \frac{8\pi\sqrt{2}\,V m^{3/2}}{h^3}\,\frac{2}{3}\,E_{\rm F}^{3/2} \tag{10.49}$$

Solving for E_F, we obtain 

$$E_{\rm F} = \frac{h^2}{2m}\left(\frac{3N}{8\pi V}\right)^{2/3} \tag{10.50}$$

We can also find the mean or average energy of the electrons 

$$E_{\rm m} = \frac{1}{N}\int_0^\infty E\,N(E)\,dE \tag{10.51}$$

and it is left as an exercise to show that 

$$E_{\rm m} = \tfrac{3}{5}E_{\rm F} \tag{10.52}$$


Example 10.8 

Compute the Fermi energy E_F for sodium. 

Solution 
Each sodium atom contributes one valence electron to the metal, and so the 
number of electrons per unit volume, N/V, is equal to the number of sodium atoms 
per unit volume. This in turn can be found from the density ρ and the molar 
mass M of sodium: 

$$\frac{N}{V} = \frac{\rho N_{\rm A}}{M} = \frac{(0.971\times10^{3}\ {\rm kg/m^3})(6.02\times10^{23}\ {\rm atoms/mole})}{0.023\ {\rm kg/mole}} = 2.54\times10^{28}\ {\rm m^{-3}}$$

The Fermi energy now can be found from Eq. 10.50: 

$$E_{\rm F} = \frac{h^2}{2m}\left(\frac{3}{8\pi}\frac{N}{V}\right)^{2/3} = \frac{(hc)^2}{2mc^2}\left(\frac{3}{8\pi}\frac{N}{V}\right)^{2/3} = \frac{(1240\ {\rm eV\cdot nm})^2}{2(0.511\times10^{6}\ {\rm eV})}\left(\frac{3}{8\pi}\,2.54\times10^{28}\ {\rm m^{-3}}\right)^{2/3} = 3.15\ {\rm eV}$$

The average energy of the valence electrons is (3/5)E_F or 1.89 eV. Even at the 
absolute zero of temperature, the electrons still have quite a large average energy. 
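
The arithmetic of Example 10.8 can be checked with a short script. The sketch below uses the same inputs as the example (density, molar mass, hc = 1240 eV·nm, mc² = 0.511 MeV); the function names and the explicit conversion from m⁻³ to nm⁻³ are choices made here for illustration.

```python
import math

HC_EV_NM = 1240.0     # h*c in eV·nm
MC2_EV = 0.511e6      # electron rest energy in eV
N_A = 6.02e23         # Avogadro's number, per mole

def electron_density(density_kg_m3, molar_mass_kg, valence=1):
    """Free electrons per cubic meter, assuming `valence` electrons per atom."""
    return valence * density_kg_m3 * N_A / molar_mass_kg

def fermi_energy_ev(n_per_m3):
    """Fermi energy from Eq. 10.50, written in terms of hc and mc^2."""
    n_per_nm3 = n_per_m3 * 1e-27          # 1 m^-3 = 1e-27 nm^-3
    return (HC_EV_NM ** 2 / (2.0 * MC2_EV)
            * (3.0 * n_per_nm3 / (8.0 * math.pi)) ** (2.0 / 3.0))

n_na = electron_density(0.971e3, 0.023)   # sodium
e_f = fermi_energy_ev(n_na)
e_m = 0.6 * e_f                           # Eq. 10.52: E_m = (3/5) E_F

print(f"N/V = {n_na:.3g} m^-3")
print(f"E_F = {e_f:.2f} eV")
print(f"E_m = {e_m:.2f} eV")
```

Passing a different density, molar mass, and valence to `electron_density` gives the corresponding estimate for other metals under the same free-electron assumption.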





From Figure 10.22 we see that the change in N(E) between T = 0 and 
T = 300 K (room temperature) is relatively small, and so these values for E_F and 
E_m are approximately correct at room temperature. 

The meaning of these numbers is as follows. Instead of isolated atoms with 
individual energy levels, we consider the metal to be a single system with a 
very large number of energy levels (at least as far as the valence electrons 
are concerned). Electrons fill these energy levels, in accordance with the Pauli 
principle, beginning at E = 0. By the time we add 2.54 × 10²² valence electrons 
to 1 cm³ of sodium, we have filled energy levels up to E_F = 3.15 eV; all levels 
below E_F are filled and all levels above E_F are empty. Electrons have an almost 
continuous energy distribution (the levels are discrete, but they are very close 
together) from E = 0 to E = E_F, with an average energy of 1.89 eV. At T = 300 K, 
a relatively small number of electrons is excited from below E_F to above E_F; 
the range over which electrons are excited is of order kT = 0.025 eV, so that only 
electrons within about 0.025 eV of E_F are affected by the change from T = 0 to 
T = 300 K. 
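
How sharply the occupation changes near the Fermi energy can be seen by evaluating the Fermi-Dirac function directly. The following sketch uses E_F = 3.15 eV for sodium and T = 300 K from the discussion above; the sample energies and the rounded value of k are illustrative choices.

```python
import math

K_B_EV = 8.617e-5   # Boltzmann constant in eV/K

def fermi_dirac(energy_ev, fermi_ev, temperature_k):
    """Occupation probability 1 / (exp((E - E_F)/kT) + 1)."""
    if temperature_k == 0:
        # Step function at absolute zero (0.5 exactly at E_F by convention).
        if energy_ev == fermi_ev:
            return 0.5
        return 1.0 if energy_ev < fermi_ev else 0.0
    x = (energy_ev - fermi_ev) / (K_B_EV * temperature_k)
    return 1.0 / (math.exp(x) + 1.0)

E_F = 3.15   # eV, sodium
T = 300.0    # K

for offset in (-0.5, -0.1, -0.025, 0.0, 0.025, 0.1, 0.5):
    f = fermi_dirac(E_F + offset, E_F, T)
    print(f"E - E_F = {offset:+.3f} eV: f = {f:.4f}")
```

States more than a few kT below E_F remain essentially full and states more than a few kT above remain essentially empty, so only the narrow band around E_F takes part in thermal excitation.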

In a similar fashion, if we apply a modest electric field to a metal, the only 
effect is to change the state of motion of a relatively small number of electrons 
near the Fermi energy. Most of the electrons can’t be affected by the electric field, 
because all of the nearby states are already filled. In Chapter 11, we'll discuss the 
heat capacity and the electrical conductivity of metals based on the Fermi-Dirac 
distribution of the electrons. 


White Dwarf Stars 


A star like the Sun has a constant radius because the outward pressure due to 
the radiation traveling from the center (where the fusion reactions take place) 
balances the inward gravitational force that tends toward collapse. Eventually 
the hydrogen fuel will be converted to helium, the rate of fusion reactions will 
decrease, and gravity will take over. The Sun will collapse to a smaller and smaller 
radius, until further contraction is stopped by the Pauli principle. This is the white 
dwarf stage of stellar evolution. 

Let’s consider a star of mass M to be composed originally of hydrogen, with 
equal numbers of protons (hydrogen nuclei) and electrons. (The star is too hot for 
atomic hydrogen to form, so we consider the star to be a “gas” of protons and 
a “gas” of electrons occupying the same spherical volume.) After the hydrogen 
has been converted into helium, the star will contain N electrons and N/2 helium 
nuclei (alpha particles). The helium nuclei are bosons, so the Pauli principle does 
not apply to them during the collapse. The collapse ends when the electrons 
cannot be forced closer together because the Pauli principle would be violated. At 
that point, all of the electron energy levels are filled from 0 to the Fermi energy. 
The average energy E_m of the electrons is (3/5)E_F, as given by Eq. 10.52, and so the 
total energy of N electrons is 

$$E_{\rm elec} = \tfrac{3}{5}NE_{\rm F} = \frac{3}{5}N\,\frac{h^2}{2m_e}\left(\frac{3N}{8\pi V}\right)^{2/3} = \frac{3h^2N}{10m_e}\left(\frac{9N}{32\pi^2}\right)^{2/3}\frac{1}{R^2} \tag{10.53}$$

assuming the electrons to be distributed uniformly throughout a sphere of radius 
R and volume V = (4/3)πR³. 

The total gravitational energy of the star can be found from the mass distribution 
of the helium (the electron mass is negligible in comparison with the helium 
mass). For simplicity, we assume the star to be of uniform density. The result (see 
Problem 38) is 

$$E_{\rm grav} = -\frac{3}{5}\frac{GM^2}{R} \tag{10.54}$$

where M = (N/2)m_He is the mass of the N/2 helium nuclei. The star collapses until 
its energy reaches a minimum value, at which point we can find its radius by 
setting dE/dR equal to 0 and solving the resulting equation, 

$$\frac{dE}{dR} = \frac{d}{dR}\left(E_{\rm elec} + E_{\rm grav}\right) = -\frac{3h^2N}{5m_e}\left(\frac{9N}{32\pi^2}\right)^{2/3}\frac{1}{R^3} + \frac{3}{5}\frac{GM^2}{R^2} = 0 \tag{10.55}$$

which gives 

$$R = \frac{h^2N}{Gm_eM^2}\left(\frac{9N}{32\pi^2}\right)^{2/3} \tag{10.56}$$

Let’s consider the white dwarf star Sirius B, which has a mass of 2.09 × 10³⁰ kg 
(about 5% greater than the mass of the Sun). Equation 10.56 gives a radius 
of 7.2 × 10⁶ m for Sirius B. The measured radius is about 5.6 × 10⁶ m. The 
difference between the calculated and observed values is probably due mostly 
to the relativistic motion of the electrons. Our calculation of the Fermi energy 
assumed the electrons to move nonrelativistically. The Fermi energy of Sirius 
B is about 200 keV, which means that the kinetic energy of electrons near 
the Fermi energy is not small compared with the rest energy (511 keV). We 
also oversimplified the structure of the star by assuming it to be of uniform 
density (which was necessary to obtain Eq. 10.54 for the gravitational energy). 
Nevertheless, even this very rough calculation gives us a good approximation to 
the properties of white dwarf stars and demonstrates another system in which 
Fermi-Dirac statistics can be applied. 
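
The estimate for Sirius B can be reproduced numerically. The sketch below assumes the nonrelativistic, uniform-density result R = (h²N / G m_e M²)(9N / 32π²)^(2/3), obtained by minimizing the sum of the electron energy (3/5)N E_F and the gravitational energy −(3/5)GM²/R, with N = 2M/m_He electrons. The rounded constants and function names are choices made here, so the output should be read as approximate.

```python
import math

H = 6.626e-34      # Planck constant, J·s
G = 6.674e-11      # gravitational constant, N·m^2/kg^2
M_E = 9.109e-31    # electron mass, kg
M_HE = 6.645e-27   # mass of a helium nucleus, kg
EV = 1.602e-19     # joules per electron-volt

def electron_count(star_mass_kg):
    """Two electrons per helium nucleus; electron mass neglected."""
    return 2.0 * star_mass_kg / M_HE

def white_dwarf_radius(star_mass_kg):
    """Radius that minimizes electron plus gravitational energy."""
    n = electron_count(star_mass_kg)
    return (H ** 2 * n / (G * M_E * star_mass_kg ** 2)
            * (9.0 * n / (32.0 * math.pi ** 2)) ** (2.0 / 3.0))

def fermi_energy_kev(star_mass_kg, radius_m):
    """Nonrelativistic Fermi energy (Eq. 10.50) for a uniform sphere."""
    n = electron_count(star_mass_kg)
    volume = 4.0 / 3.0 * math.pi * radius_m ** 3
    e_f = H ** 2 / (2.0 * M_E) * (3.0 * n / (8.0 * math.pi * volume)) ** (2.0 / 3.0)
    return e_f / EV / 1e3

def mean_density(star_mass_kg, radius_m):
    return star_mass_kg / (4.0 / 3.0 * math.pi * radius_m ** 3)

M_SIRIUS_B = 2.09e30                     # kg
r_calc = white_dwarf_radius(M_SIRIUS_B)
print(f"calculated radius: {r_calc:.2e} m")
print(f"Fermi energy at that radius: {fermi_energy_kev(M_SIRIUS_B, r_calc):.0f} keV")
print(f"density at measured radius: {mean_density(M_SIRIUS_B, 5.6e6):.1e} kg/m^3")
```

With these inputs the radius comes out near 7 × 10⁶ m and the Fermi energy near 200 keV, in line with the values quoted above; the small difference from 7.2 × 10⁶ m depends on the constants and the helium mass used.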

Note that the radius of the white dwarf is comparable to the radius of the Earth; 
that is, the white dwarf has the mass of the Sun but the radius of the Earth. The 
average density of Sirius B is about 10⁹ kg/m³, which is about one million times 
the average density of objects on Earth. The white dwarf is indeed an extreme 
state of matter! 

If we treat the electrons relativistically, we can obtain an estimate for the mass 
of the star at which the Pauli principle applied to the electrons is not able to prevent 
gravitational collapse. This value, which is called the Chandrasekhar limit, is 
about 1.4 solar masses. For stars with greater masses, the extreme density forces 
the protons and electrons to combine into neutrons until the star collapses into 
a neutron star, composed entirely of neutrons. Because neutrons obey the Pauli 
principle, we can apply Fermi-Dirac statistics to analyze the properties of neutron 
stars (see Problem 27). From a calculation similar to that for the white dwarf, we 
can find the radius at which the energy of a neutron star is at a minimum: 


$$R = \frac{h^2}{Gm_n^3}\left(\frac{9}{32\pi^2}\right)^{2/3}N^{-1/3} \tag{10.57}$$

In this equation, m_n is the neutron mass and N refers to the number of neutrons 
in the star. For a star of 1.5 solar masses, the radius would be about 11 km with 
a density of about 5 × 10¹⁷ kg/m³. 
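
The neutron-star numbers can be checked in the same way as for the white dwarf. This sketch assumes the minimum-energy radius R = (h² / G m_n³)(9 / 32π²)^(2/3) N^(−1/3) for N neutrons in a uniform sphere, treated nonrelativistically; the solar mass and other constants are rounded values supplied here.

```python
import math

H = 6.626e-34      # Planck constant, J·s
G = 6.674e-11      # gravitational constant, N·m^2/kg^2
M_N = 1.675e-27    # neutron mass, kg
M_SUN = 1.989e30   # solar mass, kg

def neutron_star_radius(star_mass_kg):
    """Radius at which degeneracy plus gravitational energy is a minimum."""
    n = star_mass_kg / M_N                       # number of neutrons
    return (H ** 2 / (G * M_N ** 3)
            * (9.0 / (32.0 * math.pi ** 2)) ** (2.0 / 3.0)
            * n ** (-1.0 / 3.0))

def mean_density(star_mass_kg, radius_m):
    return star_mass_kg / (4.0 / 3.0 * math.pi * radius_m ** 3)

mass = 1.5 * M_SUN
radius = neutron_star_radius(mass)
print(f"radius:  {radius / 1e3:.1f} km")
print(f"density: {mean_density(mass, radius):.1e} kg/m^3")
```

Because the radius varies as N^(−1/3), a more massive star is smaller in this model, the same trend as for white dwarfs.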

Neutron stars are commonly observed as pulsars in which the magnetic field 
of the neutrons traps electrons outside the neutron star, and the rapid rotation of 
the neutron star causes a beam of electromagnetic radiation from the accelerated 
electrons to sweep past the Earth somewhat like the rotating light in a lighthouse. 
For stars heavier than about 6 solar masses, not even the Pauli principle applied 
to the neutrons can prevent further gravitational collapse, and the star will either 
explode as a supernova or collapse to a black hole. 


The Heat Capacity of Dilute Solutions of ³He in ⁴He 


Helium has two stable isotopes, ³He and ⁴He. The isotope ³He is very rare 
(about 10⁻⁶ in abundance relative to ⁴He) in natural He gas. The two isotopes 
are chemically identical and have the same electronic structure, but differ in their 
atomic masses (³He has a mass of about 3 u and ⁴He is about 4 u). The difference 
comes about as a result of their nuclei: the nucleus of ³He contains 2 protons and 
1 neutron, while the nucleus of ⁴He contains 2 protons and 2 neutrons. Protons, 
neutrons, and electrons all have a spin of 1/2. In ⁴He, the 2 electrons combine to 
a total spin of 0, as do the 2 protons and the 2 neutrons. The total spin of ⁴He is 
therefore 0. In ³He, the 2 electrons combine to give a spin of 0 as do the 2 protons, 
but with only 1 neutron the total spin of ³He is 1/2. As a result, ⁴He behaves like a 
boson and ³He like a fermion. 

FIGURE 10.23 The region near E_F, showing N(E) at T = 0 and at temperature T. 
When the temperature is raised from 0 to T, the dN particles in the shaded region 
move to the corresponding region above E_F, increasing their energy by 2ε in the 
process. 

As discussed above, ⁴He becomes a superfluid at temperatures below 2.17 K, 
while ³He does not. In a dilute mixture of liquid ³He and ⁴He below 2.17 K, 
the ⁴He serves as a mostly inert background medium for the ³He, and so we can 
treat the ³He as a dilute “gas” of fermions, just as we treated the electron gas in 
analyzing metals. 

The heat capacity of a dilute mixture of ³He in ⁴He is relatively straightforward 
to measure, so let’s try to calculate the heat capacity using the Fermi-Dirac 
distribution to describe the ³He. Starting with fermions at T = 0, we add energy 
until the collection is at temperature T. Because all of the energy states below 
E_F are filled at T = 0, most of the particles are not able to absorb any additional 
energy. A particle with energy far below E_F cannot absorb energy of the order of 
kT and move to an empty state, because there are no empty states nearby. The only 
particles that can change their state are those within a small energy range of about 
kT at E_F, as shown in Figure 10.22. In going from temperature 0 to temperature 
T, only a relatively small number of particles moves from just below E_F to just 
above E_F, with the rest of the particles remaining in their same energy states. 

Figure 10.23 shows a greatly magnified view of the region around E_F. As the 
temperature is raised from 0 to T, the particles in the region just below E_F move to 
fill states just above E_F. In particular, consider the small number of particles dN in 
the narrow region of width dE located a small energy −ε = E − E_F below E_F. The 
particles fill states in that region at T = 0, but when the temperature is raised to T 
they move to fill states in the corresponding region at an energy ε above E_F. Each 
particle in that narrow interval thus gains an energy of 2ε, and the total energy 
gained by all the particles in that narrow strip is dE_ex = 2ε dN. The strip has height 
N_ex given by the difference between N(E) at T = 0 and N(E) at temperature T: 

$$N_{\rm ex} = N(E, T=0) - N(E, T) = V g(E)\,[1 - f_{\rm FD}(E)] \tag{10.58}$$

where we have used N(E) = V g(E) f_FD(E) and f_FD = 1 for T = 0. The width 
of the strip is dE, so the number of particles in the strip is dN = N_ex dE. With 
−ε = E − E_F and |dε| = |dE|, we obtain the energy gained by the particles in 
the narrow strip: 





$$dE_{\rm ex} = 2\varepsilon\,dN = 2\varepsilon N_{\rm ex}\,d\varepsilon = 2\varepsilon V g(E_{\rm F}-\varepsilon)\left(1 - \frac{1}{e^{-\varepsilon/kT}+1}\right)d\varepsilon \tag{10.59}$$

The energy difference ε, which we defined as the energy of the strip below E_F, 
runs from E_F (where E = 0) to 0 (where E = E_F). The total excitation energy of 
all of the particles that are excited from below E_F to above E_F is 

$$E_{\rm ex} = \int dE_{\rm ex} = -2V\int_{E_{\rm F}}^{0}\varepsilon\,g(E_{\rm F}-\varepsilon)\left(1 - \frac{1}{e^{-\varepsilon/kT}+1}\right)d\varepsilon \tag{10.60}$$

where the minus sign appears because ε decreases as E increases (dE = −dε). 
We can simplify the calculation by noting that g(E) is a very slowly varying 
function compared with f_FD in the region near E_F, and even though we are 
integrating from E_F to 0 the integral is nonzero only in a region very close to E_F. 
We can therefore take g(E_F − ε) ≈ g(E_F) and bring it out of the integral. Again 
because the integrand is nonzero only in a very small region, we can replace the 
lower limit on the integral by ∞. 

$$E_{\rm ex} = -2V g(E_{\rm F})\int_{\infty}^{0}\varepsilon\left(1 - \frac{1}{e^{-\varepsilon/kT}+1}\right)d\varepsilon \tag{10.61}$$

The heat capacity is defined as C = dE_ex/dT. The derivative with respect to T 
can be moved inside the integral, and so 

$$C = \frac{dE_{\rm ex}}{dT} = -2V g(E_{\rm F})\int_{\infty}^{0}\varepsilon\,\frac{e^{-\varepsilon/kT}}{\left(e^{-\varepsilon/kT}+1\right)^2}\,\frac{\varepsilon}{kT^2}\,d\varepsilon = 2V g(E_{\rm F})\,k^2T\int_0^{\infty}\frac{x^2e^{x}}{\left(e^{x}+1\right)^2}\,dx \tag{10.62}$$

where x = ε/kT. The integral is a standard form that has the value π²/6. Putting 
in the value for g(E_F) from Eqs. 10.12 and 10.50, we finally obtain 

$$C = \frac{\pi^2k^2NT}{2E_{\rm F}} \tag{10.63}$$
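
The value π²/6 quoted for the standard integral of x²eˣ/(eˣ + 1)² from 0 to ∞ can be confirmed numerically. The sketch below uses a plain Simpson's rule; the upper cutoff of 60 and the number of steps are arbitrary choices, and the integrand is written in terms of e⁻ˣ (an algebraically identical form) to avoid overflow at large x.

```python
import math

def integrand(x):
    """x^2 e^x / (e^x + 1)^2, rewritten with e^(-x) for numerical safety."""
    ex = math.exp(-x)
    return x * x * ex / (1.0 + ex) ** 2

def simpson(f, a, b, n):
    """Composite Simpson's rule with an even number n of subintervals."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = f(a) + f(b)
    for i in range(1, n):
        total += (4 if i % 2 else 2) * f(a + i * h)
    return total * h / 3.0

value = simpson(integrand, 0.0, 60.0, 6000)
print(f"numerical integral: {value:.6f}")
print(f"pi^2 / 6:           {math.pi ** 2 / 6:.6f}")
```

The integrand falls off like x²e⁻ˣ, so the contribution beyond the cutoff is negligible and the two printed values should agree to the digits shown.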


This equation predicts that the heat capacity of a dilute gas of fermions at low 
temperature should be proportional to T. Figure 10.24 shows the low-temperature 
heat capacity of a dilute mixture of 5% ³He in ⁴He, and you can see how 
well the data agree with the prediction. The relationship is indeed linear in 
T at low temperature. This same behavior also describes the low-temperature 
heat capacity of metals, in which the electrons can also be treated as a dilute 
gas of fermions, but as we will see in the next chapter the contribution of the 
atoms to the heat capacity can often be much larger than the contribution of the 
electrons. 

Equation 10.63 predicts that the slope of the plot of C against T should be 
π²k²N/2E_F, which works out to be 1.24 J/K² for the experiment that obtained 
the results shown in Figure 10.24 (a 5% mixture of ³He in ⁴He and a total of 0.5 
mole of liquid). The measured slope is 3.11 J/K², which differs from the expected 
slope by a factor of 2.5. The discrepancy comes about because we treated the ³He 
atoms like those of a gas, in which the particles move freely. However, when ³He 
moves through ⁴He, there are viscous and other forces that act on the atoms. We 
can account for the difference by assigning the ³He an “effective mass,” which is 
greater than its actual mass; the greater mass simulates the sluggish behavior of 
the ³He atoms as they move through ⁴He. This same factor of about 2.5 appears in 
experiments with very different concentrations of ³He, so it is not related to any 
interaction of ³He atoms with other ³He atoms. It also arises in other experiments, 
such as the study of heat conduction in ³He-⁴He mixtures, so it does indeed 
seem to describe the properties of the mixture itself rather than any particular 
experiment. 
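
The predicted slope can be estimated from Eq. 10.63, C/T = π²k²N/2E_F, with E_F taken from Eq. 10.50. The sketch below is a rough check rather than a reproduction of the original calculation: it assumes a number density of 2.2 × 10²⁸ atoms/m³ for the liquid, takes 5% of the atoms (and of the 0.5 mole) to be ³He, and uses a ³He mass of 3.016 u. The effective-mass factor enters only through E_F, which is inversely proportional to the mass.

```python
import math

H = 6.626e-34         # Planck constant, J·s
K_B = 1.380649e-23    # Boltzmann constant, J/K
N_A = 6.022e23        # Avogadro's number, per mole
U = 1.66053907e-27    # atomic mass unit, kg

def fermi_energy_j(number_density, mass_kg):
    """Eq. 10.50 for spin-1/2 particles, in joules."""
    return (H ** 2 / (2.0 * mass_kg)
            * (3.0 * number_density / (8.0 * math.pi)) ** (2.0 / 3.0))

def heat_capacity_slope(n_particles, fermi_j):
    """C/T from Eq. 10.63, in J/K^2."""
    return math.pi ** 2 * K_B ** 2 * n_particles / (2.0 * fermi_j)

FRACTION = 0.05             # 3He fraction of the mixture
MOLES = 0.5                 # total amount of liquid
LIQUID_DENSITY = 2.2e28     # atoms per m^3 (assumed value for the liquid)
M_HE3 = 3.016 * U           # bare 3He mass, kg

n_he3 = FRACTION * MOLES * N_A
density_he3 = FRACTION * LIQUID_DENSITY

slope_free = heat_capacity_slope(n_he3, fermi_energy_j(density_he3, M_HE3))
slope_eff = heat_capacity_slope(n_he3, fermi_energy_j(density_he3, 2.5 * M_HE3))

print(f"slope with bare mass:      {slope_free:.2f} J/K^2")
print(f"slope with effective mass: {slope_eff:.2f} J/K^2")
```

Under these assumptions the bare-mass slope lands close to the predicted 1.24 J/K², and multiplying the mass by 2.5 raises it to roughly the measured value.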

The success of the Fermi-Dirac distribution function in accounting for the 
properties of such a diverse array of systems (metals, white dwarf stars, and 
dilute mixtures of ³He in ⁴He) is truly impressive. In the next chapter, we shall 
explore in more detail how both Bose-Einstein and Fermi-Dirac statistics can be 
used to help us understand various properties of solids. 

FIGURE 10.24 The heat capacity of 0.5 mole of a dilute (5%) mixture of ³He in 
⁴He. The straight line is a fit to the linear portion of the plot below about 40 mK. 

1. Suppose a container filled with a gas moves with constant 
velocity v. How is the Maxwell velocity distribution different 
for such a gas, compared with the same container of gas 
at rest? 

2. The population inversion necessary for the operation of a 
laser is sometimes called a “negative temperature.” What 
is the meaning of a negative temperature? Does it have a 
physical interpretation? 

3. Figure 10.25 shows two different experimental arrangements 
used to measure the distribution of molecular speeds. Based 
on the figures, explain how each apparatus might operate, 
and try to guess how the observed distribution of molecules 
might appear. Where do the fastest molecules land? The 
slowest? 

4. How would Figure 10.13 change if the temperature of the 
gas were increased? 

5. How is the speed distribution of a gas at temperature T 
different from that at temperature 2T? The energy distribution? 
Sketch the speed and energy distributions at the two 
temperatures. 

6. Consider a mixture of two gases of molecular masses m_1 and 
m_2 = 2m_1 in thermal equilibrium at temperature T. How 
do their speed distributions differ? How do their energy 
distributions differ? 

7. It is generally more convenient, wherever possible, to use 
Maxwell-Boltzmann statistics rather than quantum statistics. 
Under what circumstances can a quantum system be 
described by Maxwell-Boltzmann statistics? 

8. Suppose we had a gas of hydrogen atoms at relatively high 
density. Do the atoms behave as fermions or as bosons? 
Would a gas of deuterium (heavy hydrogen) atoms behave 
any differently? (Hint: The nuclear spin is 1/2 for hydrogen 
and 1 for deuterium.) 

9. The early universe contained a large density of neutrinos 
(massless spin-1/2 particles that travel at the speed of light). 
Which statistical distribution would be needed to describe 
the properties of the neutrinos? 

10. Would you expect the photoelectric effect to depend on the 
temperature of the surface of the metal? Explain. 

11. Estimate the mean kinetic energy of the “free” electrons in 
a metal if they obeyed Maxwell-Boltzmann statistics. How 
does this compare with the result of applying Fermi-Dirac 
statistics? Why is there such a difference? 

FIGURE 10.25 Question 3. 


10.1 Statistical Analysis 


1. A collection of three noninteracting particles shares 3 units 
of energy. Each particle is restricted to having an integral 
number of units of energy. (a) How many macrostates are 
there? (b) How many microstates are there in each of the 
macrostates? (c) What is the probability of finding one of 
the particles with 2 units of energy? With 0 units of energy? 

2. (a) Considering the numbers of heads and tails, how many 
macrostates are there when 5 coins are tossed? (b) What is 
the total number of possible microstates in tossing 5 coins? 
(c) Find the number of microstates for each macrostate, and 
be sure the total agrees with your answer to part (b). 

3. Consider a system consisting of two particles, one with 
spin s = 1 and another with spin s = 1/2. (a) Considering 
a microstate to be an assignment of the z component of 
the spins of each of the particles, what is the total number 
of microstates of the two-particle system? (b) How 
many macrostates are there for the total spin of the 
two-particle system? (c) Find the number of microstates for each 
macrostate, and be sure the total number agrees with your 
answer to part (a). 


10.2 Classical and Quantum Statistics 


4. Calculate the probabilities for E = 0, 3, and 5 listed in Table 
10.2. 

5. Calculate the probabilities given in Table 10.3 for E = 0 
and E = 3 for (a) integral spin and (b) spin 1/2. 

6. A system of four oscillator-like particles shares 8 units 
of energy. (That is, the particles can accept energy only 
in equal units, in which the oscillator spacing is 1 unit.) 
(a) List the macrostates, and for each macrostate give the 
number of microstates for distinguishable classical particles, 
indistinguishable quantum particles with integral spin, and 
indistinguishable quantum particles with half-integral spin. 
(b) Calculate the probability to find a particle with exactly 
2 units of energy for each of the three different types of 
particles. 


7. A system consists of two particles, each of which has a spin 
of 3/2. (a) Assuming the particles to be distinguishable, what 
are the macrostates of the z component of the total spin, and 
what is the multiplicity of each? (b) What are the possible 
values of the total spin S and what is the multiplicity of 
each value? Verify that the total multiplicity matches that 
of part (a). (c) Now suppose the particles behave like 
indistinguishable quantum particles. What is the multiplicity of 
each of the macrostates of the z component of the total spin? 
(d) Show that for these quantum particles it is possible to 
have only combinations with total spin S = 2 or 0. 


10.3 The Density of States 


8. The universe is filled with photons left over from the 
Big Bang that today have an average energy of about 
2 × 10⁻⁴ eV (corresponding to a temperature of 2.7 K). 
What is the number of available energy states per unit 
volume for these photons in an interval of 10⁻⁵ eV? 

9. In certain semiconductors, the conducting regions are 
grown in very thin layers, which can be regarded as 
two-dimensional regions holding an electron gas. Calculate the 
density of states (per unit area) for a gas of particles of mass 
m and spin s confined to move in two dimensions in a square 
region of length L on each side. 

10. Calculate the density of states (per unit area) for a collection 
of photons confined to a two-dimensional region in the shape 
of a square of length L on each side. 

11. In a conductor like copper, each atom provides one electron 
that is available to conduct electric currents. If we assume 
that the electrons behave like a gas of particles at room 
temperature with a most probable energy of 0.0252 eV, what 
is the density of states in an interval of 1% about the most 
probable energy? 


10.4 The Maxwell-Boltzmann Distribution 


12. A system consists of N particles that can occupy two 
energy levels: a nondegenerate ground state and a three-fold 
degenerate excited state, which is at an energy of 0.25 eV 
above the ground state. At a temperature of 960 K, find the 
number of particles in the ground state and in the excited 
state. 

13. A system with nondegenerate energy levels has three energy 
states: a ground state at E = 0 and excited states at energies 
of 0.045 eV and 0.135 eV. At a temperature of 650 K, find 
the relative numbers of particles in the three states. 

14. Show that the most probable speed v_p of the Maxwell speed 
distribution is (2kT/m)^(1/2). 

15. A container holds one mole of helium gas at a temperature of 
293 K. (a) Show that the mean energy E_m of the molecules 
is 0.0379 eV. (b) How many molecules have energies in an 
interval of width 0.01E_m centered on E_m? 

16. A cubic container holds one mole of argon gas at a 
temperature of 293 K. (a) How many molecules have speeds 
between 500 and 510 m/s? (b) How many molecules have 
velocities between 500 and 510 m/s in one particular 
direction? Explain any differences between the answers to (a) 
and (b). 

17. The photosphere of the Sun has a temperature of 5800 K. 
(a) Calculate the energy linewidth of the first transition in 
the Lyman series of hydrogen in the Sun’s photosphere. 
(b) For comparison, calculate the natural linewidth, assuming 
a lifetime of 10⁻⁸ s. 


10.5 Quantum Statistics 

18. Do we expect to be able to use Maxwell-Boltzmann statistics 
to analyze (a) nitrogen gas at standard conditions (room 
temperature, 1 atmosphere pressure); (b) liquid water at 
room temperature; (c) liquid helium at 4 K; (d) conduction 
electrons in copper at room temperature? 

19. (a) What pressure must be applied to nitrogen gas at room 
temperature before Maxwell-Boltzmann statistics begins to 
fail? (b) To what temperature must we cool nitrogen gas at 1 
atmosphere before Maxwell-Boltzmann statistics begins to 
fail? 


10.6 Applications of Bose-Einstein Statistics 

20. (a) Show that the total number of photons per unit volume at 
temperature T is 

$$\frac{N}{V} = 8\pi\left(\frac{kT}{hc}\right)^3\int_0^\infty\frac{x^2\,dx}{e^x-1}$$

(b) The value of the integral is about 2.404. How many photons 
per cubic centimeter are there in a cavity filled with radiation 
at T = 300 K? At T = 3 K? 

21. A blackbody is radiating at a temperature of 2.50 × 10³ K. 
(a) What is the total energy density of the radiation? (b) What 
fraction of the energy is emitted in the interval between 1.00 
and 1.05 eV? (c) What fraction is emitted between 10.00 and 
10.05 eV? 

22. Find the photon energy at which the blackbody energy 
spectrum u(E) is a maximum. Compare this result with 
Wien’s displacement law (see Chapter 3) and account for 
any differences. 


10.7 Applications of Fermi-Dirac Statistics 

23. Compute the Fermi energy and the average electron energy 
for copper. 

24. Calculate the Fermi energy for magnesium, assuming two 
free electrons per atom. 

25. A certain metal has a Fermi energy of 3.00 eV. Find the 
number of electrons per unit volume with energy between 
5.00 eV and 5.10 eV for (a) T = 295 K; (b) T = 2500 K. 

26. Derive Eq. 10.52 from Eq. 10.51. 

27. Assume a neutron star consists of N neutrons (fermions with 
spin 1/2) in a sphere of radius R and uniform density. The 
star is in equilibrium because the inward gravitational force, 
which tends to collapse the star, is opposed by a repulsion 
due to the Pauli principle, which prevents the neutrons from 
moving closer together. (a) Find an expression for the radius 
of a neutron star. (b) Evaluate the radius for a star of mass 
equal to 3 solar masses. (c) What is the density of the star? 

28. Consider a neutron star of mass equal to twice the mass 
of the Sun. (a) Evaluate the Fermi energy and determine 
whether classical or relativistic kinematics should be used in 
the analysis. (b) Find the de Broglie wavelength of a neutron 
at the Fermi energy and compare with the average distance 
between neutrons. 

29. For a 5.0% mixture of ³He in 0.50 mole of ⁴He, calculate 
the heat capacity in J/K at a temperature of 0.025 mK and 
compare with the data shown in Figure 10.24. The density 
of liquid ⁴He at this temperature is 2.2 × 10²⁸ atoms/m³. 
Assume an effective mass of ³He that is 2.5 times its ordinary 
mass. 
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Show that a system of 2 indistinguishable quantum particles 
with spin 2 can combine only to a total spin of 0, 2, or 4. 
Consider a collection of N noninteracting atoms with a single 
excited state at energy &. Assume the atoms obey Maxwell- 
Boltzmann statistics, and take both the ground state and the 
excited state to be nondegenerate. (a) At temperature T, 
what is the ratio of the number of atoms in the excited state 
to the number in the ground state? (b) What is the average 
energy of an atom in this system? (c) What is the total 
energy of the system? (d) What is the heat capacity of this 
system? 

Suppose we have a gas in thermal equilibrium at temperature 
T. Each molecule of the gas has mass m. (a) What is the 
ratio of the number of molecules at the Earth’s surface to the 
number at height / (with potential energy mgh)? (b) What is 
the ratio of the density of the gas at height h to the density 
Po at the surface? (c) Would you expect this simple model 
to give an adequate description of the Earth’s atmosphere? 
A collection of noninteracting hydrogen atoms is maintained 
in the 2p state in a magnetic field of strength 5.0 T. (a) At 
room temperature (293 K), find the fraction of the atoms in 
the m, = +1,0, and —1 states. (b) If the 2p state made a 
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transition to the 1s state, what would be the relative inten- 
sites of the three normal Zeeman components? Ignore any 
effects of electron spin. 

The following method is used to measure the molecular 
weight of very heavy molecules. A liquid containing the 
molecules is spun rapidly in a centrifuge, which estab- 
lishes a variation in the density of the liquid. The density 
is measured, such as by absorption of light, to determine 
the molecular weight. Assign a fictitious “centrifugal” force 
to act on the molecules and show that the density varies 
as p = pyeme* | 2kT where w is the angular velocity of the 
centrifuge and x measures the distance along the centrifuge 
tube. 

In sodium metal at room temperature, compute the energy 
difference between the points at which the Fermi-Dirac dis- 
tribution function has the values 0.1 and 0.9. What do you 
conclude about the “sharpness” of the distribution? 

In sodium metal (see Example 10.8), calculate the number of 
electrons per unit volume at room temperature in an interval 
of width 0.01E_F at the mean energy E_m.

Protons and neutrons are spin-1/2 particles in the nucleus.
Find the average energy of the protons and neutrons in the 
nucleus of a uranium atom, which contains 92 protons and 
143 neutrons and has the shape of a sphere of radius
7.4 × 10⁻¹⁵ m.

Consider a uniform spherical distribution of matter of radius 
r and density p. (a) Imagine that a small increment of mass 
dm is brought from infinity to radius r. What is the change 
in the gravitational potential energy of the system consisting 
of the sphere and this mass increment? (b) Suppose we 
bring in from infinity a series of small mass increments 
that eventually form a thin spherical shell of radius r and 
thickness dr about the central sphere. What is the change 
in the potential energy of the system? (c) What is the total 
change in potential energy involved with creating a sphere 
of mass M and radius R? 

(a) For a white dwarf star of mass equal to the mass 
of the Sun, find the de Broglie wavelength for electrons 
at the Fermi energy. Use nonrelativistic kinematics and
assume the star to be composed of helium nuclei (alpha
particles) and of uniform density. (b) Estimate the aver- 
age distance between the electrons and compare with their 
de Broglie wavelength. What can you conclude from this 
comparison? 

Measuring the relative population of magnetic substates of 
nuclei provides a direct way of determining the temperature 
for very cold systems, using a thermometer that is absolute 
and needs no calibration. The nucleus ⁶⁰Co behaves as if it
has a spin of 5 and a magnetic moment of μ = γS, where
S represents the nuclear spin and γ is a constant equal to
3.64 × 10⁷ T⁻¹s⁻¹. When Co atoms are imbedded in a piece
of magnetized iron, the Co nuclei experience a magnetic field
of B = −Bk, where B = 29.0 T and k is the unit vector in
the z direction. (a) In a certain experiment using a Co in
Fe thermometer, the ratio r of the population of the second 
lowest substate to that population of the lowest substate 
was observed to be r = 0.419. What is the corresponding 
temperature? (b) At that temperature, what is the ratio of 
the population of the m = 0 substate to that of the lowest 
substate? 

The molecule cyanogen (CN), which is commonly found in 
interstellar gas clouds, has rotational excited states that can 
absorb visible light. These rotational states are populated by 
the warming effect of the cosmic background radiation that 
is a remnant of the creation of the universe. The energy of the
first excited rotational state (L = 1) is 4.71 × 10⁻⁴ eV above
the ground state. (a) The ratio of the intensity of the radiation
absorbed in the first excited rotational state to the intensity
of the radiation absorbed in the rotational ground state is
0.421 ± 0.017. Assuming this factor represents the relative
populations of the two states, calculate the temperature of 
the cyanogen molecules and its uncertainty. (b) Based on 
your deduced temperature, calculate the expected ratio of 
the absorption intensity from the second rotational state to 
that of the ground state and compare with the observed relative
intensity (0.0121 ± 0.0014). Direct observation of this
background radiation shows it to have the expected thermal 
radiation spectrum at this temperature (see Chapter 15).

In this chapter we study the way atoms or molecules combine to make solids. In
particular, we discuss how the principles of quantum mechanics are essential in 
understanding the properties of solids. 

At first thought, there seem to be so many different solids that to classify them 
and form some general rules for their properties would appear to be a hopeless 
task. The book you are reading is made of paper and cloth, held together by a glue 
made from resins, once liquid and now solid. Your desk might be made of wood, 
metal, or plastic; your chair might be made of similar materials, and might perhaps 
be covered with cloth, leather, or plastic fabric and contain fiber or synthetic foam 
padding. Around you, there might be many books and papers, pencils made of 
wood and metal and graphite, rubber erasers, pens of metal and plastic. A plastic 
body and a liquid crystal display surround the semiconductors that lie at the heart 
of your calculator or computer, your cell phone, and your portable multimedia 
player. Looking out through a glass window, you see structures made of wood, 
bricks, concrete, or metal, selected for strength, utility, or attractiveness. Each of 
these solids has a characteristic color, texture, strength, hardness, or ductility; it has 
a certain measurable electrical conductivity, heat capacity, thermal conductivity, 
magnetic susceptibility, and melting point; it has certain characteristic emission 
or absorption spectra in the visible, infrared, ultraviolet, or other regions of the 
electromagnetic spectrum. 

It is a fair generalization to say that all of these properties depend on two 
features of the structure of the material: the type of atoms or molecules of which 
the substance is made, and the way those atoms or molecules are joined or 
stacked together to make the solid. It is the formidable task of the solid-state (or 
condensed-matter) physicist or physical chemist to try to relate the structure of 
materials to their observed physical or chemical properties. 

Quantum mechanics plays a fundamental role in determining properties of 
the solid: mechanical, electrical, thermal, magnetic, optical, and so forth. In this 
chapter we illustrate the application of quantum mechanics to the study of solids 
by studying some of their thermal, electrical, and magnetic properties. 


11.1 CRYSTAL STRUCTURES 


Our discussion will concentrate on materials in which the atoms or molecules 
occupy regular or periodic sites; this structure is called a lattice, and materials
with this structure are called crystals. Crystalline materials include many metals, 
chemical salts, and semiconductors. One property that distinguishes crystals is 
their long-range order—once we begin constructing the lattice in one location, 
we determine the placement of atoms that are quite far away. In this respect, the 
crystal is like a brick wall, in which the bricks are stacked in a periodic array and 
the placement of a brick is predetermined by the original arrangement of bricks 
far away compared with the size of a brick. (By contrast, amorphous materials 
such as glass or paper have no long-range order, and their structure is more 
similar to a pile of bricks than to a brick wall.) 

Solid crystals can be classified by the cohesive forces that are responsible for 
holding the lattice together, as well as by the shape of the arrangement of the 
atoms in the lattice. We’ll look at a few different ways that atoms can be bound 
together in solids. 


Ionic Solids


As we learned in Chapter 9, the cohesive forces in ionic molecules originate 
from the electrostatic attraction between a closed-shell ion, such as Na⁺, and
another closed-shell ion, such as Cl⁻. Ionic materials can also form solids readily,
because a Na⁺ ion can simultaneously attract many Cl⁻ ions to itself, thereby
building up a solid structure. The ions are held together by electrostatic forces, 
so we might suppose that the more negative ions there are around a positive ion, 
the more stable and strong the solid will be. (Covalent bonds, on the other hand, 
involve specific electron wave functions and so are limited in the number of near 
neighbors that can participate in the bonding.) 

Ionic solids are crystalline, rather than amorphous, because we can pack ions 
together more efficiently in a regular array than in a random arrangement. (The 
same is true for bricks: In a regular array, there are more bricks per unit volume 
than in a random pile, and their average separation is smaller.) 

The simplest type of crystal lattice is the cubic lattice, in which we imagine the 
atoms to be placed at the corners of a succession of cubes that cover the volume 
of the crystal. Figure 11.1 shows the basic cubic structure. This type of stacking 
is not the most efficient, because there are large gaps at the center of each face 
of the cube, and also in the middle of the cube itself. We get a better stacking 
arrangement, which has more atoms per unit volume, if we place another atom 
either at the center of each face of the cube or at the center of the body of the 
cube. These two lattices are known as face-centered cubic (fcc) and body-centered 
cubic (bcc) and are illustrated in Figures 11.2 and 11.3.

The fcc lattice gives a slightly more efficient packing (more atoms per unit 
volume) and so it is usually the most stable structure. However, atoms do not stack 
like hard spheres, and often the bcc structure is preferred. These two crystal types, 
fcc and bcc, also occur for materials other than ionic solids, such as certain metals.
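
The packing comparison can be made concrete by treating the atoms as identical hard spheres that touch their nearest neighbors, which real atoms only approximate. The following Python sketch computes the fraction of the cube volume filled by spheres for each of the three cubic lattices; the function name and the choice of a unit cube edge are illustrative assumptions, not taken from the text.

```python
import math

def packing_fraction(atoms_per_cell, nn_distance_over_a):
    """Fraction of a cubic cell (edge a = 1) filled by touching hard spheres.

    atoms_per_cell      -- number of whole atoms belonging to one cube
    nn_distance_over_a  -- nearest-neighbor distance in units of the cube edge
    """
    radius = nn_distance_over_a / 2.0          # touching spheres: r = d/2
    return atoms_per_cell * (4.0 / 3.0) * math.pi * radius**3

# (atoms per cube, nearest-neighbor distance / cube edge)
lattices = {
    "simple cubic": (1, 1.0),                  # neighbors along a cube edge
    "bcc":          (2, math.sqrt(3) / 2),     # corner to body center
    "fcc":          (4, math.sqrt(2) / 2),     # corner to face center
}

fractions = {name: packing_fraction(*args) for name, args in lattices.items()}
for name, f in fractions.items():
    print(f"{name:12s} packing fraction = {f:.3f}")
```

In this hard-sphere picture the filled fractions come out near 0.52, 0.68, and 0.74, so fcc is only slightly more efficient than bcc, and both are well ahead of the simple cubic lattice.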

A common material that has the fcc lattice structure is NaCl, and for that reason 
the fcc lattice is often called the NaCl structure. In order to have the atoms attract
one another, we must alternate Na⁺ and Cl⁻ ions, as is shown in Figure 11.4.


FIGURE 11.1 The simple cubic crystal. The atoms are shown as small spheres for
clarity; in an actual solid, the atoms should be imagined as spheres in contact
in this cubic geometry.

FIGURE 11.2 The face-centered cubic structure.

FIGURE 11.3 The body-centered cubic structure.

FIGURE 11.4 Packing of Na (small spheres) and Cl (large spheres) in an
fcc crystal of NaCl.

FIGURE 11.5 Packing of Cs (large spheres) and Cl (small spheres) in a
bcc crystal of CsCl.


In this illustration you can see how the atoms pack together just like hard spheres 
in contact. Notice that a given Na⁺ ion is attracted by 6 close Cl⁻ neighbors, and
does not “belong to” any single Cl⁻ ion. It is therefore wrong to consider ionic
solids as being composed of molecules. 

A typical bcc structure is CsCl, as shown in Figure 11.5, and so the bcc lattice
is often known as the CsCl structure. In this case each ion is surrounded by 8 
neighbors of the opposite charge. 

Each Na⁺ ion in the NaCl structure is surrounded at a distance R by 6 Cl⁻ ions
exerting attractive electrostatic forces. At the slightly larger distance of R√2 from
each Na⁺ ion are 12 Na⁺ ions exerting repulsive forces, and at the still greater
distance of R√3 there are 8 Cl⁻ ions exerting attractive forces. To find the total
Coulomb potential energy U_C, we can continue in this way to add the alternating
attractive and repulsive contributions:

$$U_C = -\frac{e^2}{4\pi\epsilon_0}\left(\frac{6}{R} - \frac{12}{R\sqrt{2}} + \frac{8}{R\sqrt{3}} - \cdots\right) = -\frac{e^2}{4\pi\epsilon_0}\frac{1}{R}\left(6 - \frac{12}{\sqrt{2}} + \frac{8}{\sqrt{3}} - \cdots\right) = -\alpha\frac{e^2}{4\pi\epsilon_0}\frac{1}{R} \tag{11.1}$$

where α, called the Madelung constant, is the factor in parentheses in Eq. 11.1:

$$\alpha = 6 - \frac{12}{\sqrt{2}} + \frac{8}{\sqrt{3}} - \cdots \tag{11.2}$$

This quantity depends only on the geometry of the lattice and is evaluated by
summing the slowly converging series of alternating positive and negative terms.
The result is

α = 1.7476 (fcc or NaCl lattice)

For the bcc lattice, a similar calculation gives

α = 1.7627 (bcc or CsCl lattice)
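
Because the shell-by-shell series converges slowly, a direct numerical check of the fcc value needs some care. The sketch below uses Evjen's method, a technique that is not described in the text: it sums over cubes of ions centered on one ion and gives the ions on the faces, edges, and corners of the cube weights of 1/2, 1/4, and 1/8, so that each cube is electrically neutral. The function name and the cube sizes are illustrative choices.

```python
import math

def madelung_nacl(half_width):
    """Evjen-weighted Madelung sum for the NaCl lattice.

    Distances are in units of the nearest-neighbor spacing R. Ions sit at
    integer points (i, j, k); the sign of the charge alternates with i + j + k.
    """
    n = half_width
    alpha = 0.0
    for i in range(-n, n + 1):
        for j in range(-n, n + 1):
            for k in range(-n, n + 1):
                if i == 0 and j == 0 and k == 0:
                    continue                      # skip the central ion
                # Ions on the cube surface are shared with neighboring cubes.
                weight = 1.0
                for c in (i, j, k):
                    if abs(c) == n:
                        weight *= 0.5
                # Opposite charges (i + j + k odd) contribute positively.
                sign = 1.0 if (i + j + k) % 2 else -1.0
                alpha += sign * weight / math.sqrt(i * i + j * j + k * k)
    return alpha

for n in (1, 2, 4, 8):
    print(n, round(madelung_nacl(n), 4))
```

The smallest cube reproduces the three terms written out in Eq. 11.2 (each with its Evjen weight), and larger cubes settle toward the quoted fcc value. The same weighting is not assumed here to work for the CsCl lattice.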


As in the case of ionic molecules, the net attractive electrostatic force is opposed 
by a repulsive force due to the Pauli principle, which keeps the filled subshells 
from overlapping. The repulsive potential energy can be approximated as 


$$U_R = AR^{-n} \tag{11.3}$$


where A gives the strength of the potential energy and n determines how rapidly 
it increases at small R. For most ionic crystals, n is in the range 8 to 10. The total
potential energy of an ion in the lattice is the sum of the Coulomb and repulsive
potential energies:

$$U = U_C + U_R = -\alpha\frac{e^2}{4\pi\epsilon_0}\frac{1}{R} + \frac{A}{R^n} \tag{11.4}$$

The energies are illustrated in Figure 11.6. There is a stable minimum in the
energy, which determines both the equilibrium separation R_0 and the binding
energy. To find this minimum, we set dU/dR to zero, which gives

$$A = \frac{\alpha e^2 R_0^{n-1}}{4\pi\epsilon_0 n} \tag{11.5}$$
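
The step from Eq. 11.4 to Eq. 11.5 can be written out explicitly. This is only a sketch of the differentiation, using the symbols already defined and evaluating the derivative at the equilibrium separation R_0:

$$\frac{dU}{dR} = \alpha\frac{e^2}{4\pi\epsilon_0}\frac{1}{R^2} - \frac{nA}{R^{n+1}}, \qquad \left.\frac{dU}{dR}\right|_{R=R_0} = 0 \;\Longrightarrow\; \alpha\frac{e^2}{4\pi\epsilon_0}\frac{1}{R_0^2} = \frac{nA}{R_0^{n+1}} \;\Longrightarrow\; A = \frac{\alpha e^2 R_0^{n-1}}{4\pi\epsilon_0 n}$$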





TABLE 11.1 Properties of Ionic Crystals

Crystal   Nearest-Neighbor Separation (nm)   Cohesive Energy (kJ/mol)   n      Structure
LiF       0.201                              1030                       6      fcc
LiCl      0.257                              834                        7      fcc
NaCl      0.281                              769                        9      fcc
NaI       0.324                              682                        9.5    fcc
KCl       0.315                              701                        9      fcc
KBr       0.330                              671                        9.5    fcc
RbF       0.282                              774                        8.5    fcc
RbCl      0.329                              680                        9.5    fcc
CsCl      0.356                              657                        10.5   bcc
CsI       0.395                              600                        12     bcc
MgO       0.210                              3795                       7      fcc
BaO       0.275                              3029                       9.5    fcc

The binding energy B of an ion in the crystal is the depth of the energy well at
R = R_0, the equilibrium separation between nearest-neighbor ions. Substituting
Eq. 11.5 into Eq. 11.4 and evaluating the resulting equation at R = R_0, we obtain

$$B = -U(R_0) = \frac{\alpha e^2}{4\pi\epsilon_0 R_0}\left(1 - \frac{1}{n}\right) \tag{11.6}$$

From thermodynamic measurements, it is possible to determine the bulk
cohesive energy of a solid. In effect, the cohesive energy of an ionic solid is
defined as the energy necessary to dismantle the solid into individual ions. Some
measured values of the cohesive energies and nearest-neighbor spacings are given
in Table 11.1. The value of the exponent n is determined from compressibility
data.

The cohesive energy of a bulk sample can be calculated by multiplying the
binding energy for a single ion, determined from Eq. 11.6, by the number of ions
in the sample, except that such a calculation would count each ion twice.* In one
mole of an ionic solid, there are Avogadro’s number N_A of positive ions and also
N_A negative ions, for a total of 2N_A ions per mole. The relationship between the
molar cohesive energy E_coh and the ionic binding energy B is then

$$E_{\rm coh} = \tfrac{1}{2}(B)(2N_A) = BN_A \tag{11.7}$$

where the factor of 1/2 corrects for the problem of double counting of the ions.
The following examples illustrate the relationship between cohesive energy of
the bulk solid and the binding energy per ion pair.

*Consider ions A and B. If we use Eq. 11.6 to compute the binding energy of ion A, the result includes
the interaction of ion A with all the ions of the solid, including ion B. Similarly, the binding energy of
ion B calculated from Eq. 11.6 includes the interaction of B with A. If we calculated the total binding
energy of the solid by adding together the binding energies of all ions A and B, we would be including
the interaction between A and B twice.

FIGURE 11.6 Contributions to the energy of an ionic crystal. Numerical
values are for NaCl.


Example 11.1

(a) Determine the experimental value of the binding energy
of an ion pair in the NaCl lattice from the cohesive energy.
(b) Find the expected value of the binding energy based on
the lattice parameters.

Solution
(a) From Eq. 11.7, we have

$$B = \frac{E_{\rm coh}}{N_A} = \frac{769 \times 10^3\ \text{J/mol}}{(6.02 \times 10^{23}\ \text{ions/mol})(1.60 \times 10^{-19}\ \text{J/eV})} = 7.98\ \text{eV}$$

(b) The calculated value of the ionic binding energy is
obtained from Eq. 11.6:

$$B = \frac{\alpha e^2}{4\pi\epsilon_0 R_0}\left(1 - \frac{1}{n}\right) = \frac{(1.7476)(1.44\ \text{eV}\cdot\text{nm})}{0.281\ \text{nm}}(0.889) = 7.96\ \text{eV}$$

The agreement between the experimental and calculated
values is very good.
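
The same two calculations can be repeated for other fcc crystals in Table 11.1. The Python sketch below is only an illustration of Eqs. 11.6 and 11.7; the function names are invented for this example, the constants are the rounded values used in Example 11.1, and only three table rows are included.

```python
E2_OVER_4PI_EPS0 = 1.44      # e^2 / (4 pi epsilon_0) in eV nm, as used in Example 11.1
ALPHA_FCC = 1.7476           # Madelung constant for the fcc (NaCl) lattice
AVOGADRO = 6.02e23           # ions per mole
JOULES_PER_EV = 1.60e-19

def binding_energy_calculated(r0_nm, n, alpha=ALPHA_FCC):
    """Eq. 11.6: binding energy B in eV from the lattice parameters."""
    return alpha * E2_OVER_4PI_EPS0 / r0_nm * (1.0 - 1.0 / n)

def binding_energy_measured(cohesive_kj_per_mol):
    """Eq. 11.7: B = E_coh / N_A, converted from kJ/mol to eV."""
    return cohesive_kj_per_mol * 1e3 / (AVOGADRO * JOULES_PER_EV)

# (R0 in nm, cohesive energy in kJ/mol, n) taken from Table 11.1
crystals = {"LiF": (0.201, 1030, 6), "NaCl": (0.281, 769, 9), "KCl": (0.315, 701, 9)}
for name, (r0, e_coh, n) in crystals.items():
    calc = binding_energy_calculated(r0, n)
    meas = binding_energy_measured(e_coh)
    print(f"{name}: calculated {calc:.2f} eV, measured {meas:.2f} eV")
```

For NaCl this reproduces the 7.96 eV and 7.98 eV of Example 11.1; for the other two crystals the calculated and measured values agree to within a few percent.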





Example 11.2

How much energy per neutral atom would be needed to
take apart a crystal of NaCl?

Solution

If we supply an energy of E_coh to a mole of NaCl, we
obtain N_A Na⁺ ions and N_A Cl⁻ ions. To convert these to
neutral atoms, we must remove an electron from each Cl⁻,
which costs us the electron affinity of Cl (3.61 eV), and
then we must attach that electron to the Na⁺, which returns
the ionization energy of Na (5.14 eV). The net cost per pair
of Na and Cl atoms is

7.98 eV + 3.61 eV − 5.14 eV = 6.45 eV

Expending this much energy gives two neutral atoms (Na
and Cl), so the net cost per atom is half that amount, or
3.23 eV.








FIGURE 11.7 The tetrahedral structure of carbon.


The large cohesive energies of ionic solids such as NaCl give them a common
set of properties: They are hard, with high melting and vaporization temperatures 
(because it takes a lot of thermal energy to break the bonds). They are soluble in 
polar liquids such as water, in which the dipole moment of the water molecule 
can supply the electrostatic force necessary to break the ionic bonds. There are no 
free or valence electrons, so they are poor electrical conductors and not strongly 
magnetic. They are transparent to visible light (because light rays have too little 
energy to excite electrons from the filled shells), but absorb strongly in the infrared 
(corresponding to the vibrational frequencies of the atoms in their lattice sites). 


Covalent Solids 


As we discussed in Chapter 9, carbon forms molecules by covalent bonding of 
its four outer electrons in sp³ hybrid orbitals. Such bonds are highly directional,
and we have seen how it is possible to calculate the angle between the bonds 
based on the symmetry of the bonding configuration. Solid carbon, in the form of 
diamond, is an example of a solid in which the interatomic forces are also of a 
covalent nature. As in a molecule, the four equivalent sp³ hybrid states participate
in covalent bonds, and because they are equivalent they must make equal angles
with one another. The manner in which this is done is shown in Figure 11.7. A
central carbon atom is covalently bound to four other carbons that occupy four
corners of a cube as shown. The angle between the bonds is 109.5°, as it was in
the covalently bonded molecules.

TABLE 11.2 Some Covalent Solids

Crystal       Nearest-Neighbor Distance (nm)   Cohesive Energy (kJ/mol)
ZnS           0.235                            609
C (diamond)   0.154                            710
Si            0.234                            447
Ge            0.244                            372
Sn            0.280                            303
CuCl          0.236                            921
GaSb          0.265                            580
InAs          0.262                            549
SiC           0.189                            1185

Figure 11.8 illustrates how the solid structure characteristic of diamond is 
constructed of such bonds. Each carbon has four close neighbors with which it 
shares electrons in covalent bonds. The basic structure is known as tetrahedral, 
and many compounds have a similar structure as a result of covalent bonding. 
Table 11.2 shows some of these compounds. The cohesive energy is the energy 
required to dismantle the solid into individual atoms. The structure is also known 
as the zinc sulfide or zinc blende structure. 

Some of the covalent solids listed in Table 11.2 have bond energies larger 
than those of ionic solids. Substances such as diamond and silicon carbide are 
particularly hard. Other covalent solids with structures similar to carbon are silicon 
and germanium; the structure of these solids is responsible for their behavior as 
semiconductors. 

The covalent solids do not have the same similarity of characteristics that ionic 
solids do, and so we cannot make the same generalizations. Carbon, in the diamond 
structure, has a large bond energy and is therefore very hard and transparent to 
visible light; germanium and tin have similar structures, but are metallic in 
appearance and highly reflective. Carbon (as diamond) has an extremely high 
melting point (4000 K); germanium and tin melt at much lower temperatures 
more characteristic of ordinary metals. Some (like diamond) are extremely poor 
electrical conductors, while others (like Si, Ge, and Sn) can conduct electricity but 
not nearly as well as most metals. Of course, these differences depend on the actual 
bond energy in the solid, which in turn depends on the type of atoms of which the 
solid is made. Those solids with large bond energies are hard, have high melting 
points, are poor electrical and thermal conductors, and are transparent to visible 
light. Those solids with small bond energies may have very different properties. 


Metallic Bonds 


The valence electrons in a metal are usually rather loosely bound, and frequently 
the electronic shells are only partially filled, so that metals tend not to form 
covalent bonds. The basic structure of metals is a “sea” or “gas” of approximately 
free electrons surrounding a lattice of positive ions. The metal is held together by 
the attractive force between each individual metal ion and the electron gas.

FIGURE 11.8 The lattice structure of diamond.

TABLE 11.3 Structure of Metallic Crystals

Metal   Crystal Type   Nearest-Neighbor Distance (nm)   Cohesive Energy (kJ/mol)
Fe      bcc            0.248                            418
Li      bcc            0.304                            158
Na      bcc            0.372                            107
Cu      fcc            0.256                            337
Ag      fcc            0.289                            285
Pb      fcc            0.350                            196
Co      hcp            0.251                            424
Zn      hcp            0.266                            130
Cd      hcp            0.298                            112

FIGURE 11.9 Arrangement of atoms in a hexagonal close-packed crystal.

The most common crystal structures of metallic solids are fcc, bcc, or a third
type known as hexagonal close-packed (hcp). The hcp structure is shown in
Figure 11.9; like the fcc structure, it is a particularly efficient way of packing
atoms together. Some metals and their characteristics are shown in Table 11.3.
The cohesive energy of metal bonds tends to fall in the range 100 to 400 kJ/mol
(1 to 4 eV/atom), making the metals less strongly bound than ionic or covalent
solids. As a result, many metals have relatively low melting points (some below 
a few hundred °C). The relatively free electrons in the metal interact readily with 
photons of visible light, so metals are not transparent. The free electrons are 
responsible for the high electrical and thermal conductivity of metals. Because 
metallic bonds don’t depend on any particular sharing or exchange of electrons 
between specific atoms, the exact nature of the atoms of the metal is not as 
important as it is in the case of ionic or covalent solids; as a result we can make 
many kinds of metallic alloys by mixing together different metals in varying 
proportions. 


Molecular Solids 


None of the solids we have discussed so far can be considered as composed of 
individual molecules. It is, however, possible for molecules to exert forces on one 
another and to bind together in solids. The electrons in a molecule are already 
shared in molecular bonds, so there are no available electrons to participate in 
ionic, covalent, or metallic bonds with other molecules. Moreover, molecules are 
electrically neutral, so there are no Coulomb forces involved. Molecular solids 
are held together by much weaker forces, which generally depend on the electric 
dipole moments of the molecules. Because these forces are much weaker than the 
internal forces that hold a molecule together, a molecule can retain its identity in 
a molecular solid. 

The electric dipole moment of one molecule can exert an attractive force on 
the dipole moment of another. The dipole cohesive force (which is proportional 
to 1/R) in molecular solids is generally weaker than the 1/R² Coulomb force
that is responsible for the cohesive energies of other solids. Molecular solids are 
therefore more weakly bound and have lower melting points than ionic, covalent, 
or metallic solids, because it takes less thermal energy to break the bonds of a 
molecular solid. 


Some molecules (called polar molecules) have permanent electric dipole 
moments consisting of a positive charge on one end of the molecule and an 
equal negative charge on the opposite end. For example, in a water molecule, the 
oxygen atom tends to attract all of the electrons of the molecule and so looks 
like the negative end of the dipole; the two “bare” protons are the positive ends 
of the dipole. The dipole forces between water molecules are responsible for the 
beautiful hexagonal patterns of snowflakes. When bonding of this sort involves 
hydrogen atoms, as it does in water, it is known as hydrogen bonding. 

It is also possible to have dipole forces exerted between atoms or molecules 
that have no permanent dipole moments. Quantum mechanical fluctuations* can 
produce an instantaneous electric dipole moment in one atom, which then induces a 
dipole moment by polarizing a neighboring atom. The result is an attractive dipole- 
dipole force known as the van der Waals force, which is responsible for the bonding 
in certain molecular solids (as well as for such physical effects as surface tension 
and friction). Examples of solids that are bound by the van der Waals force include 
those composed of the inert gases (Ne, Ar, Kr, and Xe), symmetric molecules 
such as CH₄ and GeCl₄, halogens, and other gases such as H₂, N₂, and O₂.

The van der Waals force is extremely weak; it falls off with separation distance 
like R⁻⁷. In inert gas crystals, the nearest-neighbor distance is 0.3 to 0.4 nm, but
the cohesive energies are typically only 10 kJ/mol or 0.1 eV/atom. Solids bound 
by these weak forces have low melting points, because little thermal energy is 
required to break the bonds. In fact, because the induced dipole moment of an atom 
or molecule should be approximately proportional to its total number of electrons, 
we might expect that the melting points of nonpolar molecular solids should be 
roughly proportional to the number of electrons in each molecule. Figure 11.10 
shows this relationship; although the properties of the individual solids cause
considerable scatter of the points, the relationship is roughly as we expect it
should be.

FIGURE 11.10 The melting points of molecular solids depend approximately on
the number of electrons per molecule.

*These fluctuations are too rapid to be observed in the laboratory. Measurements give only the average
value of this fluctuating dipole moment, which is zero.

FIGURE 11.11 Molar heat capacity of Pb (diamonds), Cu (squares), and
Cr (triangles) at temperatures below 100 K. The room temperature values
are shown at the right.


11.2 THE HEAT CAPACITY OF SOLIDS 


Just as we discussed in Chapter 1 for the heat capacity of gases, the heat capacity
of solids provides another example of the breakdown of classical statistical 
mechanics and the need for a more detailed theory based on quantum mechanics. 
(You might find it helpful to review the classical calculation of the heat capacity 
for gases in Section 1.3.) 

Let’s first consider what classical thermodynamics predicts for the heat capacity 
of a solid. In contrast to a gas, an atom in a solid occupies a specific position in 
the lattice, so no translational motion is possible. Thus there are no degrees of 
freedom corresponding to the translational motion. The atom can move only by 
vibrating about its equilibrium position in the lattice. We can imagine the atom 
to behave as if it were connected to all of its closest neighbors by springs. It 
can vibrate in any of the three coordinate directions independently of the other
two: the initial displacements of the springs in the x, y, and z directions can
be chosen independently, and the initial velocities in each direction can be set 
independently of one another. Consequently there are 6 degrees of freedom in this 
situation: 2 degrees of freedom (corresponding to the vibrational potential and
kinetic energies) for each of the three directions. According to the equipartition
theorem, the average energy for each degree of freedom is ½kT, so the average
energy per atom is 6 × ½kT = 3kT. The total internal energy of one mole (N_A
atoms) would then be E_int = 3N_A kT = 3RT (where R = N_A k is the universal gas
constant), and the corresponding molar heat capacity is

$$C = \frac{\Delta E_{\rm int}}{\Delta T} = 3R = 24.9\ \text{J/mol}\cdot\text{K} \tag{11.8}$$

This is the expected value of the molar heat capacity of solids based on classical
statistical mechanics and is known as the law of Dulong and Petit.

How well does this prediction compare with experiment? Table 11.4 shows
some values of the molar heat capacities at room temperature (approximately
300 K) and at 100 K and 25 K for some metallic elements. There is good
agreement with the Dulong and Petit prediction at room temperature, but
poor agreement as the temperature is reduced. (Note that the classical value is
independent of temperature.)

Figure 11.11 shows the temperature dependence of the heat capacities of Pb,
Cu, and Cr at temperatures between 1 K and 100 K. It appears that the heat
capacity approaches 0 at the lowest temperatures. As the temperature is increased,
the heat capacity rises, eventually reaching the Dulong and Petit value at high
enough temperature. However, the rate of increase is very different for these
metals: Pb rises quickly (approaching the Dulong and Petit value by 100 K), Cu
rises more slowly, and Cr rises even more slowly.

Clearly the classical calculation fails to account for the heat capacities of
these solids. One possible resolution of this problem would be to consider the
application of quantum statistics to the electrons in these metals. In Chapter 10,
we discussed the heat capacity of a gas of fermions. We applied the model to
a dilute solution of ³He in ⁴He, but the result can apply equally well to any
system of particles governed by Fermi-Dirac statistics.

TABLE 11.4 Heat Capacities of Common Metals*

        T = 300 K    T = 300 K    T = 100 K    T = 25 K
Metal   kJ/kg·K      J/mole·K     J/mole·K     J/mole·K
Al      0.904        23.4         12.8         0.420
Ag      0.235        24.3         20.0         3.05
Au      0.129        23.4         21.5         5.11
Cr      0.461        23.8         10.0         0.199
Cu      0.387        23.9         16.0         0.971
Fe      0.450        24.6         12.0         0.398
Pb      0.128        24.7         23.8         14.0
Sn      0.222        23.8         22.0         6.80

*The value of the heat capacity depends on the circumstances under which it is
determined. Usually measured values are observed at constant pressure (C_P), while
calculated values are more easily obtained for constant volume (C_V). The first data
column in this table is the experimental specific heat (heat capacity per unit mass) at
constant pressure, and the remaining columns all give the molar heat capacity at constant
volume. In most cases C_P is just a few percent larger than C_V.
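
The comparison with the law of Dulong and Petit can be made explicit by dividing the molar heat capacities of Table 11.4 by 3R. The short Python sketch below does this for four of the metals; the variable names are illustrative, and R = 8.31 J/mole·K is the rounded value used in this chapter.

```python
R_GAS = 8.31                     # universal gas constant, J/mole·K
DULONG_PETIT = 3 * R_GAS         # Eq. 11.8, about 24.9 J/mole·K

# Molar heat capacities (J/mole·K) at 300 K, 100 K, and 25 K from Table 11.4
table = {
    "Al": (23.4, 12.8, 0.420),
    "Ag": (24.3, 20.0, 3.05),
    "Fe": (24.6, 12.0, 0.398),
    "Pb": (24.7, 23.8, 14.0),
}

ratios = {metal: tuple(c / DULONG_PETIT for c in values)
          for metal, values in table.items()}
for metal, (r300, r100, r25) in ratios.items():
    print(f"{metal}: C/3R = {r300:.2f} (300 K), {r100:.2f} (100 K), {r25:.3f} (25 K)")
```

At 300 K every ratio is within a few percent of 1, while at 25 K the ratios differ greatly from metal to metal, which the temperature-independent classical value cannot reproduce.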


We can treat the electrons in a metal as a Fermi gas. In deriving Eq. 10.63
for the heat capacity, the only assumption we made was that kT ≪ E_F. For most
metals, E_F is a few eV and even at room temperature kT is only 0.025 eV, so the
approximation should be pretty good. Let’s rewrite Eq. 10.63 for one mole of a
substance (N = N_A) as follows:

$$C = \frac{\pi^2 k^2 N_A T}{2E_F} = \frac{\pi^2}{2}\frac{RkT}{E_F} \tag{11.9}$$

where R = 8.31 J/mole·K. Equation 11.9 is written as if each atom of the lattice
contributes one electron to the electron gas, so that N (the number of electrons) is
equal to N_A (for one mole of atoms). If, for example, the metal had a valence of
2, then we would have N = 2N_A.

For copper E_F = 7.03 eV and Eq. 11.9 gives C = 0.146 J/mole·K at room
temperature. This value is far smaller than the experimental value, indicating that 
the electrons provide only a small contribution to the heat capacity, at least at 
room temperature. So the correct explanation for the behavior of the heat capacity 
must lie elsewhere than the electrons. 
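
The size of the electronic contribution is easy to check numerically. The following Python sketch evaluates Eq. 11.9 for copper using kT = 0.025 eV, the room-temperature value quoted above; the function name and the `valence` argument (which simply scales N as described after Eq. 11.9) are illustrative assumptions.

```python
import math

R_GAS = 8.31          # J/mole·K, as quoted with Eq. 11.9
KT_ROOM_EV = 0.025    # kT at room temperature in eV, the value quoted in the text

def electronic_heat_capacity(fermi_energy_ev, kt_ev=KT_ROOM_EV, valence=1):
    """Eq. 11.9 for one mole of atoms, each contributing `valence` electrons."""
    return valence * (math.pi**2 / 2.0) * R_GAS * kt_ev / fermi_energy_ev

c_cu = electronic_heat_capacity(7.03)      # copper, E_F = 7.03 eV
print(f"C_electrons = {c_cu:.3f} J/mole·K, or {c_cu / (3 * R_GAS):.1%} of 3R")
```

The result is less than 1% of the Dulong and Petit value, which is why the electrons cannot account for the measured heat capacity at room temperature.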


Einstein Theory of Heat Capacity 


In an ordinary solid, most of the physical properties originate either with the 
valence electrons or with the latticework of atoms. Electrical conductivity, 
for example, originates with the valence electrons, while the propagation of 
mechanical waves is due to the lattice of atoms. The heat capacities of solids have 
contributions from both lattice and conduction electrons; at all but the lowest 
temperatures, the lattice contribution is dominant. 

The explanation for the failure of classical physics to account for the heat
capacity of solids was first given by Einstein, who assumed that the oscillations
(not the atoms) of the solid obeyed Bose-Einstein statistics. Just as electromagnetic
waves are analyzed as “particles” (quanta of electromagnetic energy, or photons)
that obey Bose-Einstein statistics, so are mechanical or acoustic waves analyzed
as “particles” (quanta of vibrational energy, called phonons) that also obey
Bose-Einstein statistics. Einstein made the simplifying assumption that all of the
phonons (oscillations) have the same frequency.

FIGURE 11.12 Molar heat capacity of Cu. The solid curve gives the temperature
dependence expected according to the Einstein theory (Eq. 11.11).

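We have seen in Chapter 5 that a quantized oscillator has an energy of
$\hbar\omega(n + \tfrac{1}{2})$. Each additional value of n represents an additional
phonon; to go from a vibrational energy of $\hbar\omega(n + \tfrac{1}{2})$ to
$\hbar\omega(n + \tfrac{3}{2})$ we must “create” a phonon of energy $\hbar\omega$.
One mole of the solid contains $N_A$ atoms and thus $3N_A$ oscillators (each atom can
vibrate independently in three directions). The density of states (number of states
per unit volume) is thus $3N_A/V$, and the integral of Eq. 10.7 is evaluated only at
the single energy $E = \hbar\omega$ (because all phonons have energy $\hbar\omega$).
Using the Bose-Einstein distribution, the number of phonons is then
$N = 3N_A/(e^{\hbar\omega/kT} - 1)$, and the total internal energy of the solid is the
number of phonons times the energy of each phonon:

$$E_{int} = N\hbar\omega = \frac{3N_A\hbar\omega}{e^{\hbar\omega/kT} - 1} \qquad (11.10)$$

The heat capacity can be found from $dE_{int}/dT$:

$$C = \frac{dE_{int}}{dT} = 3N_A\hbar\omega\,\frac{e^{\hbar\omega/kT}\,(\hbar\omega/kT^2)}{(e^{\hbar\omega/kT} - 1)^2} = 3R\left(\frac{\hbar\omega}{kT}\right)^2\frac{e^{\hbar\omega/kT}}{(e^{\hbar\omega/kT} - 1)^2} = 3R\left(\frac{T_E}{T}\right)^2\frac{e^{T_E/T}}{(e^{T_E/T} - 1)^2} \qquad (11.11)$$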


where we have replaced $\hbar\omega/k$ with the parameter $T_E$, called the Einstein
temperature. The vibrational energy $\hbar\omega$ (or the Einstein temperature $T_E$)
is an adjustable parameter of the theory and takes different values for different
materials. Typically, $T_E$ is of the order of several hundred kelvins.

When T is small, the exponential term in the denominator dominates, and
$C \propto e^{-T_E/T}$, so indeed C approaches 0 for small T, in agreement with experiment.
Figure 11.12 shows the molar heat capacity of Cu compared with the behavior
predicted by Eq. 11.11, with $T_E = 225$ K giving the best fit to the data. As you
can see, the agreement is reasonably good. However, even though the shape of
the theoretical curve matches the overall trend of the data, it fails to account
for the behavior at the lowest temperatures (the data approach zero
more slowly than the theory predicts).
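
The limiting behavior of Eq. 11.11 can be checked numerically. The sketch below is only an illustration: the function name is made up for this example, and the expression is rewritten in terms of $e^{-T_E/T}$ (algebraically equivalent to Eq. 11.11) so that very low temperatures do not overflow the exponential.

```python
import math

R = 8.31  # molar gas constant, J/(mole K)


def einstein_heat_capacity(temperature_k, einstein_temp_k):
    """Molar heat capacity from Eq. 11.11, in J/(mole K)."""
    x = einstein_temp_k / temperature_k
    # e^x / (e^x - 1)^2 equals e^(-x) / (1 - e^(-x))^2; the second form
    # stays finite when x is large (low temperature).
    em = math.exp(-x)
    return 3 * R * x ** 2 * em / (1 - em) ** 2


T_E_CU = 225.0  # best-fit Einstein temperature for Cu quoted in the text, K

for t in (10, 25, 100, 300, 1000):
    c = einstein_heat_capacity(t, T_E_CU)
    print(f"T = {t:5d} K   C = {c:10.4e} J/mole-K   C/3R = {c / (3 * R):.4f}")
```

At high temperature the ratio C/3R tends to 1, and at low temperature it collapses exponentially, which is faster than the measured data fall off.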

In this calculation we have oversimplified by assuming all of the oscillations 
to have the same frequency. A better calculation, which was first done in 1912 
by Peter Debye*, assumes a distribution of frequencies with a density of states 
given by an expression of the same form as that for the “photon gas” of blackbody 
radiation; the predicted low temperature behavior is then 


$$C = \frac{12\pi^4}{5}\,R\left(\frac{T}{T_D}\right)^3 \qquad (11.12)$$

where $T_D$ is a parameter of the theory known as the Debye temperature, which is
different for different materials.


*Peter Debye (1884-1966) was born in the Netherlands but spent most of his academic career in
German universities (where at one point he served as Schrödinger’s professor) and finally moved to
the U.S. in 1940. He is perhaps best known for his analysis of X-ray diffraction patterns (such as
Figure 3.8), for which he received the 1936 Nobel Prize in chemistry.
At the lowest temperatures, we therefore can identify two terms in the heat
capacity: a term due to the electrons, which is linear in the temperature (Eq. 11.9),
and another term due to the lattice vibrations of the atoms, which is proportional to
$T^3$. Combining these two terms, we then expect the low-temperature heat capacity
to be of the form $C = aT + bT^3$, where a is the coefficient of T in Eq. 11.9 and
b is the coefficient of $T^3$ in Eq. 11.12. As T approaches 0, the $T^3$ term drops off
more rapidly than the linear term, so at the very lowest temperatures we expect
the electrons to have a more significant contribution. We can turn this equation
into a linear graph and identify both contributions by writing $C/T = a + bT^2$ and
plotting C/T as a function of $T^2$, which should give a straight line of slope b and
y-intercept a. Figure 11.13 shows the results for copper. The data do indeed fall
on a straight line, in excellent agreement with the Debye theory. If the electronic
part of the heat capacity were not present (that is, if there were only the lattice
contribution from Eq. 11.12), then the line would go through the origin and the
intercept would be zero. So the intercept tells us about the electronic contribution
to the heat capacity. The slope of the line tells us about the lattice contribution
and depends on the Debye temperature.


FIGURE 11.13 Molar heat capacity for Cu, plotted as $C/T$ against $T^2$. The
slope gives the lattice contribution and the intercept gives the contribution of
the electrons.


Example 11.3

(a) From the slope of the line in Figure 11.13, determine the Debye temperature
of copper. (b) Using the Fermi energy of copper (7.03 eV), determine the expected
value of the intercept.

Solution

(a) The slope b is equal to the coefficient of $T^3$ in Eq. 11.12,
so $b = 12\pi^4 R/5T_D^3$, and

$$T_D = \left(\frac{12\pi^4 R}{5b}\right)^{1/3} = \left[\frac{12\pi^4\,(8.31\ \text{J/mole}\cdot\text{K})}{5\,(4.80\times10^{-5}\ \text{J/mole}\cdot\text{K}^4)}\right]^{1/3} = 343\ \text{K}$$

(b) The intercept a is the coefficient of T in the electronic contribution to the
heat capacity (Eq. 11.9):

$$a = \frac{\pi^2 kR}{2E_F} = \frac{\pi^2\,(8.617\times10^{-5}\ \text{eV/K})(8.31\ \text{J/mole}\cdot\text{K})}{2\,(7.03\ \text{eV})} = 5.03\times10^{-4}\ \text{J/mole}\cdot\text{K}^2$$
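
The arithmetic of Example 11.3 can be reproduced with a few lines of Python. This is a minimal sketch: the helper names are invented for illustration, and the slope value is the one read from Figure 11.13 in the example.

```python
import math

R = 8.31          # molar gas constant, J/(mole K)
K_EV = 8.617e-5   # Boltzmann constant, eV/K


def debye_temperature_from_slope(slope_b):
    """Invert b = 12 pi^4 R / (5 T_D^3) (Eq. 11.12) for T_D in kelvins.

    slope_b is the slope of C/T versus T^2, in J/(mole K^4).
    """
    return (12 * math.pi ** 4 * R / (5 * slope_b)) ** (1 / 3)


def electronic_intercept(fermi_energy_ev):
    """Coefficient a = pi^2 k R / (2 E_F) of T in Eq. 11.9, in J/(mole K^2)."""
    return math.pi ** 2 * K_EV * R / (2 * fermi_energy_ev)


t_debye = debye_temperature_from_slope(4.80e-5)
intercept = electronic_intercept(7.03)
print(f"T_D = {t_debye:.0f} K")
print(f"a   = {intercept:.3e} J/mole-K^2")
```

Using k in eV/K and the Fermi energy in eV keeps the ratio k/E_F in units of 1/K, so the intercept inherits the units of R divided by kelvins.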





The expected value of the intercept based on the free-electron model doesn’t quite
agree with the experimental value from Figure 11.13 for exactly the same reason
that our analysis of the $^3$He data in Chapter 10 didn’t agree with the predictions:
an electron moving through a copper lattice doesn’t behave as a free electron
would in an electron gas. We can account for the forces exerted by the lattice on
the electrons by assigning the electrons an “effective mass” that is larger than the
mass of a free electron. The additional mass accounts for the “sluggish” behavior
of the electrons in moving through the lattice. The mass enters the calculation
through the Fermi energy (Eq. 10.50); making the mass larger results in a smaller
value of $E_F$ and thus a larger value of the intercept a. For copper, the effective
mass of the electrons is about 1.4 times their free mass.

The Debye theory also gives good agreement with the experimental data at
higher temperatures. Except at room temperature, the data of Table 11.4 seem to
have little in common. At 100 K the heat capacities vary by more than a factor of 2,
and they vary by two orders of magnitude at 25 K. In the Debye theory, the heat
capacity for any substance can be written as a function of $T/T_D$. If we plot the heat
capacities of the eight metals listed in Table 11.4 against $T/T_D$, the large variation
among the values for different metals disappears, as shown in Figure 11.14.
The data for all substances fall along the same curve calculated from the Debye
theory. Understanding these widely different materials on a common basis is a
great triumph for the quantum theory and for the application of Bose-Einstein
statistics.

FIGURE 11.14 The heat capacity for eight different metals plotted against
$T/T_D$. All values fall along the same curve calculated from the Debye theory.

FIGURE 11.15 (a) The density of states factor for electrons (Eq. 10.12).
(b) The Fermi-Dirac distribution function (Eq. 10.34). (c) The number of
occupied states per unit energy interval, determined from the product of
(a) and (b).
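
To see why a single curve can describe every metal, it helps to evaluate the Debye heat capacity as a function of $T/T_D$ alone. The sketch below assumes the standard full Debye expression, $C = 9R\,(T/T_D)^3 \int_0^{T_D/T} x^4 e^x/(e^x - 1)^2\,dx$, which is not written out in this section; only its low-temperature limit (Eq. 11.12) appears above. The function name and the simple midpoint-rule integration are illustrative choices.

```python
import math

R = 8.31  # molar gas constant, J/(mole K)


def debye_heat_capacity(t_over_td, steps=2000):
    """Molar Debye heat capacity in J/(mole K) as a function of T/T_D.

    Assumes the standard Debye integral (see lead-in); integrates with
    the midpoint rule, which avoids evaluating the integrand at x = 0.
    """
    upper = 1.0 / t_over_td
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        em = math.exp(-x)
        # x^4 e^x / (e^x - 1)^2 rewritten to avoid overflow at large x.
        total += x ** 4 * em / (1 - em) ** 2
    return 9 * R * t_over_td ** 3 * total * h


def debye_low_t_limit(t_over_td):
    """Low-temperature form, Eq. 11.12."""
    return (12 * math.pi ** 4 / 5) * R * t_over_td ** 3


for ratio in (0.02, 0.1, 0.3, 1.0, 3.0):
    c = debye_heat_capacity(ratio)
    print(f"T/T_D = {ratio:4.2f}   C/3R = {c / (3 * R):.4f}")
```

Because the only input is the ratio $T/T_D$, two metals with different Debye temperatures land on the same curve once each temperature is scaled by its own $T_D$.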


11.3 ELECTRONS IN METALS 


In metals, each atom contributes one or more loosely bound electrons to an 
“electron gas” of nearly free electrons that can easily move throughout the metal. 
In analogy with an ordinary molecular gas, these electrons move freely and 
experience forces only when they scatter from the ion cores in the lattice. For 
now, we’ll assume the distribution of occupied electron states is determined by 
the density of states for the electron gas and the Fermi-Dirac distribution function. 
In the next section, we will see that a more detailed analysis of the properties 
of the interaction of the electrons with the atoms of the lattice forbids certain 
ranges of energy values, but we’ll ignore that effect for this discussion. With these
assumptions, we can use the electron gas model to study many of the properties 
of metals, such as electrical conduction, heat capacity, and heat conduction. 

Figure 11.15 reviews the main details of the Fermi-Dirac energy distribution,
which we discussed in Chapter 10, as it might be applied to electrons in metals.
The distribution of occupied electron states is determined by the product of the
density of states factor, Eq. 10.12, and the Fermi-Dirac distribution function, Eq.
10.34. At T = 0, all states above the Fermi energy $E_F$ are empty and all states
below $E_F$ are occupied. For temperatures greater than 0, $E_F$ identifies the point
at which the Fermi-Dirac factor has the value 1/2. The difference between N(E)
at T = 0 and at room temperature was illustrated in Figure 10.22; only a small
number of electrons near $E_F$ are affected by the temperature change.

We calculated the Fermi energy at T = 0 by using Eq. 10.49, in which the
integral of N(E) over all energies gives the total number of electrons N. The same
procedure can be used to find $E_F$ at any temperature:


$$N = \int_0^\infty N(E)\,dE = \frac{8\sqrt{2}\,\pi V m^{3/2}}{h^3}\int_0^\infty \frac{E^{1/2}\,dE}{e^{(E - E_F)/kT} + 1} \qquad (11.13)$$

In principle, we can evaluate the integral and solve for $E_F$, as we did to obtain
Eq. 10.50. However, the integral cannot be evaluated in closed form. The solution
can be approximated as

$$E_F(T) \approx E_F(0)\left[1 - \frac{\pi^2}{12}\left(\frac{kT}{E_F(0)}\right)^2\right] \qquad (11.14)$$

Here $E_F(T)$ represents the Fermi energy at temperature T and $E_F(0)$ represents
the Fermi energy at T = 0 (Eq. 10.50). At room temperature, kT = 0.025 eV, and
for most metals the Fermi energy is a few eV, so the change in the Fermi energy
between 0 K and room temperature is only about 1 part in $10^4$. We can therefore
regard the Fermi energy as a constant for our applications, and we will represent
it simply as $E_F$. Table 11.5 shows the Fermi energies of some metals.

TABLE 11.5 Fermi Energies of Some Metals

Metal    E_F (eV)
Ag       258
Au       S52)
Ba       3.65
Ca       4.72
Cs       1.52
Cu       7.03
Li       4.70
Mg       il
Na       3.15


Electrical Conduction

When an electric field $\vec{E}$ is applied to a metal, a current flows in the direction of
the field. The flow of charges is described in terms of a current density $\vec{j}$, the
current per unit cross-sectional area. In an ordinary metal, the current density is
proportional to the applied electric field:

$$\vec{j} = \sigma\vec{E} \qquad (11.15)$$

where the proportionality constant $\sigma$ is the electrical conductivity of the material.
We would like to understand the conductivity in terms of the properties of the
metal.

The free electrons in our electron gas experience a force $\vec{F} = -e\vec{E}$ and a
corresponding acceleration $-e\vec{E}/m$. We observe that in a conductor the current
is constant in time, so the increase in velocity from the electric field must be
opposed, in this case by collisions with the lattice. This model of conduction in
metals views the electrons as accelerated by the field only for short intervals,
following which they are slowed by collisions. The net result is that the electrons
acquire on the average a steady drift velocity $\vec{v}_d$, given by the acceleration times
the average time $\tau$ between collisions:

$$\vec{v}_d = -\frac{e\vec{E}}{m}\,\tau \qquad (11.16)$$

The magnitude of the current density is determined by the number of charge 
carriers and their average speed: 


$$\vec{j} = -ne\vec{v}_d \qquad (11.17)$$


where $n$ is the density of electrons available for conduction. Substituting for the
drift velocity, we obtain

$$\vec{j} = \frac{ne^2\tau}{m}\,\vec{E} \qquad (11.18)$$

and the conductivity is therefore

$$\sigma = \frac{ne^2\tau}{m} \qquad (11.19)$$

The unknown factor in Eq. 11.19 is the time between collisions, which we can
express as

$$\tau = \frac{l}{v_{av}} \qquad (11.20)$$

where $l$ is the mean free path of the electrons, the average distance the electrons
travel between collisions, and $v_{av}$ is their average speed through the lattice. (Note
that this speed is not the same as the drift speed, which is the small increment of
speed that comes about from applying the electric field.)

Let’s see how this theory compares with experimentally observed 
conductivities. 
Example 11.4

Assuming the average distance traveled between collisions is roughly the distance
between atoms, estimate the electrical conductivity of copper at room temperature.

Solution

The nearest-neighbor spacing between atoms in a copper lattice is 0.256 nm. If we
treat the electron gas semiclassically, the average kinetic energy of an electron is
$\tfrac{3}{2}kT$, and so the average speed of an electron at room temperature
(kT = 0.0252 eV) is

$$v_{av} = \sqrt{\frac{3kT}{m}} = \sqrt{\frac{3\,(0.0252\ \text{eV})}{0.511\times10^{6}\ \text{eV}/c^2}} = 1.15\times10^{5}\ \text{m/s}$$

The average time between collisions is then

$$\tau = \frac{l}{v_{av}} = \frac{0.256\times10^{-9}\ \text{m}}{1.15\times10^{5}\ \text{m/s}} = 2.22\times10^{-15}\ \text{s}$$

The density of copper atoms is

$$n = \frac{\rho N_A}{M} = \frac{(8.96\times10^{3}\ \text{kg/m}^3)(6.02\times10^{23}\ \text{atoms/mole})}{0.0635\ \text{kg/mole}} = 8.49\times10^{28}\ \text{atoms/m}^3$$

and the conductivity is

$$\sigma = \frac{ne^2\tau}{m} = \frac{(8.49\times10^{28}\ \text{m}^{-3})(1.60\times10^{-19}\ \text{C})^2(2.22\times10^{-15}\ \text{s})}{9.11\times10^{-31}\ \text{kg}} = 5.30\times10^{6}\ \Omega^{-1}\,\text{m}^{-1}$$


The measured conductivity of copper at room temperature is
$5.96\times10^{7}\ \Omega^{-1}\,\text{m}^{-1}$, so our calculation is off by more than an order of
magnitude. Moreover, the temperature dependence is wrong: this calculation
predicts that the conductivity should decrease as $T^{-1/2}$ in this temperature region,
while the measured conductivity decreases like $T^{-1}$.
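
The chain of steps in Example 11.4 can be retraced in a short Python sketch. All inputs are the values quoted in the example; the constant and variable names are illustrative, and one conduction electron per copper atom is assumed, as in the example.

```python
import math

C_LIGHT = 3.00e8        # speed of light, m/s
M_E_C2_EV = 0.511e6     # electron rest energy, eV
M_E = 9.11e-31          # electron mass, kg
E_CHARGE = 1.60e-19     # elementary charge, C
N_AVOGADRO = 6.02e23    # atoms per mole

KT_EV = 0.0252          # kT at room temperature, eV
SPACING = 0.256e-9      # nearest-neighbor spacing in Cu, m (taken as mean free path)
DENSITY = 8.96e3        # mass density of Cu, kg/m^3
MOLAR_MASS = 0.0635     # molar mass of Cu, kg/mole
SIGMA_MEASURED = 5.96e7 # measured conductivity of Cu, 1/(ohm m)

# Semiclassical average speed: (1/2) m v^2 = (3/2) kT, written with m c^2 in eV.
v_av = C_LIGHT * math.sqrt(3 * KT_EV / M_E_C2_EV)

# Eq. 11.20: time between collisions.
tau = SPACING / v_av

# Electron density, assuming one conduction electron per atom.
n = DENSITY * N_AVOGADRO / MOLAR_MASS

# Eq. 11.19: conductivity.
sigma = n * E_CHARGE ** 2 * tau / M_E

print(f"v_av  = {v_av:.3e} m/s")
print(f"tau   = {tau:.3e} s")
print(f"n     = {n:.3e} m^-3")
print(f"sigma = {sigma:.3e} 1/(ohm m)")
print(f"measured / estimated = {SIGMA_MEASURED / sigma:.1f}")
```

The final ratio makes the size of the discrepancy explicit: the semiclassical estimate falls short of the measured conductivity by roughly a factor of ten.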

We have actually made a couple of errors in this calculation, both of which 
have to do with ignoring the effects of quantum mechanics in the conduction 
process. Let’s see how we can remedy these defects. 


Quantum Theory of Electrical Conduction 


Figure 11.16a represents the Fermi distribution of electron velocities in the metal.
Like the Fermi energy distribution, it is flat at v = 0 and it falls to zero near the
Fermi velocity $v_F$, but (unlike the energy distribution) it has positive and negative
branches because the electrons can move in either direction. When an electric field
is applied, all electrons acquire on the average an additional velocity component
equal to the drift velocity $v_d$, which shifts the entire velocity distribution to the
left (opposite the field direction), as shown in Figure 11.16b. Even though the
entire distribution shifts, the net effect of applying the electric field is centered on
a small number of electrons in the vicinity of $v_F$. The electric field causes some
electron states near $v_F$ in the direction of $\vec{E}$ to become unoccupied, while an equal
number of states near $v_F$ in a direction opposite to $\vec{E}$ become occupied.

FIGURE 11.16 (a) The Fermi-Dirac velocity distribution function.
(b) When an electric field is applied, the distribution shifts as the electrons
are accelerated in a direction opposite to the field.

The only electrons affected by the field are those in a narrow interval near the
Fermi energy. These electrons are moving with a speed $v_F = \sqrt{2E_F/m}$, which
for copper with $E_F = 7.03$ eV works out to be $1.57 \times 10^{6}$ m/s. This speed is
about an order of magnitude larger than the speed we found in Example 11.4,
which would give us a shorter average time between collisions and hence a
smaller conductivity. This seems to make the disagreement between theory and
experiment even worse! Moreover, the Fermi energy is nearly independent of
temperature, so the speed of electrons near the Fermi energy should likewise not
change with temperature, nor should the conductivity.

That leaves the mean free path $l$ as our last resort in fixing the calculation.
However, in a perfectly arranged lattice, the mean free path should be infinite!
Because atoms are mostly empty space, an electron should have a clear path
through the material without scattering from the lattice ions. A perfect lattice
should have an infinite conductivity! In practice, we find that the mean free path
of an electron may be many hundreds of times the spacing between atoms, so that
an encounter of an electron with a lattice ion is not very common.
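
One way to get a feel for these numbers is to invert Eq. 11.19 and Eq. 11.20: take the measured conductivity of copper, use the Fermi speed as the relevant electron speed, and solve for the mean free path. The sketch below does this with the values quoted earlier in the section; the function names are illustrative, and one conduction electron per atom is assumed.

```python
import math

C_LIGHT = 3.00e8      # speed of light, m/s
M_E_C2_EV = 0.511e6   # electron rest energy, eV
M_E = 9.11e-31        # electron mass, kg
E_CHARGE = 1.60e-19   # elementary charge, C


def fermi_speed(fermi_energy_ev):
    """Speed of an electron at the Fermi energy, v_F = sqrt(2 E_F / m), in m/s."""
    return C_LIGHT * math.sqrt(2 * fermi_energy_ev / M_E_C2_EV)


def mean_free_path(sigma, electron_density, speed):
    """Mean free path implied by sigma = n e^2 tau / m and tau = l / v, in m."""
    tau = sigma * M_E / (electron_density * E_CHARGE ** 2)
    return speed * tau


v_f = fermi_speed(7.03)                        # Cu Fermi energy from the text
path = mean_free_path(5.96e7, 8.49e28, v_f)    # measured sigma and n for Cu
spacings = path / 0.256e-9                     # in units of the Cu atomic spacing

print(f"v_F            = {v_f:.3e} m/s")
print(f"mean free path = {path:.2e} m")
print(f"               = {spacings:.0f} atomic spacings")
```

Under these assumptions the room-temperature mean free path in copper comes out on the order of a hundred or more atomic spacings, far longer than the single spacing assumed in Example 11.4. Weaker scattering (colder or purer samples) would lengthen it further.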

In a real metallic lattice, two effects contribute to the scattering of electrons: 
(1) the atoms are in random thermal motion (oscillating about their equilibrium 
positions) and therefore do not occupy exactly the positions of a perfectly 
arranged lattice, and (2) lattice imperfections and impurities cause deviations 
from the ideal lattice. The first effect is temperature dependent and dominates at 
high temperatures; the second effect is independent of temperature and dominates 
at the lowest temperatures. In fact, because the average vibrational potential 
energy (which depends on the square of the vibrational amplitude) is proportional 
to the temperature, the average area that a vibrating atom presents to an electron 
moving through the lattice is also proportional to T. The mean free path is then
proportional to 1/T, and therefore the conductivity decreases like $T^{-1}$, in
agreement with observations.

Figure 11.17 shows the resistivity (the inverse of conductivity) of sodium metal 
as a function of the temperature. You can see the temperature-independent part 
at low temperature and the temperature-dependent part at higher temperatures 
(which increases linearly with T). 

The same principles that govern electrical conductivity in a metal also govern 
thermal conductivity. Heat entering the material causes electrons in a small interval 
(of width kT) near the Fermi energy to move more rapidly, and those electrons can
transfer their energy to the lattice in collisions with the ions. Assuming the mean 
free paths for electrical conduction and thermal conduction are the same, the ratio 
of the thermal conductivity to the electrical conductivity should be independent 
of the material but should depend only on the temperature (because the interval 
kT determines the number of electrons available for thermal conduction). 

The ratio $K/\sigma T$ (where K is the thermal conductivity) should be the same for
all materials and all temperatures. The proportionality of the thermal and electrical
conductivities is known as the Wiedemann-Franz law. The ratio $K/\sigma T$ can be
calculated from the parameters of the electron gas model to be

$$L = \frac{\pi^2 k^2}{3e^2} = 2.44\times10^{-8}\ \text{W}\cdot\Omega/\text{K}^2 \qquad (11.21)$$

which is called the Lorenz number. Figure 11.18 shows the ratio $K/\sigma T$ for a
variety of metals at room temperature. In this region the thermal and electrical
conductivities vary by nearly two orders of magnitude, but the ratio remains fairly
constant and agrees with the Lorenz value. This agreement is another successful
application of Fermi-Dirac statistics to the properties of electrons in solids.
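
Equation 11.21 is easy to verify directly from the fundamental constants. The following sketch also uses the Wiedemann-Franz law to turn the measured electrical conductivity of copper quoted earlier into a rough thermal conductivity; the helper name and the choice of 293 K for room temperature are assumptions made for illustration.

```python
import math

K_BOLTZMANN = 1.381e-23  # Boltzmann constant, J/K
E_CHARGE = 1.602e-19     # elementary charge, C

# Lorenz number, Eq. 11.21, in W ohm / K^2.
lorenz = math.pi ** 2 * K_BOLTZMANN ** 2 / (3 * E_CHARGE ** 2)


def thermal_conductivity_estimate(sigma, temperature_k):
    """Wiedemann-Franz estimate K = L * sigma * T, in W/(m K)."""
    return lorenz * sigma * temperature_k


k_cu = thermal_conductivity_estimate(5.96e7, 293.0)

print(f"L     = {lorenz:.3e} W ohm/K^2")
print(f"K(Cu) = {k_cu:.0f} W/(m K)  (electron-gas estimate)")
```

The estimate for copper comes out at a few hundred W/(m K), which is the right order of magnitude for a good metallic conductor.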
FIGURE 11.17 The electrical resistivity of sodium metal as a function of
the temperature.

FIGURE 11.18 The Wiedemann-Franz ratio $K/\sigma T$ for various metals
at room temperature. The solid line is the Lorenz number.


11.4 BAND THEORY OF SOLIDS


The model of treating a conductor as a gas of free electrons has taken us a 
long way toward understanding the properties of materials, but to gain a deeper 
understanding we must consider the interaction of the electrons with the atoms of 
the lattice. We’ll see that this interaction leads to a profound difference between 
the electron energies of the free electron gas (continuous from zero up to the 
Fermi energy) and the electron energies in an interacting system (an alternating 
series of allowed and forbidden energy regions). 

When two identical atoms, such as sodium, are very far apart, the electronic 
levels in one are not affected by the presence of the other. The 3s electron of each 
atom has a single energy with respect to its nucleus. As we bring the atoms closer 
together, the electron wave functions begin to overlap, and two different 3s levels 
form, depending on whether the two wave functions add or subtract. This effect 
is responsible for molecular binding, as discussed in Section 9.2. Figure 11.19 
shows a representation of the energy levels. 

As we bring together more atoms, the same sort of effect occurs. When the
sodium atoms are far apart, all 3s electrons have the same energy, and as we begin
to move them together, the energy levels begin to “split.” The situation for five
atoms is shown in Figure 11.20. There are now five energy levels that result from
the five overlapping electron wave functions. As the number of atoms is increased
to the very large numbers that characterize an ordinary piece of metal (perhaps
$10^{23}$ atoms), the levels become so numerous and so close together that we can no
longer distinguish the individual levels, as shown in Figure 11.21. We can regard
the N atoms as forming an almost continuous band of energy levels. Because those
levels were identified with the 3s atomic levels of sodium, we refer to the 3s band.

Each energy band in a solid with N atoms has a total of N individual levels.
Each level can hold $2(2l + 1)$ electrons (corresponding to the two different
orientations of the electron spin and the $2l + 1$ orientations of the electron orbital
angular momentum) so that the capacity of each band is $2(2l + 1)N$ electrons.

Figure 11.22 shows a more complete representation of the energy bands in
sodium metal. The 1s, 2s, and 2p bands are each full; the 1s and 2s bands each
contain 2N electrons and the 2p band contains 6N electrons. The 3s band could
accommodate 2N electrons as well; however, each of the N atoms contributes
only one 3s electron to the solid, and so there is a total of only N 3s electrons
available. The 3s band is therefore half full. Above the 3s band is a 3p band,
which could hold 6N electrons, but which is completely empty.

FIGURE 11.19 Splitting of 3s level when two atoms are brought together.

FIGURE 11.20 Splitting of 3s level when five atoms are brought together.

FIGURE 11.21 Formation of 3s band by a large number of atoms.

The situation we have described is the ground state of sodium metal. When 
we add energy to the system (thermal or electrical energy, for example), the 
electrons can move from the filled states to any of the empty states. In this case, 
electrons from the partially full 3s band can absorb a small amount of energy and 
move to empty 3s states within the 3s band, or they can absorb a larger amount 
of energy and move to the 3p band. 

We can describe this situation in a more correct way using the Fermi-Dirac
distribution. At a temperature of T = 0 K, all electron levels below the Fermi
energy $E_F$ are filled and all levels above the Fermi energy are empty. In the case
of sodium, the Fermi energy is in the middle of the 3s band, because all electron
levels below that energy are occupied (Figure 11.23). At higher temperatures
the Fermi energy gives the level at which the occupation probability is 0.5; the
Fermi energy does not change significantly as we increase the temperature, but
the occupation probability of the levels above $E_F$ is no longer zero. Figure 11.24
shows a situation in which the thermal excitation of electrons leads to a small
population of the 3p band and some vacant states in the 2p band.

Sodium is an example of a substance that is a good electrical conductor. When 
we apply a very modest potential difference, of the order of 1 V, electrons can 
easily absorb energy because there are N unoccupied states within the 3s band, all 
within an energy of about 1 eV. Electrons absorb energy as they are accelerated
by the applied voltage, and they are therefore free to move as long as there are 
many unoccupied states within the accessible energy range. In sodium there are 
N relatively free electrons that can easily move to N unoccupied energy states, 
and sodium is therefore a good conductor. 
FIGURE 11.22 Energy bands in sodium metal.

FIGURE 11.23 Energy bands in sodium at T = 0 (the filled 1s
and 2s bands are not shown). The Fermi energy is at the center
of the half-filled 3s band. Note that the Fermi-Dirac distribution
function is drawn with the energy axis vertical.

FIGURE 11.24 Energy bands in sodium at T > 0. The 2p band is
no longer completely full (there are a few vacant states near the
top), and the 3p band is no longer completely empty.

FIGURE 11.25 When $E_g \gg kT$, there are no electrons in the conduction
band. This situation characterizes an insulator.

FIGURE 11.26 Band structure of a semiconductor. The gap is much
smaller than in an insulator, so there is now a small population of the
conduction band.

FIGURE 11.27 Band structure in magnesium. The filled 3s and empty
3p bands overlap, forming a single partially filled band.

The band structure of sodium, in which the Fermi level lies in the middle of a 
band, is characteristic of many good electrical conductors. A completely different 
situation occurs when the Fermi level lies in the gap between two bands, so that 
the band below $E_F$ is completely full and the band above is completely empty. If
the gap energy $E_g$ is large compared with kT, then even though the Fermi-Dirac
distribution spreads as the temperature is raised, it doesn’t spread enough to result 
in a significant population of states in the upper band (called the conduction band) 
or a significant number of empty states in the lower band, called the valence band. 
This situation is shown in Figure 11.25. There are many electrons in the valence 
band available for electrical conduction, but there are few empty states for them to 
move through, so they do not contribute to the electrical conductivity. There are 
many empty states in the conduction band, but at ordinary temperatures there are 
so few electrons in that band that their contribution to the electrical conductivity 
is also very small. These substances are classified as insulators and in general 
they have two properties: a large energy gap (a few electron-volts) between the 
valence and conduction bands, and a Fermi level that is in the gap between the 
bands (i.e., a filled valence band and an empty conduction band). 

A material with the same basic structure but a much smaller energy gap 
(1 eV or less) shows quite a different behavior. These materials are known as
semiconductors. Figure 11.26 shows a representation of such a substance at 
ordinary temperatures. There are now many electrons in the conduction band, and 
of course many empty states accessible to them, so that they can conduct relatively 
easily. There are also many empty states in the valence band, so that some of the 
electrons in the valence band can also contribute to the electrical conductivity 
by moving about through those states. We consider these two mechanisms of 
electrical conduction in detail in Section 11.6. For now we note two characteristic 
properties of semiconductors that relate directly to the band structure as shown 
in Figure 11.26. (1) Because thermal excitation across the gap is relatively 
probable, the electrical conductivity of semiconductors depends more strongly 
on temperature than the electrical conductivity of insulators or conductors. (2) It 
is possible to alter the structure of these materials, by adding impurities in very 
low concentration, in such a way that the Fermi energy changes and may move 
up toward the conduction band or down toward the valence band. This process, 
known as doping, can have a great effect on the conductivity of a semiconductor. 
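
The contrast between an insulator and a semiconductor can be made quantitative with the Fermi-Dirac factor. The sketch below is a rough illustration that assumes the Fermi level sits exactly at mid-gap, so that the bottom of the conduction band lies $E_g/2$ above $E_F$; the text only states that the Fermi level lies in the gap, so the mid-gap placement is an assumption. The gap values of 7 eV and 1 eV are representative figures for an insulator and a semiconductor.

```python
import math

KT_ROOM_EV = 0.025  # kT at room temperature, eV


def occupation_probability(energy_above_ef_ev, kt_ev=KT_ROOM_EV):
    """Fermi-Dirac factor 1 / (exp((E - E_F)/kT) + 1)."""
    return 1.0 / (math.exp(energy_above_ef_ev / kt_ev) + 1.0)


# Occupation probability at the bottom of the conduction band,
# assuming (hypothetically) that E_F is at the middle of the gap.
for label, gap_ev in (("insulator, E_g = 7 eV", 7.0),
                      ("semiconductor, E_g = 1 eV", 1.0)):
    f = occupation_probability(gap_ev / 2)
    print(f"{label:28s} f = {f:.2e}")
```

Even the semiconductor value is a small number per state, but multiplied by the enormous number of states in a macroscopic sample it yields a usable population of conduction electrons, whereas the insulator value is negligible by any measure. The exponential dependence on $E_g/kT$ is also why semiconductor conductivity is so sensitive to temperature.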

In the examples we have discussed so far, it is not apparent why the band theory 
is so useful in understanding the properties of a solid. Sodium, for example, is 
expected to be a good conductor based on its atomic properties alone (a relatively 
loosely bound 3s electron); on the other hand, solid xenon has only filled atomic 
shells and should be a poor conductor. These conclusions follow either from 
simple atomic theory or from band theory. However, there are many cases in 
which atomic theory leads to wrong predictions while band theory gives correct 
results. We consider two examples. (1) Magnesium has a filled 3s shell, and on 
the basis of atomic theory alone we expect it to be a poor electrical conductor. 
It is, however, a very good electrical conductor. (2) The 2p shell of carbon has 
only two electrons of the maximum number of six. Carbon should therefore be a 
relatively good conductor; instead it is an extremely poor conductor. 

We can understand both of these materials based on the unusual way the bands
of these solids behave when the atoms are close enough so that the band gap
disappears and the bands overlap. In magnesium (Figure 11.27), for example, the
(filled) 3s and (empty) 3p bands overlap, and the result is a single band with a
capacity of 2N + 6N = 8N electrons. Only 2N of those states are filled, and so
magnesium behaves like a material with a single band filled only to one-fourth its
capacity. Magnesium is therefore a very good conductor.

In carbon, the overlap of the electronic wave functions at close range first 
causes mixing of the 2s and 2p bands, in a way similar to magnesium; a single band 
is created with a capacity of 8N electrons (Figure 11.28). The 2s states contribute 
2N electrons, and the 2p states contribute another 2N (out of a maximum capacity 
of 6N). As the atoms approach still closer, the band divides into two separate 
bands, each with a capacity of 4N electrons. Because carbon has four valence 
electrons (two 2s and two 2p), the lower 4N states are completely filled and the 
upper 4N states of the conduction band are completely empty. Carbon is therefore 
an insulator. Germanium and silicon have the same type of structure as carbon, 
but their equilibrium separation is greater, so the gap between the valence and 
conduction bands is smaller, about 1 eV; it is this feature that causes Ge and Si to
be semiconductors. 
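
The band-filling bookkeeping used for sodium, magnesium, and carbon follows directly from the $2(2l + 1)N$ capacity rule. The short sketch below tabulates it per atom; the function names are illustrative, and the carbon case simply uses the 4N capacity of the lower split band stated above.

```python
from fractions import Fraction


def band_capacity_per_atom(l):
    """Electrons per atom that a band built from orbitals of quantum number l can hold."""
    return 2 * (2 * l + 1)


def filling_fraction(electrons_per_atom, orbital_l_values):
    """Fraction of a (possibly merged) band that is occupied.

    orbital_l_values lists the l of each atomic level that merges
    into the band, e.g. [0, 1] for overlapping s and p bands.
    """
    capacity = sum(band_capacity_per_atom(l) for l in orbital_l_values)
    return Fraction(electrons_per_atom, capacity)


sodium = filling_fraction(1, [0])        # one 3s electron in the 3s band
magnesium = filling_fraction(2, [0, 1])  # two electrons in the merged 3s + 3p band
carbon_lower = Fraction(4, 4)            # four valence electrons in the lower 4N band

print(f"Na 3s band:            {sodium}")
print(f"Mg 3s+3p band:         {magnesium}")
print(f"C lower (valence) band: {carbon_lower}")
```

A partially filled band (Na, Mg) means empty states are available at negligible energy cost, while a completely filled band separated by a gap from the next one (C) leaves the electrons nowhere to go.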


*Justification of Band Theory 


The band theory of solids has had great success in accounting for the properties 
of metals, insulators, and semiconductors. In this section we consider a different 
approach to band theory that is based on the quantum mechanics of an electron 
moving through a lattice of ions. In analogy with solutions to the Schrödinger
equation discussed in Chapter 5, in which an electron in a potential energy well 
shows discrete energy levels, we will see that an electron in a periodic potential 
energy provided by a lattice of ions can show energy bands. 

To simplify the problem we consider only a one-dimensional lattice of ions 
(Figure 11.29). The electron is represented by a de Broglie wave traveling through 
the lattice. The interaction between the electron and the lattice can be represented 
as a scattering problem, similar to Bragg scattering (Section 3.1). The Bragg 
condition for scattering is 


$2d \sin\theta = n\lambda \qquad (n = 1, 2, 3, \ldots)$ (11.22) 


where d is the atomic spacing and $\theta$ is the angle of incidence measured from the 
plane of atoms (not from the normal). In a two-dimensional lattice, the incident 
wave can be scattered in many different directions, depending on the plane where 
we imagine the reflection to occur (recall Figure 3.6); in one dimension, however, 
only one possible reflection can occur: the incident wave can be reflected back 
in the opposite direction. We can use the Bragg condition for this case, with d = a 
(the spacing between the ions or atoms of the lattice) and $\theta = 90^\circ$ (the angle 
between the “reflecting plane” and the incident wave). With $2a \sin 90^\circ = n\lambda$ and 
$\lambda = 2\pi/k$ (where k is the wave number), we find 


$k = \frac{n\pi}{a}$ (11.23) 

For wave numbers that do not satisfy this condition, the electron propagates 
freely through the lattice and behaves like a free particle whose energy is only 
kinetic: 


$E = \frac{p^2}{2m} = \frac{\hbar^2 k^2}{2m}$ (11.24) 


*This is an optional section that may be skipped without loss of continuity. 

FIGURE 11.28 Band structure of carbon (n = 2), silicon (n = 3), and 
germanium (n = 4). The combined ns + np band splits into two bands, 
each of which can hold 4N electrons. The atomic separation of carbon gives 
it a gap of about 7 eV and makes it an insulator, while the larger separation 
of silicon and germanium results in a smaller gap of about 1 eV and makes 
them semiconductors. 

FIGURE 11.29 One-dimensional Bragg scattering. The only possible 
scattering is a reflection back in the opposite direction. 

FIGURE 11.30 The parabolic relationship between energy and wave 
number for a free particle. 

FIGURE 11.31 Probability densities for two different standing waves in the 
one-dimensional lattice. 

FIGURE 11.32 The relationship between energy and wave number for 
a one-dimensional lattice. The dashed curve is the parabola that represents 
the free particles. The solid curves represent waves scattered by the lattice. 


There are no restrictions on k, so all values of E are allowed. This relationship 
between E and k for free electrons defines a parabola, as shown in Figure 11.30. 
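
To get a feeling for the energy scale, the free-particle energy of Eq. 11.24 can be evaluated at the wave numbers singled out by Eq. 11.23. The following is only a rough sketch: the lattice spacing of 0.30 nm is an assumed, illustrative value (it is not taken from the text), and the function names are chosen here for convenience.

```python
import math

HBAR_C_EV_NM = 197.327        # hbar*c in eV*nm
ELECTRON_MC2_EV = 510998.95   # electron rest energy in eV


def bragg_wave_number(n, a_nm):
    """Wave number k = n*pi/a (in 1/nm) that satisfies Eq. 11.23."""
    return n * math.pi / a_nm


def free_electron_energy_ev(k_per_nm):
    """Free-particle energy E = hbar^2 k^2 / 2m (Eq. 11.24), in eV."""
    return (HBAR_C_EV_NM * k_per_nm) ** 2 / (2.0 * ELECTRON_MC2_EV)


a_nm = 0.30  # assumed lattice spacing, for illustration only
for n in (1, 2, 3):
    k = bragg_wave_number(n, a_nm)
    energy = free_electron_energy_ev(k)
    print(f"n = {n}: k = {k:6.2f} nm^-1, E = {energy:6.2f} eV")
```

For this assumed spacing the n = 1 Bragg condition falls at a free-particle energy of about 4 eV, and the higher-order conditions scale as n².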

For wave numbers that satisfy the Bragg condition, the reflected and incident 
waves add to produce standing waves, which always result when we superpose 
two waves of equal wavelengths traveling in opposite directions. Depending on 
the phase difference between the waves, their amplitudes can add or subtract, so 
two different possible standing waves can result. Their probability densities are 
shown in Figure 11.31. For one of the waves ($\psi_1$), the electrons are more likely 
to be found close to the positive ions; these electrons are more tightly bound to 
the lattice, and the energy of the electron is a bit lower than that of the free electron, 
due to the negative potential energy between the electron and the ions. Electrons 
represented by the other wave ($\psi_2$) are most likely to be found in the region 
between the ions; they are less tightly bound, so their energies are a bit above 
those of the unscattered electrons (for which the probability density is flat, so they 
are equally likely to be found at any location). 

The resulting dependence of the energy of the electrons on the wave number k 
is illustrated by the S-shaped curve segments in Figure 11.32. For wave numbers 
that are far from satisfying the Bragg condition (that is, values of k that are not 
close to $n\pi/a$), the curve segments overlap the dashed parabola representing the 
free particle. Close to the wave numbers that satisfy the Bragg condition, however, 
the energy deviates from that of the free particle, a bit below the parabola for 
the more tightly bound electrons that spend more time near the ions and a bit 
above the parabola for the less tightly bound electrons that are more likely found 
between the ions. 

Notice from Figure 11.32 that, even though all values of k are permitted, 
there are certain allowed bands of energy values separated by forbidden gaps. An 
electron traveling in this lattice is permitted to have energies only in the regions 
corresponding to the allowed bands. This indicates how a periodic array of atoms 
results in energy bands. 

A more detailed calculation in three dimensions gives a better representation 
of the allowed and forbidden bands, but you can see that even this basic one- 
dimensional model shows how the bands can arise from the interactions of the 
electrons with a periodic lattice. 


11.5 SUPERCONDUCTIVITY 


At low temperatures, the resistivity of a metal (the inverse of its conductivity) is 
nearly constant. As the temperature of a material is lowered, the lattice contribution 
to the resistivity decreases while the impurity contribution remains approximately 
constant, and as we approach T = 0 K the resistivity should approach a constant 
value. Many metals, known as normal metals, behave in this way, as illustrated 
in Figure 11.17. 

The behavior of another class of metals is quite different. These metals behave 
normally as the temperature is decreased, but at some critical temperature $T_c$ 
(which depends on the properties of the metal), the resistivity drops suddenly to 
zero, as shown in Figure 11.33. These materials are known as superconductors. 
The resistivity of a superconductor is not merely very small at temperatures below 
$T_c$; it vanishes! Such materials can conduct electric currents even in the absence of 
an applied voltage, and the conduction occurs with no $i^2R$ (joule heating) losses. 

Superconductivity has been observed in 28 elements at ordinary pressures, in 
several additional elements at high pressure, and in hundreds of compounds and 
alloys. Since the original discovery of superconductivity in Hg in 1911, the focus 
of research has been to search for materials with the highest possible critical 
temperature, because many possible large-scale applications of superconductivity 
are presently impractical owing to the high cost of keeping materials below their 
critical temperatures. Table 11.6 summarizes some superconducting materials and 
their critical temperatures. You can see that before 1986, progress in raising the 
critical temperature was very slow, but since 1986 dramatic and rapid increases 
in $T_c$ have been achieved. 

FIGURE 11.33 Resistivity of a superconductor. 

Conspicuously absent from the list of superconductors are the best metallic 
conductors (Cu, Ag, Au), which suggests that superconductivity is not caused 
by a good conductor getting better but instead must involve some fundamental 
change in the material. In fact, superconductivity results from a kind of paradox: 
ordinary materials can be good conductors if the electrons have a relatively weak 
interaction with the lattice, but superconductivity results from a strong interaction 
between the electrons and the lattice. 

Consider an electron moving through the lattice. As it moves, it attracts the 
positive ions and disturbs the lattice, much as a boat moving through water creates 
a wake. These disturbances propagate as lattice vibrations, which can then interact 
with another electron. In effect, two electrons interact with one another through 
the intermediary of the lattice; the electrons move in correlated pairs that do 
not lose energy by interacting with the lattice. (These pairs are not necessarily 
traveling together through the lattice; they may be separated by a large distance.) 
In the absence of a net current, the members of a pair have opposite momenta; 
when a net current is established, both members of the pair acquire a slight 
increase in momentum in the same direction, and this motion is responsible for 
the current. 


TABLE 11.6 Some Superconducting Materials 

Material                                    $T_c$ (K)   Gap (meV)   Year 
Zn                                          0.85        0.24        1933 
Al                                          1.18        0.34        1937 
Sn                                          3.72        1.15        1913 
Hg                                          4.15        1.65        1911 
Pb                                          7.19        2.73        1913 
Nb                                          9.25        3.05        1930 
Nb_3Sn                                      18.1                    1954 
Nb_3Ge                                      23.2                    1973 
La_xBa_{2-x}CuO_4                           36                      1986 
La_xSr_{2-x}CuO_4                           40                      1986 
YBa_2Cu_3O_7                                93                      1987 
Tl_2Ba_2Ca_2Cu_3O_{10}                      125                     1988 
Hg_{12}Tl_3Ba_{30}Ca_{30}Cu_{45}O_{127}     138                     1994 

FIGURE 11.34 The number of filled electron states in (a) a normal conductor at 
T = 0; (b) a normal conductor at T > 0; (c) a superconductor at T = 0; (d) a 
superconductor at T > 0 but less than $T_c$. In (c), there is an energy gap of width $E_g$, 
and states displaced from within the gap pile up on either side of the gap. At higher 
temperatures, as in (d), the gap is narrower and there are empty states below the gap 
and filled states above the gap. As the temperature is increased to $T_c$, the gap width 
becomes 0 and the distribution of occupied states of the superconductor approaches 
that of the normal conductor. The gap width is exaggerated in the figure; generally, 
$E_g \ll E_F$. 


According to the successful BCS theory of superconductivity,* below the 
critical temperature there is a small energy gap $E_g$ in the occupation probability of 
electrons in a superconductor (Figure 11.34). Below the gap, the electrons form 
pairs, which are known as Cooper pairs. Once a single Cooper pair forms, it is 
energetically favorable for other pairs to form, so the change from the normal 
state above $T_c$ to the superconducting state below $T_c$ is quite sudden. (As shown in 
Figure 11.34c, the population can exceed the limits imposed by the Fermi-Dirac 
distribution of at most one electron per quantum state. When the electrons are 
paired, they no longer behave like fermions and so it is possible to have more 
than one in each quantum state.) 

When a superconductor is cooled below $T_c$, the gap opens and Cooper pairs 
begin to form. As the material is cooled further, the gap widens. Values of the 
energy gap listed in Table 11.6 correspond to the limiting case as $T \to 0$. The 
energy gaps, which can be regarded as representing the binding energy of a 
Cooper pair, are very small, of the order of $10^{-3}$ eV. At T = 0, all states below the 
gap are occupied. When $0 < T < T_c$, there are some unoccupied states below the 
gap, and some states above the gap are occupied by normal (unpaired) electrons. 

It seems reasonable that there should be a direct relationship between the critical 
temperature and the energy gap: the larger the energy gap, the more thermal energy 
is required to break the Cooper pairs to destroy the superconductivity. The BCS 
theory gives this relationship: 


$E_g = 3.53\,kT_c$ (11.25) 


*The theory of superconductivity was developed in 1957 by John Bardeen, Leon N. Cooper, and J. 
Robert Schrieffer, who were awarded the 1972 Nobel Prize in physics for their work. Bardeen also 
shared the 1956 Nobel Prize for his research on semiconductors and his development of the transistor. 


As the temperature is raised to $T_c$, the gap width decreases, and the superconductivity 
disappears above $T_c$, where the gap width becomes zero. 
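
Equation 11.25 is easy to check against measured values. The short sketch below evaluates 3.53 kT_c for three of the elements in Table 11.6 and prints the result next to the tabulated gap; the function name and the choice of elements are ours, made only for illustration.

```python
K_BOLTZMANN_EV_PER_K = 8.617333e-5  # Boltzmann constant in eV/K


def bcs_gap_mev(t_c_kelvin):
    """Zero-temperature BCS gap E_g = 3.53 k T_c (Eq. 11.25), in meV."""
    return 3.53 * K_BOLTZMANN_EV_PER_K * t_c_kelvin * 1.0e3


# (T_c in K, measured gap in meV), copied from Table 11.6
table_entries = {"Zn": (0.85, 0.24), "Al": (1.18, 0.34), "Hg": (4.15, 1.65)}

for material, (t_c, measured_gap) in table_entries.items():
    predicted = bcs_gap_mev(t_c)
    print(f"{material}: predicted {predicted:.2f} meV, measured {measured_gap:.2f} meV")
```

For Zn and Al the prediction lies within about 10% of the tabulated gap, while for Hg it falls roughly 25% below it, so Eq. 11.25 is best regarded as a guide to the size of the gap rather than an exact rule for every material.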

Beginning in 1986, a new class of superconductors was discovered with unusually 
high values of $T_c$. In the 75 years from the discovery of superconductivity in 
1911 until 1986, the highest $T_c$ had gone from 4 K to about 23 K. In 1986, several 
materials were discovered with $T_c$ in the range 30 to 40 K. By 1987 it was up to 93 K, 
and it rose to 138 K in 1994. Crossing the boundary at 77 K (the boiling point of 
liquid nitrogen) is important, because it means that cooling can be accomplished 
with liquid nitrogen instead of liquid helium, which costs nearly an order of 
magnitude more than liquid nitrogen. The rapid increase in $T_c$ has led to the hope 
that it might be possible to develop materials that are superconductors at room 
temperature. Such materials could enable the transmission of electric power over 
long distances without resistive losses. 

The high-$T_c$ superconductors are oxides of copper in combination with other 
elements. They are ceramics, which means that they are rather brittle and not 
easily formed into wires to carry current. The crystal structure is characterized 
by planes of copper and oxygen between planes of the other elements. It seems 
likely that the superconductivity occurs in the copper oxide planes, but it is not 
yet clear that the complete explanation for these new superconductors is given by 
the BCS theory. 

Superconducting materials have many applications that take advantage of their 
abilities to carry electrical currents without resistive losses. Electromagnets can 
be constructed that carry large currents and therefore produce large magnetic 
fields (of order 5 to 10 T). Currents as large as 100 A can be carried by very fine 
superconducting wires, of order 0.1 mm diameter, and thus such magnets can be 
constructed in a smaller space, using less material, than would be possible with 
ordinary conductors. Once started, a current in a superconducting loop of wire 
can circulate for years with no external source to drive it. Superconducting wires 
are also used to produce magnetic fields in magnetically levitated trains and in 
magnetic resonance imaging, and also to bend beams of particles in high-energy 
accelerators, such as the Large Hadron Collider, which began operation in 2008. 
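
The figures quoted above (100 A in a wire of about 0.1 mm diameter) imply a very large current density. A minimal sketch of the arithmetic, assuming a wire of circular cross section carrying a uniform current (the function name is ours):

```python
import math


def current_density_a_per_m2(current_a, diameter_m):
    """Current density J = I / (pi r^2) for a round wire, in A/m^2."""
    radius_m = diameter_m / 2.0
    return current_a / (math.pi * radius_m ** 2)


j = current_density_a_per_m2(100.0, 0.1e-3)
print(f"J = {j:.2e} A/m^2  ({j * 1e-6:.0f} A/mm^2)")
```

The result is of order $10^{10}$ A/m², or roughly $10^4$ A per square millimeter of wire.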


Josephson Effect 


Imagine a thin layer of insulating material sandwiched between two identical 
superconductors. The insulating layer is thin enough that the electron pairs can 
tunnel through from one superconductor to another. This is a typical case of 
one-dimensional barrier penetration, such as we discussed in Chapter 5. 

Figure 11.35 represents the arrangement. In the superconductors (regions 1 
and 3), the electron pairs move freely, so the wave functions are of the form of 
the free particle: $\psi_1(x) = Ae^{i(kx+\alpha_1)}$ and $\psi_3(x) = Ae^{i(kx+\alpha_3)}$, where $\alpha_1$ and $\alpha_3$ are 
arbitrary phase angles (the amplitude A is taken to be real). At the two boundaries 
on either side of the barrier, the waves are $\psi_1 = Ae^{i\phi_1}$ and $\psi_3 = Ae^{i\phi_3}$, where $\phi_1$ 
and $\phi_3$ represent the values of the phases at the boundaries. Inside the barrier 
of height $U_0$, the wave function is of the form $\psi_2(x) = Be^{k_1 x} + Ce^{-k_1 x}$, where 
$k_1 = \sqrt{2mU_0/\hbar^2}$. Applying the two boundary conditions on $\psi_2$, we ultimately 
obtain a current of the form 


$i = i_0 \sin(\phi_1 - \phi_3)$ (11.26) 


The existence of this “supercurrent” through the junction was first predicted by 
British physicist Brian Josephson in 1962, and it is now known as the Josephson 
effect. Josephson shared the 1973 Nobel Prize in physics for this discovery. 

FIGURE 11.35 (a) A thin insulator sandwiched between two superconductors 
provides a potential energy barrier of height $U_0$ to the flow of current. 
(b) There is a phase difference of $\phi_1 - \phi_3$ between the wave functions 
$\psi_1$ and $\psi_3$ in the superconductors. 

FIGURE 11.36 Two Josephson junctions can be combined to form a 
quantum interference device. 


One important application of the Josephson effect is in the measurement of 
very weak magnetic fields. Consider a device with two Josephson junctions, as 
shown in Figure 11.36. Under ordinary circumstances the current divides equally 
in the two branches, and the phase difference is the same across both junctions. 
Now suppose a magnetic field is applied perpendicular to the plane of the loop. 
An additional current is induced around the loop. This additional current, either 
clockwise or counterclockwise depending on the direction of the magnetic field, 
will add to the current in one of the Josephson junctions and subtract from the 
current in the other, causing a relative change in the phase differences across the 
two junctions. When the currents combine, this phase difference causes maxima 
and minima in the net current leaving the loop in a manner that is very similar 
to the maxima and minima in double-slit interference. Observing the maxima 
and minima serves as a sensitive measurement of the magnetic field in the loop. 
This device is called a SQUID (Superconducting QUantum Interference Device) 
and allows measurements of magnetic fields of less than $10^{-17}$ T. Such sensitive 
devices allow precise mapping of the magnetic fields inside the brain and also find 
use in other medical procedures including magnetic resonance imaging (MRI). 
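
The interference pattern can be made quantitative with a standard idealization that goes beyond the derivation given here: for two identical junctions, each with maximum current $i_0$, in a loop of negligible inductance, the largest supercurrent the loop can carry is $2i_0 |\cos(\pi\Phi/\Phi_0)|$, where $\Phi$ is the magnetic flux through the loop and $\Phi_0 = h/2e$ is the flux quantum. The sketch below uses that formula; the loop area of 1 mm² is an assumed, illustrative value, and the function names are ours.

```python
import math

PLANCK_H = 6.62607015e-34            # J*s
ELEMENTARY_CHARGE = 1.602176634e-19  # C
FLUX_QUANTUM = PLANCK_H / (2.0 * ELEMENTARY_CHARGE)  # h/2e, in Wb


def squid_max_current(flux_wb, i0):
    """Maximum supercurrent 2*i0*|cos(pi*flux/flux_quantum)| of an
    idealized loop with two identical junctions (assumed model)."""
    return 2.0 * i0 * abs(math.cos(math.pi * flux_wb / FLUX_QUANTUM))


def field_for_one_flux_quantum(loop_area_m2):
    """Field (in T) that changes the flux through the loop by one quantum."""
    return FLUX_QUANTUM / loop_area_m2


print(f"flux quantum = {FLUX_QUANTUM:.4e} Wb")
print(f"field per fringe, 1 mm^2 loop = {field_for_one_flux_quantum(1e-6):.2e} T")
for fraction in (0.0, 0.25, 0.5, 0.75, 1.0):
    current = squid_max_current(fraction * FLUX_QUANTUM, 1.0)
    print(f"flux = {fraction:.2f} flux quanta -> max current = {current:.3f} i0")
```

One full cycle from maximum to maximum corresponds to a change of one flux quantum, so for the assumed loop area a single fringe already represents a field change of about 2 × 10⁻⁹ T; resolving a small fraction of a fringe gives the much smaller sensitivities quoted for practical devices.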

In another application, a DC voltage $\Delta V$ is applied across a single Josephson 
junction. This voltage changes the energy of an electron pair on one side of the 
junction by $2e\Delta V$. For example, if region 1 is made more positive than region 3, 
the wave function at the boundary in region 1 would become $\psi_1 = Ae^{i(\phi_1 + 2e\Delta V t/\hbar)}$ 
using the usual procedure (see Eq. 5.6) for including the time dependence of the 
wave function (the factor $e^{-i\omega t}$ with $\omega = E/\hbar = -2e\Delta V/\hbar$). The current through 
the junction then becomes 


$i = i_0 \sin\left(\phi_1 - \phi_3 + \frac{2e\Delta V}{\hbar}\,t\right)$ (11.27) 





Applying a DC voltage to the junction produces an AC current! Because frequencies 
can be measured very precisely, a measurement of the frequency of this AC 
current can be used to determine $\Delta V$. As a result, since 1990 the AC Josephson 
effect has been accepted by the General Conference on Weights and Measures as 
the international standard for the volt. 
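
The angular frequency in Eq. 11.27 is $2e\Delta V/\hbar$, so the AC current oscillates at the frequency $f = 2e\Delta V/h$. The following sketch evaluates this frequency for a few voltages and inverts it to recover a voltage from a measured frequency; the function names are ours, chosen for illustration.

```python
PLANCK_H = 6.62607015e-34            # J*s
ELEMENTARY_CHARGE = 1.602176634e-19  # C


def josephson_frequency_hz(delta_v_volts):
    """Frequency f = 2 e dV / h of the AC Josephson current (from Eq. 11.27)."""
    return 2.0 * ELEMENTARY_CHARGE * delta_v_volts / PLANCK_H


def voltage_from_frequency(frequency_hz):
    """Invert f = 2 e dV / h to obtain the junction voltage in volts."""
    return PLANCK_H * frequency_hz / (2.0 * ELEMENTARY_CHARGE)


for dv in (1e-6, 1e-3):
    print(f"dV = {dv:.0e} V -> f = {josephson_frequency_hz(dv):.4e} Hz")
```

A voltage of only 1 μV corresponds to a frequency of about 484 MHz, which illustrates why a frequency measurement gives such a precise determination of the voltage.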


11.6 INTRINSIC AND IMPURITY SEMICONDUCTORS 





A semiconductor is a material with an energy gap $E_g$ of order 1 eV between the 
valence band and the conduction band. At T = 0, all states in the valence band are 
full and all states in the conduction band are empty; recall that the Fermi-Dirac 
distribution is a step function at T = 0 and gives an occupation probability of 
exactly 1 for all states below $E_F$ and exactly 0 for all states above $E_F$. As the 
temperature is raised, however, some states above $E_F$ are occupied and some states 
below $E_F$ are empty. At room temperature, the relationship between the Fermi 
energy, the valence and conduction bands, and the electron energy distribution 
might be as shown in Figure 11.26. 

Although the value of the room-temperature Fermi-Dirac distribution function 
is nearly zero in the conduction band, it is not exactly zero; Figure 11.37 shows 
a greatly magnified view of $f_{FD}(E)$ near the bottom of the conduction band. The 
value of $E - E_F$ is about 0.5 eV if $E_F$ lies near the middle of the 1 eV energy 
gap, and therefore $E - E_F \gg kT$, because at room temperature $kT \approx 0.025$ eV. 
The 1 in the denominator of the Fermi-Dirac distribution is therefore negligible, 
and $f_{FD}(E)$ is approximately exponential, as shown in Figure 11.37. 

Assuming the Fermi energy to lie near the middle of the gap, the occupation 
probability near the bottom of the conduction band is of order 
$e^{-(E-E_F)/kT} = e^{-20} \approx 10^{-9}$. 
Thus one atom in $10^9$ contributes an electron to the electrical conductivity; 
compare this with a metal in which essentially every atom contributes an electron 
to the conductivity. (On the other hand, consider an insulator, which has a band 
structure very similar to that of a semiconductor, except the energy gap is perhaps 
5 eV instead of 1 eV. This small difference in the size of the energy gap has an 
enormous effect on the occupation probability of the conduction band at room 
temperature: $e^{-E_g/2kT} \approx 10^{-44}$. Thus in a sample containing of order $10^{20}$ atoms, 
there may be $10^{11}$ conduction electrons in a semiconductor, $10^{20}$ in a conductor, 
and none in an insulator.) 
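
These estimates follow directly from the Fermi-Dirac distribution. As a rough numerical check, the sketch below evaluates the occupation probability at the bottom of the conduction band for a 1 eV gap and a 5 eV gap, taking the Fermi energy at midgap and kT = 0.025 eV as in the text; the sample size of $10^{20}$ atoms and the function name are illustrative choices.

```python
import math

KT_ROOM_EV = 0.025  # kT at room temperature, in eV


def fermi_dirac(energy_above_fermi_ev, kt_ev=KT_ROOM_EV):
    """Occupation probability 1 / (exp((E - E_F)/kT) + 1)."""
    return 1.0 / (math.exp(energy_above_fermi_ev / kt_ev) + 1.0)


n_atoms = 1e20  # illustrative sample size
for label, gap_ev in (("semiconductor", 1.0), ("insulator", 5.0)):
    # With E_F at midgap, the conduction band edge lies gap/2 above E_F.
    probability = fermi_dirac(gap_ev / 2.0)
    print(f"{label}: f = {probability:.1e}, "
          f"conduction electrons = {n_atoms * probability:.1e}")
```

The semiconductor comes out at a few times $10^{-9}$ per atom (of order $10^{11}$ electrons in the sample), while the insulator gives a probability of order $10^{-44}$, so that not even one conduction electron is expected.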

Figure 11.38 shows the corresponding region near the top of the valence 
band. If there are a few filled states in the conduction band, there must be a 
few empty states in the valence band, and the Fermi-Dirac distribution is just a 
tiny bit smaller than 1; in fact it is approximately $1 - e^{-(E_F - E)/kT}$. This number 
is about $1 - 10^{-9}$, based on our discussion for the electrons in the conduction 
band. (Because all the electrons in the conduction band came originally from the 
valence band, the number of electrons in the conduction band is exactly equal 
to the number of vacancies in the valence band. The Fermi-Dirac distribution is 
therefore symmetric in the conduction and valence bands, so the Fermi energy 
must lie at the center of the gap.) 

In practice it is much easier to analyze the behavior of a relatively small number 
of vacancies in the valence band rather than the large number of electrons in that 
band. When we apply an electric field to the semiconductor, the electrons in the 
conduction band can move easily, because there are many empty states to move 
into. There are few vacancies in the valence band, however. Under the influence 
of the electric field, an electron in the valence band can move only if there is a 
vacancy nearby for it to move into. When that electron moves into the vacancy, 
it creates another vacancy, which can in turn be filled by another electron. In this 
way, electrons moving in one direction cause an apparent motion of the vacancy 
in the opposite direction. The situation is similar to the motion of cars in a parking 
lot with one vacancy (Figure 11.39). 

These vacancies in the valence band are known as holes, and they behave 
as if they have positive charges. In an electric field, electrons in the conduction 
band acquire a drift velocity in a direction opposite to the field (see Eq. 11.16) 
but (because electrons carry negative charge) they give a current opposite to the 
velocity and thus in the same direction as the field (see Eq. 11.17). The holes in 
the valence band acquire a velocity in the same direction as the field and give a 
current in the same direction as their velocity (that is, in the direction of the field). 
The current due to the electrons in the conduction band is therefore in the same 
direction as the current due to the holes in the valence band. 

The current in a semiconductor therefore consists of two parts: the negatively 
charged electrons in the conduction band and the positively charged holes in the 
valence band. Although the number of electrons in the conduction band is equal to 
the number of holes in the valence band, the two contributions to the current are in 
general not equal, because the electrons in the conduction band move more easily 
than the electrons in the valence band. Typically, the contribution of the electrons 
to the current at room temperature is about two to four times the contribution of 
the holes. 

FIGURE 11.37 The tail of the Fermi-Dirac distribution function near the 
bottom of the conduction band. On the scale of this greatly magnified drawing, 
the 1 of $f_{FD}(E)$ would be about 1000 km off to the right, and $E_F$ is 
about 1 m below the edge of this page. 

FIGURE 11.38 The Fermi-Dirac distribution function near the top of the 
valence band, showing the small fraction of empty states. 

FIGURE 11.39 One car moves to the left, filling the vacancy but creating 
a new vacancy, which is then filled by the next car moving to the left. 
The motion of cars to the left, each filling a vacant space, is equivalent to 
the motion of the vacant space to the right. 















































































FIGURE 11.40 (a) Covalent bonding 
in Si or Ge. Each atom provides four 
electrons for covalent bonds with its 
neighbors. (b) When a Si or Ge atom 
is replaced with a valence-5 atom 
(shaded), there is an extra electron that 
does not participate in covalent bonds. 
(c) If a Si or Ge atom is replaced by a 
valence-3 atom (shaded), one electron 
from a neighboring atom is not paired 
in a covalent bond. 


The material we have been describing thus far is an intrinsic semiconductor and 
is characterized by several features: (1) the number of electrons in the conduction 
band is equal to the number of holes in the valence band; (2) the Fermi energy lies 
at the middle of the gap; (3) the electrons contribute most to the current, but the 
holes are important also; (4) about 1 electron in $10^9$ contributes to the conduction. 

Because only 1 electron in $10^9$ contributes to the conductivity of an intrinsic 
semiconductor, the presence of impurities can significantly alter the conductivity 
of the semiconductor in a way that might not be easily controllable. However, 
if impurities with known properties are deliberately introduced into the semiconductor 
in carefully controlled amounts, their contribution to the conductivity 
can be precisely determined. At impurity levels of only 1 part in $10^6$ or $10^7$, the 
impurity contribution to the conductivity dominates the intrinsic contribution. 

Such materials are known as impurity semiconductors, and the process of 
introducing the impurity is known as doping. Impurity semiconductors can be of 
two varieties: those in which the impurity contributes additional electrons to the 
conduction band and those in which the impurity contributes additional holes to 
the valence band. 

Let us consider a material such as silicon or germanium, in which there are four 
valence electrons in hybrid orbitals. In the band theory view, these fill the 4N states 
of the valence band; in the atomic view the lattice is constructed so that each Ge 
or Si atom has four neighbors with which it shares an electron, and so all electrons 
participate in covalent bonding (Figure 11.40a). Now suppose we replace one of 
the Si or Ge atoms with an atom that has five valence electrons, such as phosphorus, 
arsenic, or antimony. Four of the five electrons form covalent bonds with the 
neighboring Si or Ge atoms, but the fifth electron is relatively weakly bound to 
the impurity atom and can be easily detached to contribute to the conductivity 
(Figure 11.40b). Alternatively, we could replace one of the Si or Ge atoms with 
an atom that has three valence electrons, such as boron, aluminum, gallium, or 
indium. Its three valence electrons form covalent bonds with the neighboring Si 
or Ge (Figure 11.40c), but one of the surrounding atoms has an unpaired electron. 
Completing the four pairs of covalent bonds is energetically very favorable, so an 
electron is easily captured to complete the symmetry of the lattice. This creates a 
hole in the valence band and therefore contributes to the conductivity. 

On an energy-level diagram, the electron energies of these impurity atoms 
appear as discrete levels in the energy gap, either just below the conduction band 
(as in Figure 11.41a), or just above the valence band (as in Figure 11.41b). The 
energy needed for these electrons to enter the conduction band, or for electrons 
from the valence band to fill the low-lying empty states, is relatively small, about 
0.01 eV in Ge and 0.05 eV in Si. As a result, even at room temperature 
($kT \approx 0.025$ eV) these excitations can occur easily. 
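
A simple way to see why these excitations are easy is to compare Boltzmann factors $e^{-\Delta E/kT}$ for the impurity levels with the factor for exciting an electron across half of a 1 eV gap. This is only a crude measure (the actual fraction of ionized impurities also depends on the number of available states), and the function name below is ours.

```python
import math

KT_ROOM_EV = 0.025  # kT at room temperature, in eV


def boltzmann_factor(excitation_energy_ev, kt_ev=KT_ROOM_EV):
    """Relative thermal probability exp(-dE/kT) of an excitation of energy dE."""
    return math.exp(-excitation_energy_ev / kt_ev)


cases = {
    "impurity level in Ge (0.01 eV)": 0.01,
    "impurity level in Si (0.05 eV)": 0.05,
    "intrinsic, midgap to band edge (0.5 eV)": 0.5,
}
for label, energy_ev in cases.items():
    print(f"{label}: exp(-dE/kT) = {boltzmann_factor(energy_ev):.2e}")
```

The impurity excitations have Boltzmann factors of order 0.1 to 1, compared with about $10^{-9}$ for the intrinsic excitation, which is why even a small concentration of impurities can dominate the conductivity.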

The energy levels formed by valence-5 impurities are known as donor states 
and the impurity is known as a donor, because electrons are “donated” to the 
conduction band. A semiconductor that has been doped with donor impurities is 
known as an n-type semiconductor, because the conductivity is due mostly to the 
negative electrons. 

The energy levels formed by valence-3 impurities are known as acceptor states, 
because they can “accept” electrons from the valence band. A material that has 
been doped with acceptor impurities is known as a p-type semiconductor, because 
the conductivity is due mostly to the positively charged holes. (Remember that 
n-type and p-type materials are both electrically neutral because they are made 
from neutral atoms. The designations n and p refer only to the charge carriers, not 
to the material itself. Depending on whether we add or remove electrons, n-type 
and p-type materials can become either negatively or positively charged.) 

FIGURE 11.41 (a) Energy levels of donor states. (b) Energy 
levels of acceptor states. 

FIGURE 11.42 In a semiconductor, the Fermi energy moves toward the 
middle of the gap as the temperature increases. 

At T = 0 the Fermi energy in n-type semiconductors lies between the donor 
states and the conduction band (remember, all states below $E_F$ are full and all 
above $E_F$ are empty; at T = 0 the donor states are all occupied). In p-type 
semiconductors, the Fermi energy at T = 0 lies between the valence band and the 
acceptor states. As the temperature is raised, the thermal excitation of electrons 
from the valence band to the conduction band (as in an intrinsic semiconductor) 
causes the Fermi energy to move toward the center of the energy gap, as shown 
in Figure 11.42. For low doping levels and at a high enough temperature, the 
material may behave like an intrinsic semiconductor. 


11.7 SEMICONDUCTOR DEVICES 


The p-n Junction 


When a p-type semiconductor is placed in contact with an n-type semiconductor 
(Figure 11.43), electrons flow from the n-type material into the p-type material, 
until equilibrium is established. This equilibrium occurs when the Fermi energies 
in the two substances become identical. 

The resulting energy level diagram is shown in Figure 11.44. The region 
between the two materials is known as the depletion region, because it has been 
somewhat depleted of charge carriers. Electrons from the donor states of the 
n-type material fill the holes of the acceptor states of the p-type material. In this 
region the donor states do not provide electrons for the conduction band and the 
acceptor states do not provide holes in the valence band. 

Actually, these devices are not made by bringing two different materials into 
contact, but rather by doping one side of a material so that it becomes n type and 
the other side so that it becomes p type. The doping is carefully controlled, and 
typically depletion layers have a thickness of the order of 1 $\mu$m. 

The excess electrons that have entered the p-type material give that side of the 
depletion region a negative charge, which tends to repel additional electrons from 
the n region. There is a corresponding positive charge in the n region (because 
it has lost electrons to the p region). These charges are associated with the fixed 
ions in those regions; the acceptor atoms in the p region acquire an electron and 
become fixed sites of negative charge, while the donor atoms in the n region lose 
an electron and become fixed sites of positive charge. 

FIGURE 11.44 (a) A p-n junction. The electric field in the depletion 
region inhibits the additional flow of electrons. (b) Energy levels in p-n 
junction. Energetic electrons in the tail of the Fermi-Dirac distribution 
contribute to the diffusion current, while thermal excitation gives an equal and 
opposite drift current. 

In equilibrium, enough negative charge builds up to stop the flow of electrons 
completely. There is a net electric field $E_0$ in the depletion region that results in a 
force (in the opposite direction) on the electrons, preventing any further flow of 
charge. Equivalently, there is a potential difference $\Delta V_0$ between the n-type and 
p-type regions; for electrons to flow from the n region to the p region, they must 
climb the energy barrier of height $e\Delta V_0$. 

In the tail of the Fermi distribution of electrons in the conduction band of the n 
region, there will be a small number of electrons with enough energy to climb the 
energy barrier and enter the p region, where they recombine with holes (that is, 
they “fall” from the conduction band of the p region into the valence band). This 
gives the diffusion or recombination contribution to the current, which is directed 
from the p region to the n region (the current direction always being opposite to 
the direction of electron flow). 

Even though holes provide the dominant contribution to the conduction in 
the p region, there are also electrons that provide a smaller contribution to the 
current. Electrons are thermally excited from the valence band in the p region to 
the conduction band, where they are accelerated by the electric field and travel 
into the n region. This gives the drift or thermal contribution to the current. At 
equilibrium, the two contributions to the current cancel one another, so that the 
net current is zero, as shown in Figure 11.44. 

Let us now apply an external voltage ΔV_ext across the junction so that the 
p-type material is made more positive than the n-type material; that is, we connect 
the + terminal of a battery to the p side of the junction and the − terminal of 
the battery to the n side (Figure 11.45). The effect of the battery is to lower 
the energy hill by an amount eΔV_ext. (The vertical axis shows electron energy, 
and a potential difference of ΔV_ext gives an electron energy of −eΔV_ext.) This 
situation is called a forward voltage or forward biasing. The forward bias causes 
the depletion region to become narrower, because the battery pulls electrons out 
of the p region and injects them back into the n region. Because the energy hill 
is lower, more electrons can diffuse from the n region into the p region, so the 
diffusion current is considerably increased. (That is, there are more electrons in 
the n region in the tail of the Fermi distribution with energies above the bottom 
of the conduction band of the p region.) The drift current, however, is unaffected 
by the presence of the battery or the height of the hill. There is now a net current 
through the junction in the forward direction. 

Now we reverse the battery connections (Figure 11.46), a situation known as 
reverse voltage or reverse biasing. This raises the hill by the amount eΔV_ext, 
widens the depletion region (because the battery pulls more electrons from the n 
region and injects them into the p region), and decreases the diffusion current. 
The drift current is again unchanged, so now there is a relatively small net current 
in the reverse direction. 

Figure 11.47 shows the upper tail of the Fermi-Dirac distribution of the 
electrons extending into the conduction band of the n-type region. Only those 
electrons in the portion of the tail above the energy E_c of the bottom of the 
conduction band of the p-type region can flow back across into the p-type region, 
and it is these electrons that produce the diffusion current. The number of electrons 
in that tail above the energy E_c is approximately 

N_1 = n e^{−(E_c − E_F)/kT}    (11.28) 

FIGURE 11.46 (a) A reverse-biased p-n junction. (b) The energy level diagram. 
The potential energy hill is larger, the diffusion current is smaller, and there is 
a small reverse current. 


FIGURE 11.45 (a) A forward-biased 
p-n junction. (b) The energy level 
diagram. The potential energy hill is 
smaller, the diffusion current is larger, 
and there is a net forward current. 


where n is some proportionality factor, and where we have approximated the 
Fermi-Dirac function as an exponential by neglecting the 1 in the denominator. 
(Because E_c − E_F ~ 1 eV and kT ≈ 0.025 eV, this is an excellent approximation.) 
The diffusion current is proportional to N_1, and because the drift and diffusion 
currents are equal in equilibrium, the drift current is also proportional to N_1. 
Applying ΔV_ext changes the level E_c to E_c − eΔV_ext, and the number of electrons 
in the tail above E_c − eΔV_ext is 

N_2 = n e^{−(E_c − eΔV_ext − E_F)/kT}    (11.29) 

The diffusion current is now proportional to N_2; applying the bias did not change 
the drift current, so it is still proportional to N_1. The net current is given by the 
difference: 

i ∝ N_2 − N_1 = n e^{−(E_c − E_F)/kT} (e^{eΔV_ext/kT} − 1)    (11.30) 

We can rewrite this expression as 

i = i_0 (e^{eΔV_ext/kT} − 1)    (11.31) 


This function is plotted in Figure 11.48, and it is immediately obvious why such 
p-n junctions, also known as diodes, have the property of rectifying varying 
currents. When the applied voltage is such that the junction is forward biased, a 
large forward current can flow. (When ΔV_ext = 1 V, i ≈ 2 × 10^17 i_0.) When the 
applied voltage is such that the junction is reverse biased, only a very small current 
can flow. (When ΔV_ext = −1 V, i ≈ −i_0.) Even very small forward voltages can 
produce large forward currents; even very large reverse voltages can produce only 
small reverse currents. 

FIGURE 11.47 The diffusion current depends on the number of electron states in 
the tail of the Fermi-Dirac distribution above the energy E_c of the bottom of the 
conduction band in the p-type material. 

FIGURE 11.48 Current-voltage characteristics of an ideal p-n junction. 

FIGURE 11.49 Energy level diagram of a p-n junction under heavy doping. 
Electrons can tunnel across the narrow gap. 

FIGURE 11.50 Current-voltage characteristics of a tunnel diode. The dashed curve 
shows the characteristics of an ordinary p-n junction diode. 

FIGURE 11.51 Energy bands in a diode laser. The active region has a smaller gap 
than the n-type and p-type regions on either side. 
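
As a numerical check of Eq. 11.31 and of the values quoted for ΔV_ext = ±1 V, the following minimal Python sketch evaluates the ratio i/i_0. The function name `diode_current_ratio` and its default argument are illustrative choices, and kT = 0.025 eV is the room-temperature value used in the text.

```python
import math

def diode_current_ratio(delta_v_ext, kt_ev=0.025):
    """Return i/i0 = exp(e*dV_ext/kT) - 1 for an ideal p-n junction (Eq. 11.31).

    delta_v_ext is the applied voltage in volts (positive = forward bias);
    kt_ev is the thermal energy kT expressed in eV, so e*dV/kT = dV/kt_ev.
    """
    # expm1 keeps precision when the exponent is small
    return math.expm1(delta_v_ext / kt_ev)

forward = diode_current_ratio(+1.0)   # forward bias of 1 V
reverse = diode_current_ratio(-1.0)   # reverse bias of 1 V

print(f"forward bias +1 V: i/i0 = {forward:.2e}")
print(f"reverse bias -1 V: i/i0 = {reverse:.6f}")
```

The forward ratio comes out of order 10^17 while the reverse ratio saturates at essentially −1, which is the rectifying behavior described above.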


The Tunnel Diode 


When the p and n regions are very heavily doped, the depletion layer becomes much 
narrower, perhaps 10 nm, and the energy diagram might look like Figure 11.49. 
When a small forward bias is applied, there is now a third contribution to the 
current—an electron from the conduction band of the n region can “tunnel” 
through the forbidden region directly into the valence band of the p region. This 
process of course depends on the wave nature of the electron and is an example of 
the type of barrier penetration we have discussed previously, in Section 5.6. The 
narrow depletion layer makes the process possible. The wavelength of an electron 
near the Fermi surface is about 1 nm, and if the thickness of the depletion layer were 
many orders of magnitude larger than this, tunneling would be unlikely to occur. 
As the forward voltage is increased, the potential hill is lowered, and soon it is no 
longer possible for an electron to tunnel directly through the forbidden 
region. For a voltage of a few tenths of a volt, the tunneling current becomes 
zero. At this point the tunnel diode behaves like an ordinary diode. Figure 11.50 
illustrates the characteristic current-voltage relationship for a tunnel diode. 
Tunnel diodes are useful in electric circuits as high-speed elements, because 
the characteristics of the device can change as rapidly as the bias voltage can be 
changed. They can also be used as switches. If we were to pass current through the 
tunnel diode so that we were on the peak of the characteristic curve, a small increase 
in the current would cause the voltage to jump suddenly to a much larger value. 


Photodiodes 


A photodiode is a p-n junction whose operation involves the emission or absorption 
of light. These devices operate on principles similar to those of ordinary atoms. An 
electron in the valence band can absorb a photon and make a transition to the conduction 
band. Photons of visible light have energies of order 2 to 3 eV, so a semiconductor 
with its gap of order 1 eV is just right for such a transition. Conversely, an excited 
electron from the conduction band can drop back down to the valence band, 
emitting a photon in the process. 

A common device that emits visible light is the LED, or light-emitting diode. 
An external current supplies the energy necessary to excite electrons to the 
conduction band, and when the electrons fall back down to recombine with holes, 
a photon is emitted. The energy is of course equal to the difference in energy of the 
electronic states. By varying the chemical composition, it is possible to produce 
LEDs emitting any color of visible light. LEDs find wide use as indicator lights and 
in video displays, including televisions and computer monitors. Broad-spectrum 
LEDs emitting white light are used in environmental lighting. 

It is possible for photodiodes to operate in reverse, in which an incoming 
photon is absorbed in the depletion region and produces an electron-hole pair. An 
electric field sweeps up the electron-hole pairs and produces an electric signal. 
Such devices are used to produce electric current (as in a silicon solar cell) or to 
count photons (as in light meters for cameras or detectors of X rays or gamma 
rays in space probes). 

Figure 11.51 shows another application of the emission of light by a semi- 
conductor, in this case a diode laser or semiconductor laser. A thin layer of 
semiconducting material is sandwiched between n-type and p-type regions having 
a slightly larger energy gap. Electrons are injected from an external circuit into the 
n-type material, from which they diffuse into the middle layer. The electrons are 
prevented from diffusing into the p-type layer by a potential barrier, so they tend 
to concentrate in the middle layer. In a similar fashion, holes are injected into the 
p-type layer and again concentrate in the middle layer. This creates a population 
inversion similar to that described in our discussion of the laser in Section 8.7. An 
electron drops into the valence band accompanied by the emission of a photon; 
that photon then induces other transitions leading to the avalanche of photons that 
gives the lasing action. 

The physical construction of a typical diode laser is illustrated in Figure 11.52. 
The lasing material is a narrow (0.2 μm) layer of GaAs, and the p-type and n-type 
layers are GaAlAs a few μm in thickness. The ends of the material are cleaved 
to create mirrorlike surfaces that reflect a portion of the light wave, enhancing 
stimulated emission in the active region. This device emits at a wavelength of 
840 nm, in the near infrared region. Diode lasers at this wavelength are commonly 
used in communication to send signals along optical fibers. By varying the 
materials of the laser, it is possible to obtain visible radiation in almost any color. 
Diode lasers find common use in bar-code scanners and in players for CDs and 
DVDs. There are also many medical uses, for example in laser surgery. 
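
The link between emission wavelength and transition energy can be made concrete with a short Python sketch. It converts the 840 nm wavelength quoted above into a photon energy and maps the 2 to 3 eV visible range back to wavelengths. The helper names are illustrative, and the constant hc ≈ 1239.84 eV·nm is a standard value supplied here rather than taken from this section.

```python
HC_EV_NM = 1239.84  # h*c in eV·nm (standard value, assumed for this sketch)

def photon_energy_ev(wavelength_nm):
    """Photon energy in eV for a wavelength given in nm (E = hc/lambda)."""
    return HC_EV_NM / wavelength_nm

def photon_wavelength_nm(energy_ev):
    """Photon wavelength in nm for an energy given in eV (lambda = hc/E)."""
    return HC_EV_NM / energy_ev

e_gaas = photon_energy_ev(840.0)           # GaAs diode laser line quoted in the text
visible_red = photon_wavelength_nm(2.0)    # low-energy end of the visible range
visible_violet = photon_wavelength_nm(3.0) # high-energy end of the visible range

print(f"840 nm photon: {e_gaas:.3f} eV")
print(f"2 eV to 3 eV photons: {visible_red:.0f} nm to {visible_violet:.0f} nm")
```

The 840 nm line corresponds to a transition energy of roughly 1.5 eV, below the visible range, which is consistent with the near-infrared emission described above.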

Diode lasers are of small size, and they consume very little power (typically 
10 mW, compared with the standard HeNe laser that may consume several watts). 
As a result, diode lasers can be powered by ordinary batteries. The light signal can 
be turned on or off in switching times that are characteristic of semiconductors 
(< 100 ps), and thus we have a device that can rapidly modulate the beam. Even 
though conventional lasers are capable of performing many of the same functions 
as diode lasers, the small size, low cost, low power consumption, and rapid switching 
times make diode lasers the superior choice for most low-power applications. 

FIGURE 11.52 A diode laser. The lasing action occurs in the thin GaAs layer. 


11.8 MAGNETIC MATERIALS 


Our final example of the application of quantum physics to solids concerns 
the magnetic behavior of materials. The magnetic susceptibility of atoms was 
discussed briefly in Section 8.4 (see Figure 8.10). Most atoms have permanent 
magnetic dipole moments, due either to the spin or orbital angular momentum 
of the electrons (or both). Ordinarily, these magnetic moments point in random 
directions, so the net total magnetic dipole moment of a sample of the material 
is zero. However, when a magnetic field is applied, the magnetic moments rotate 
into partial or full alignment with the applied field, and the vector sum of the 
dipole moments gives the material a net magnetization. Specifically, the total 
magnetization M is defined as the sum of all the individual atomic magnetic 
dipole moments μ_i per unit volume: 

M = (1/V) Σ_i μ_i    (11.32) 

where the sum is carried out over all the N individual particles (atoms, for 
example) in the material. 

Over a fairly wide range of applied magnetic fields, in many materials we 
find that the net magnetization is directly proportional to the applied field B_app. 
That is, the stronger the applied field, the more the individual magnetic moments 
rotate into alignment with the field. (Clearly this proportionality cannot continue 
indefinitely as the field strength increases, because eventually all dipoles will be 
aligned with the field and further increases in the field will have little or no effect.) 

FIGURE 11.53 The dashed lines show the number of electrons in the Fermi-Dirac 
distribution drawn separately for spin up (top) and spin down (bottom). In an 
applied magnetic field, the energies of the spin-up electrons increase and the 
energies of the spin-down electrons decrease. To maintain the same Fermi energy 
with the field on, the spin-up electrons in the shaded strip flip their spins and 
move into the equivalent area with spin down. 


The proportionality constant between the magnetization and the applied field 
is called the magnetic susceptibility χ: 

μ_0 M = χ B_app    (11.33) 

For many materials, χ is small (10^-5 to 10^-1) and positive. These materials are 
called paramagnetic. In other materials, χ is observed to be negative (that is, 
the direction of the net magnetization is opposite to the direction of the applied 
field). These materials, which are called diamagnetic, usually have no permanent 
atomic magnetic moments, often because they have only paired electrons (as 
for example the inert gases); their magnetic behavior is due to a slight change 
in the orbital motion of the atomic electrons in response to the applied field. 
Diamagnetism is ordinarily a very weak effect, with susceptibilities in the range 
of −10^-5 to −10^-4. (The effect responsible for diamagnetism, the alteration of the 
orbital motion of the electrons, can also occur in paramagnetic materials, but it is 
generally much weaker than the paramagnetism. However in some materials, such 
as copper, the paramagnetism is so weak that the diamagnetism is dominant.) In 
yet other materials, the magnetization remains after the applied field is removed. 
These include the ferromagnetic substances, and the susceptibility is undefined 
for ferromagnets. 

Magnetic effects are often strongly temperature dependent. In fact materials 
can change from ferromagnetic to paramagnetic as the temperature is increased. 


Paramagnetism of Electron Gas 


Let’s begin by investigating the magnetic behavior of an electron gas. When 
we apply a magnetic field to an electron gas, the energy of an electron in the 
field is E = −μ_s·B_app, where μ_s = −(e/m)s is the spin magnetic moment of 
the electron. The energy of the interaction of an electron with the field is then 
(assuming the field is in the z direction) 

E = −μ_s·B_app = (e/m) s·B_app = (e/m) s_z B_app = (eħ/m) m_s B_app = ±μ_B B_app    (11.34) 

with m_s = ±1/2. The symbol μ_B represents the Bohr magneton eħ/2m (see 
Eq. 7.23). The electrons with m_s = +1/2 gain energy μ_B B_app and those with 
m_s = −1/2 lose an equal amount of energy. 


Figure 11.53 shows the Fermi-Dirac distribution of populated electron states 
separately for the spin-up and spin-down electrons (half of the electrons are in 
each group). All of the spin-up electrons move up in energy by μ_B B_app and all of 
the spin-down electrons move down by the same amount. Because the two groups of 
electrons are in contact, the higher-energy electrons in the shaded region with spin 
up flip their spins and fill the vacant energy states in the spin-down group until 
the two groups equalize their Fermi energies. As a result, there is an excess of 
electrons with spin down in a strip of width ΔE = 2μ_B B_app. The number of electrons 
in the strip is ΔN = (1/2) V g(E) f_FD(E) ΔE, where the factor of 1/2 comes from the 
fact that the spin-up and spin-down distributions each have 1/2 of the total number 
of electrons. The drawing of Figure 11.53 has been exaggerated; the strips are very 
narrow compared with E_F, and we assume that we are at a reasonably low temperature 
in which the Fermi-Dirac distribution is fairly sharp, so we can evaluate the density 
of states at E = E_F and take f_FD = 1: 

ΔN = N(↓) − N(↑) = V g(E_F) μ_B B_app    (11.35) 

From Eq. 11.32, the z component of the magnetization is M = V^-1 Σ μ_{i,z} = V^-1 μ_B ΔN, 
so 

χ = μ_0 M / B_app = μ_0 μ_B^2 g(E_F) = (3 μ_0 μ_B^2 / 2E_F)(N/V)    (11.36) 

using g(E_F) = 3N/2VE_F. This calculation was first done by Wolfgang Pauli, and 
the result is often called the Pauli paramagnetic susceptibility. Note that N/V in 
this equation is the number of free electrons per unit volume, which might differ 
from the number of atoms per unit volume in materials in which there is more 
than one valence electron per atom. 

Equation 11.36 can be used to calculate values for the susceptibility of 
materials in which the electrons behave like a gas of fermions. However, 
comparing calculated susceptibility values with experimental values can often be 
challenging because of the many different units that are used. On the surface, 
it appears that the susceptibility is a dimensionless quantity, but in fact it can 
be calculated in many different ways: susceptibility per unit volume (as in 
Eq. 11.36) or per unit mass or per mole. Furthermore, it can be expressed in 
either cgs units or SI units (our choice). The tabulated values in the literature are 
often given as the molar susceptibility in cgs units. The conversion procedure is 
χ_volume^SI = 4π χ_volume^cgs = 4π(ρ/M) χ_molar^cgs, where ρ is the density of the solid and M 
is its molar mass (don’t confuse this with the magnetization!). With the conversion 
written in this way, ρ and M must be in cgs units. 

With that warning in mind, let’s look at how Eq. 11.36 compares with 
experiment. Table 11.7 shows a few calculated and measured values. The 
agreement is surprisingly good, particularly in view of our omitting a number of 
important effects from the calculation, including a diamagnetic correction. For 
many metals, the diamagnetic contribution is larger than the Pauli paramagnetism, 
and as a result their susceptibility is negative (copper and gold are examples). 
The diamagnetic contribution is important because the Pauli susceptibility is so 
small (only a small fraction of the electrons near the Fermi energy contribute). 
Metal ions in salts typically have paramagnetic susceptibilities that can be several 
orders of magnitude larger, as we discuss next. 

It is also interesting to note that Eq. 11.36 predicts that the susceptibility of 
these substances should be independent of temperature. Raising the temperature 
should broaden the distribution of both the spin-up and spin-down distributions 
(Figure 11.53) by about the same amount, making a negligible contribution to 
the susceptibility. Indeed, the susceptibility of these metals has only a very weak 
dependence on temperature. In sodium, for example, the susceptibility changes 
by only a few percent between room temperature (300 K) and liquid helium 
temperature (4 K). 


TABLE 11.7 Pauli Magnetic Susceptibility of Some Solids 

           Experimental susceptibility (× 10^-6)    Calculated susceptibility 
Element    cgs, per mole      SI, per volume        (× 10^-6) 
Al         16.5               20.7                  15.7 
K          20.8               [illegible]           6.2 
Mg         13.1               11.8                  12.2 
Na         16                 8.5                   8.2 

Example 11.5 

(a) Compute the Pauli paramagnetic susceptibility of Mg. 
(b) The experimental value of the molar susceptibility is 
13.1 × 10^-6 in cgs units. What is the value of the volume 
susceptibility in SI units? 

Solution 

(a) For Mg (which has valence 2), 

N/V = 2ρN_A/M 
    = 2(1.74 × 10^3 kg/m^3)(6.02 × 10^23 atoms/mole) / (0.0243 kg/mole) 
    = 8.62 × 10^28 m^-3 

The susceptibility is 

χ = (3 μ_0 μ_B^2 / 2E_F)(N/V) 
  = 3(4π × 10^-7 T·m/A)(9.27 × 10^-24 J/T)^2 (8.62 × 10^28 m^-3) / [2(7.13 eV)(1.602 × 10^-19 J/eV)] 
  = 12.2 × 10^-6 

To see how the units cancel, it is helpful to realize that 
the magnetic moment has units of either J/T or A·m^2 (the 
latter coming from the definition μ = iA for the magnetic 
moment of a current i in a loop of area A). 

(b) χ_volume^SI = 4π (ρ/M) χ_molar^cgs 
               = 4π (1.74 g/cm^3 / 24.3 g/mole)(13.1 × 10^-6) 
               = 11.8 × 10^-6 

Note that the density and molar mass are in cgs units for 
the conversion. 
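
The arithmetic of Example 11.5 can be reproduced with a short Python sketch of Eq. 11.36 and of the cgs-to-SI conversion. The function names are illustrative and not from the text; the physical constants are standard rounded values, and the Mg inputs (density, molar mass, valence 2, Fermi energy 7.13 eV) are the ones used in the example.

```python
import math

MU_0 = 4e-7 * math.pi   # permeability of free space, T·m/A
MU_B = 9.274e-24        # Bohr magneton, J/T
N_A = 6.022e23          # Avogadro's number, per mole
EV = 1.602e-19          # joules per electron-volt

def electron_density(valence, rho_kg_m3, molar_mass_kg):
    """Free-electron number density N/V in m^-3."""
    return valence * rho_kg_m3 * N_A / molar_mass_kg

def pauli_susceptibility(n_per_m3, e_fermi_ev):
    """Dimensionless SI volume susceptibility from Eq. 11.36."""
    return 3 * MU_0 * MU_B**2 * n_per_m3 / (2 * e_fermi_ev * EV)

def cgs_molar_to_si_volume(chi_molar_cgs, rho_g_cm3, molar_mass_g):
    """Convert a cgs molar susceptibility to an SI volume susceptibility."""
    return 4 * math.pi * (rho_g_cm3 / molar_mass_g) * chi_molar_cgs

n_mg = electron_density(2, 1.74e3, 0.0243)              # part (a): N/V for Mg
chi_calc = pauli_susceptibility(n_mg, 7.13)             # part (a): Eq. 11.36
chi_exp = cgs_molar_to_si_volume(13.1e-6, 1.74, 24.3)   # part (b): conversion

print(f"N/V        = {n_mg:.3e} m^-3")
print(f"chi (calc) = {chi_calc * 1e6:.1f} x 10^-6")
print(f"chi (exp)  = {chi_exp * 1e6:.1f} x 10^-6")
```

Swapping in the valence, density, molar mass, and Fermi energy of another metal gives the corresponding entry of the calculated column, subject to the free-electron assumptions behind Eq. 11.36.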





Paramagnetism of Atoms and Ions 


Instead of the free electrons, let’s consider the contribution of the atoms or ions 
to the paramagnetism. We’ll represent the effective electronic spin of each atom 
by J, which might represent the total intrinsic spin S of the electrons in the atom, 
their total orbital angular momentum L, or a combination of both (the nuclear 
spin is excluded, because nuclear magnetic effects are negligible compared with 
electronic magnetic effects). Depending on the number of electrons in the atom, 
J might be integral or half-integral. 

The angular momentum J has all of the usual properties of quantum angular 
momentum. It has z component J_z = m_J ħ, where m_J runs from −J to +J in integer 
steps. For any J, there are 2J + 1 possible values of m_J. For example, if J = 3/2, 
then m_J = −3/2, −1/2, +1/2, +3/2. Associated with this angular momentum there 
is an effective magnetic moment μ = −g_J μ_B J/ħ, where g_J is a dimensionless factor 
of order unity that describes how the spin and orbital angular momenta combine 
to give J. (The minus sign is present because electrons have negative charge.) 
Assuming the magnetic field defines the z direction, the energy of interaction of 
this magnetic moment with the applied field is E = −μ·B_app = g_J μ_B m_J B_app. 

We’ll treat the atoms or ions as if they are independent of one another, so that 
they can be described by Maxwell-Boltzmann statistics. Because the magnetic 
substates m_J are nondegenerate, we can write the number of atoms in each 
magnetic substate in the form of Eq. 10.4: 

N_{m_J} = A^-1 e^{−E/kT} = A^-1 e^{−g_J μ_B m_J B_app/kT} 

where A^-1 is the normalization constant for the Maxwell-Boltzmann distribution. 
The normalization condition requires that N be the total number of atoms in all 
substates: 

N = Σ_{m_J=−J}^{+J} N_{m_J} = A^-1 Σ_{m_J=−J}^{+J} e^{−g_J μ_B m_J B_app/kT} 

We therefore have: 

N_{m_J} = N e^{−g_J μ_B m_J B_app/kT} / Σ_{m_J=−J}^{+J} e^{−g_J μ_B m_J B_app/kT}    (11.37) 

The z component of the magnetization is then 

M = V^-1 Σ_{all atoms} μ_z = −V^-1 Σ_{m_J=−J}^{+J} N_{m_J} g_J μ_B m_J 

  = −N V^-1 g_J μ_B [Σ_{m_J=−J}^{+J} m_J e^{−g_J μ_B m_J B_app/kT}] / [Σ_{m_J=−J}^{+J} e^{−g_J μ_B m_J B_app/kT}]    (11.38) 

The magnetic susceptibility is 

χ = μ_0 M / B_app 
  = −μ_0 N V^-1 g_J μ_B [Σ_{m_J=−J}^{+J} m_J e^{−g_J μ_B m_J B_app/kT}] / [B_app Σ_{m_J=−J}^{+J} e^{−g_J μ_B m_J B_app/kT}]    (11.39) 


Let’s examine the form of χ for the simplest case in which J = 1/2, so that 
we have only two terms in the sums (m_J = −1/2 and +1/2). Equation 11.39 
becomes 

χ = −μ_0 N g_J μ_B [(−1/2) e^{+g_J μ_B B_app/2kT} + (+1/2) e^{−g_J μ_B B_app/2kT}] / [V B_app (e^{+g_J μ_B B_app/2kT} + e^{−g_J μ_B B_app/2kT})] 

  = (μ_0 N g_J μ_B / 2V B_app) tanh(g_J μ_B B_app/2kT)    (11.40) 


Figure 11.54 shows the susceptibility of a spin-1/2 atom as a function of 1/T, so the 
high-temperature region is on the left side of the graph and the low-temperature 
region is on the right side. In the low-temperature region, the magnetic moments 
become fully aligned, and neither additional cooling nor an increase in the applied 
field can change the magnetization. In the high-temperature region, the graph is 
nearly a straight line, indicating that the susceptibility is linear in 1/T. (How high 
must the temperature be to observe this linear behavior? For a fairly large field of 
1 T the quantity μ_B B_app is about 0.060 meV. When T is 10 K, kT is 0.862 meV. 
Thus even with moderately large fields, “high temperature” for this discussion 
means anything above about 10 K. At low fields the “high temperature region” 
might reach all the way down to 1 K.) 
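
To make the "how high is high temperature" estimate concrete, the sketch below compares the tanh form of the spin-1/2 susceptibility with its small-argument (linear in 1/T) limit. The ratio of the two is tanh(y)/y with y = g_J μ_B B_app/2kT. The function name `saturation_factor` is illustrative, g_J = 2 (pure spin) is an assumption made for this sketch, and the constants are standard values in eV units.

```python
import math

MU_B_EV = 5.788e-5   # Bohr magneton in eV/T
K_B_EV = 8.617e-5    # Boltzmann constant in eV/K

def saturation_factor(b_app, temperature, g_j=2.0):
    """Ratio of the tanh form of chi (spin 1/2) to its linear high-T limit.

    A value close to 1 means the susceptibility is still linear in 1/T;
    a value well below 1 means the moments are approaching full alignment.
    g_j = 2.0 is an assumed pure-spin value.
    """
    y = g_j * MU_B_EV * b_app / (2 * K_B_EV * temperature)
    return math.tanh(y) / y

for t in (300.0, 10.0, 1.0):
    ratio = saturation_factor(1.0, t)   # applied field of 1 T
    print(f"T = {t:6.1f} K: mu_B*B/kT = {MU_B_EV / (K_B_EV * t):.4f}, "
          f"exact/linear = {ratio:.4f}")
```

With these constants μ_B B_app at 1 T is about 0.058 meV, close to the rounded value quoted above. At 10 K the deviation from linearity is well under one percent, while at 1 K it exceeds ten percent.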

In the high-temperature region, the exponents in Eq. 11.39 are small and we 
can use the approximation e^x ≈ 1 + x for small x (with x = −g_J μ_B m_J B_app/kT): 

χ = −μ_0 N V^-1 g_J μ_B [Σ_{m_J=−J}^{+J} m_J (1 − g_J μ_B m_J B_app/kT)] / [B_app Σ_{m_J=−J}^{+J} (1 − g_J μ_B m_J B_app/kT)]    (11.41) 

FIGURE 11.54 Magnetic susceptibility as a function of inverse temperature 
for a spin-1/2 material. Note the linear behavior at high temperature. 

All quantities other than m_J come out of the sums, leaving us three summations 
to evaluate: 

Σ_{m_J=−J}^{+J} 1 = 2J + 1,    Σ_{m_J=−J}^{+J} m_J = 0,    Σ_{m_J=−J}^{+J} m_J^2 = (1/3) J(J + 1)(2J + 1)    (11.42) 

After making these substitutions and clearing terms, the result is 

χ = μ_0 N g_J^2 μ_B^2 J(J + 1) / (3VkT)    (11.43) 

This shows the linear dependence of the susceptibility on 1/T, as we observed 
in the high-temperature region of Figure 11.54. This result is known as Curie’s 
law*, which is often expressed in the form χ = C/T where C (the combination 
of factors in Eq. 11.43) is called the Curie constant. (It is not a true constant, as 
it takes different values for different substances, but it is constant for any single 
substance.) Figure 11.55 shows the susceptibility of the paramagnetic salt copper 
sulfate, which clearly shows the expected linear behavior down to a temperature 
of about 14 K. 

FIGURE 11.55 Susceptibility of paramagnetic copper sulfate (CuSO_4 · 5H_2O) 
between room temperature and 14 K. (Horizontal axis: 1/T in K^-1, from 0 to 0.08.) 

Example 11.6 

The slope of the graph in Figure 11.55 is 0.0499 K. Assuming 
the paramagnetism resides with the copper ions, find 
the value of the quantity g_J^2 J(J + 1) for copper sulfate. 

Solution 

We first need the value of N/V, which requires the density 
(2.28 × 10^3 kg/m^3) and molar mass (0.250 kg/mole) 
of copper sulfate: 

N/V = ρN_A/M 
    = (2.28 × 10^3 kg/m^3)(6.02 × 10^23 atoms/mole) / (0.250 kg/mole) 
    = 5.49 × 10^27 atoms/m^3 

Because each molecule of copper sulfate has only one Cu 
ion, this number also gives us the density of Cu ions. We 
then have 

C = (μ_0 N μ_B^2 / 3Vk) g_J^2 J(J + 1) 
  = (4π × 10^-7 T·m/A)(5.49 × 10^27 m^-3)(9.27 × 10^-24 J/T)^2 g_J^2 J(J + 1) × [3(1.38 × 10^-23 J/K)]^-1 
  = (0.0143 K) g_J^2 J(J + 1) = 0.0499 K 

As a result, g_J^2 J(J + 1) = 3.49. Such experiments give us 
important information about the behavior of ions in crystals 
that is otherwise difficult to obtain. In this case we can learn 
about the value of the effective spin J that describes the 
copper ion. 
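
The numbers in Example 11.6 can be checked with a minimal Python sketch of the Curie constant from Eq. 11.43. The function name `curie_coefficient` is illustrative; the constants are the rounded values used in the example, and the slope 0.0499 K is the value read from Figure 11.55.

```python
import math

MU_0 = 4e-7 * math.pi   # permeability of free space, T·m/A
MU_B = 9.27e-24         # Bohr magneton, J/T
K_B = 1.38e-23          # Boltzmann constant, J/K
N_A = 6.02e23           # Avogadro's number, per mole

def curie_coefficient(n_per_m3):
    """Return mu_0 * (N/V) * mu_B^2 / (3k) in kelvin.

    Multiplying this by g_J^2 J(J+1) gives the Curie constant C of Eq. 11.43.
    """
    return MU_0 * n_per_m3 * MU_B**2 / (3 * K_B)

n_cu = 2.28e3 * N_A / 0.250       # Cu ions per m^3 in copper sulfate
coeff = curie_coefficient(n_cu)   # prefactor of g_J^2 J(J+1), in K
g2_jj1 = 0.0499 / coeff           # measured slope divided by the prefactor

print(f"N/V          = {n_cu:.3e} m^-3")
print(f"prefactor    = {coeff:.4f} K")
print(f"g_J^2 J(J+1) = {g2_jj1:.2f}")
```

Carrying full precision through the division gives a value within about 0.01 of the 3.49 quoted in the example, which rounds the prefactor to 0.0143 K before dividing.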





In copper sulfate, the divalent copper ion Cu^2+ has the outer electronic 
configuration 3d^9 (recall that neutral Cu atoms have the configuration 3d^10 4s^1). 
According to the rules for finding the total S and L for atoms (see Section 8.6), the 
nine 3d electrons in Cu^2+ have S = 1/2 (the maximum possible value of the total 
M_S) and L = 2 (the maximum possible resulting value of the total M_L). However, 
the measured value of g_J^2 J(J + 1) of 3.49 from Example 11.6 is more consistent 
with the configuration S = 1/2, L = 0 (for which its value would be 3.00), than 
it is with any configuration involving S = 1/2 and L = 2 (which give values of 
either 2.40 or 12.60). 

*Physicist Pierre Curie (1859-1906) was the husband of Marie Curie. Together they shared the 1903 
Nobel Prize in physics for their research on radioactivity. Pierre Curie also contributed to the study of 
magnetism. 

This is a common observation for paramagnetic crystals involving the transition 
metals (those in which the 3d shell is filling). In a magnetic field, which has a 
single preferred direction (usually taken to be the z direction), the component of 
the orbital angular momentum L_z along that direction is fixed, while the other 
components (L_x and L_y) average to zero. In a crystal, however, there is a strong 
electric field that may have three equivalent directions, and all three components 
of L might average to zero. As a result, the ion behaves as if it has L = 0 (we 
say that the orbital angular momentum is “quenched”), and so only the total S 
contributes to the magnetic moment. In contrast, the rare earth elements also form 
many paramagnetic crystals, but both L and S contribute to J. In the rare earths, 
the electrons in the unfilled 4f shell are shielded by the filled 5s and 5p shells, 
which have the strongest interactions with the electric field of the crystal (because 
the average radii of their orbits are larger). The “inner” 4f electrons are not much 
affected by the electric field of the crystal and so they contribute their large orbital 
angular momentum to the total J of the ion. 
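
The three candidate values of g_J^2 J(J + 1) quoted above for the copper ion (3.00, 2.40, and 12.60) can be reproduced with a short Python sketch. It assumes the standard Landé expression for g_J, which is not written out in this section and is supplied here as an assumption; the function names are illustrative.

```python
def lande_g(s, l, j):
    """Lande g factor for total spin s, orbital l, and total j (assumed standard form)."""
    return 1 + (j * (j + 1) + s * (s + 1) - l * (l + 1)) / (2 * j * (j + 1))

def g2_jj1(s, l, j):
    """The combination g_J^2 J(J+1) that enters the Curie constant (Eq. 11.43)."""
    return lande_g(s, l, j) ** 2 * j * (j + 1)

# Candidate configurations for the copper ion: (S, L, J)
candidates = {
    "S=1/2, L=0, J=1/2 (quenched)": (0.5, 0, 0.5),
    "S=1/2, L=2, J=3/2": (0.5, 2, 1.5),
    "S=1/2, L=2, J=5/2": (0.5, 2, 2.5),
}

for label, (s, l, j) in candidates.items():
    print(f"{label}: g_J = {lande_g(s, l, j):.2f}, "
          f"g_J^2 J(J+1) = {g2_jj1(s, l, j):.2f}")
```

Of the three, only the quenched configuration lies near the measured 3.49, in line with the argument that only the total S contributes in this crystal.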


Ferromagnetism 


In some materials, the individual magnetic dipoles of the ions can align with 
one another even in the absence of an external magnetic field. In this case a 
net magnetization M is present when B_app is zero, so clearly the susceptibility 
is undefined for these materials. The most familiar example of this behavior is 
ferromagnetism, in which the neighboring dipoles all align in the same direction. 
(It is also possible to have antiferromagnetic materials, in which the neighboring 
dipoles orient in opposite directions.) 

It is perhaps tempting at first to think of the interaction responsible for 
ferromagnetism as a result of the magnetic field due to one dipole exerting 
a magnetic force to align the neighboring dipole. However, this dipole-dipole 
interaction is far too weak to account for the strong coupling between neighbors 
that produces ferromagnetic alignment. Instead, the effect results from the overlap 
of the wave functions of the electrons in neighboring atoms, in a manner similar 
to covalent bonding but depending on the spins of the electrons. This effect is very 
sensitive to the interaction between the spins and also to the distance between the 
neighboring ions. 

In Fe, Co, and Ni the value of the atomic spacing results in an energy minimum 
for the parallel orientation of neighboring spins, and so these materials are 
ferromagnetic at room temperature (but they become paramagnetic at a sufficiently 
high temperature, where the thermal energy kT exceeds the interaction energy). 
In other materials (such as some of the rare earth elements) the interaction is 
weaker, so they may not exhibit their ferromagnetic behavior until they are cooled 
to a point where the thermal energy kT is smaller than the interaction energy. 
In still other cases, the atomic spacing of the pure element may not permit 
ferromagnetism, but a different atomic spacing in certain compounds containing 
that element may allow the overlap interaction that causes the spins to align. For 
example, Cr is weakly paramagnetic at room temperature, but CrO_2 (which is 
used to make magnetic recording tape) is ferromagnetic. 

The band theory provides a framework for understanding ferromagnetic behavior.
Consider iron, which has the electronic configuration $3d^6 4s^2$. The partially
filled 3d band can be split into two subbands, one with spin up and one with
spin down. In the absence of the overlap interaction, the bands are at the same
energy and each band has 3 electrons per atom (out of its maximum capacity of 5)
as in Figure 11.56a. The effect of the overlap interaction is to raise the energy
of one band relative to the Fermi energy and lower the energy of the other (as
in Figure 11.56b). Now there are roughly 4 electrons per atom in the spin-up
subband and 2 in the spin-down subband, and this difference of approximately 2
electrons per atom in the 3d band is responsible for the net magnetization of iron.

FIGURE 11.56 The 3d band in Fe. (a) In the absence of the interaction, there are
3 electrons per atom in each of the spin-up and spin-down subbands. (b) The
interaction lowers the relative energy of the spin-up band, so there are now
4 electrons per atom with spin up and 2 with spin down.

1. Compare the equilibrium separations and binding energies
of ionic solids (Table 11.1) with those of the corresponding 
ionic molecules (Table 9.5). Account for any systematic 
differences. 

2. How should Eq. 11.7 be modified to be valid for MgO and 
BaO? 

3. From Figure 11.11, estimate the Einstein temperature for
lead. (Hint: Consider Eq. 11.11 when $T = T_E$.)

4. A graph of C/T vs. $T^2$ for solid argon (similar to
Figure 11.13) goes through the origin; that is, its y-intercept
is zero. Explain.

5. Assuming that its other properties don’t also change with 
temperature, at what temperature would you expect carbon 
to begin to behave like a semiconductor? 

6. Would you expect the Wiedemann-Franz law to apply to 
semiconductors? To insulators? 

7. (a) Why does the electrical conductivity of a metal decrease
as the temperature is increased? (b) How would you expect
the conductivity of a semiconductor to change with temperature?

8. Why is it that only the electrons near $E_F$ contribute to the
electrical conductivity?


9. Would you expect silicon to behave like an insulator at a
low enough temperature? Would it behave like a conductor
at a high enough temperature?

10. What determines the drift speed of an electron in a metal?
What determines the Fermi speed?

11. Do the superconducting elements have any particular electronic
structure or configuration in common?

12. Three different materials have filled valence bands and
empty conduction bands, and the Fermi energy lies in the
middle of the gap. The gap energies are 10 eV, 1 eV, and
0.01 eV. Classify the electrical properties of these materials
at room temperature and at 3 K.

13. In what way does a p-n junction behave as a capacitor?


14. Semiconductors are sometimes called “nonohmic” materials.
Why?

15. If a semiconductor is doped at a level of one impurity atom
per $10^9$ host atoms, what is the average spacing between the
impurity atoms?

16. Explain the processes that contribute to the current in a
forward-biased p-n junction. Do the same for a reverse-biased
junction.

17. What limits the response time of a p-n junction when the
external voltage is varied? Why does a tunnel diode not have
the same limits?

18. The energy gap $E_g$ is 0.72 eV for Ge and 1.10 eV for Si. At
what wavelengths will Ge and Si be transparent to radiation?
At what wavelengths will they begin to absorb significantly?

19. Why is a semiconductor better than a conductor for applications
as a solar cell or photon detector? Would an insulator
be even better?

20. Why is the magnetic susceptibility of an electron gas almost
independent of temperature?

21. Would a sample of paramagnetic material be attracted or
repelled by the N pole of a magnet? By the S pole? How
would a sample of diamagnetic material behave? How would
ferromagnetic material behave?

22. Is it possible for a material that has a positive magnetic
susceptibility at room temperature to have a negative
susceptibility at higher temperature?


Problems


11.1 Crystal Structures

1. Consider the packing of hard spheres in the simple cubic
geometry of Figure 11.1. Imagine eight spheres in contact
with their nearest neighbors, with their centers at the corners
of the basic cube. (a) What fraction of the volume of each
sphere is inside the volume of the basic cube? (b) Let r be
the radius of each sphere and let a be the length of a side
of the cube. Express a in terms of r. (c) What fraction of
the volume of the cube is taken up by the portions of the
spheres? This fraction is called the packing fraction.

2. Compute the packing fractions (see Problem 1) of
(a) the fcc structure (Figure 11.2) and (b) the bcc structure
(Figure 11.3). Which structure fills the space most
efficiently?

3. Derive Eqs. 11.5 and 11.6.

4. Calculate the first three contributions to the electrostatic
potential energy of an ion in the CsCl lattice.

5. (a) Find the binding energy per ion pair in CsCl from the
cohesive energy. (b) Find the binding energy per ion pair in
CsCl from Eq. 11.7. (c) Find the binding energy per atom
for CsCl. The ionization energy of Cs is 3.89 eV.

6. (a) Find the binding energy per ion pair in LiF from the
cohesive energy. (b) Find the binding energy per ion pair in
LiF from Eq. 11.7. (c) Find the binding energy per atom for
LiF. The ionization energy of Li is 5.39 eV, and the electron
affinity of F is 3.45 eV.

7. Calculate the Coulomb energy and the repulsion energy for
NaCl at its equilibrium separation.

8. The density of sodium is 0.971 g/cm$^3$ and its molar mass
is 23.0 g. In the bcc structure, what is the distance between
sodium atoms?

9. Copper has a density of 8.96 g/cm$^3$ and molar mass of
63.5 g. Calculate the center-to-center distance between copper
atoms in the fcc structure.

10. Calculate the binding energy per atom for metallic Na
and Cu.


11.2 The Heat Capacity of Solids

11. At what temperature do the lattice and electronic heat capacities
of copper become equal to each other? Take $T_D = 343$ K
and $E_F = 7.03$ eV. Which contribution is larger above this
temperature? Below this temperature?

12. (a) The heat capacity of solid argon at a temperature
of 2.00 K is $2.00 \times 10^{-2}$ J/mole·K. What is the Debye
temperature of solid argon? (See Question 4.) (b) What
value do you expect for the heat capacity at a temperature
of 3.00 K?

13. When C/T is plotted against $T^2$ for potassium, the graph
gives a straight line with a slope of $2.57 \times 10^{-3}$ J/mole·K$^4$.
What is the Debye temperature for potassium?

14. At a temperature of 4 K, the heat capacity of silver is
0.0134 J/mole·K. The Debye temperature of silver is 225 K.
(a) What is the electronic contribution to the heat capacity
at 4 K? (b) What are the lattice and electronic contributions
and the total heat capacity at 2 K?


11.3 Electrons in Metals

15. (a) In copper at room temperature, what is the electron
energy at which the Fermi-Dirac distribution function has
the value 0.1? (b) Over what energy range does the
Fermi-Dirac distribution function for copper drop from 0.9 to
0.1?

16. Calculate the de Broglie wavelength of an electron with
energy $E_F$ in copper, and compare the value with the atomic
separation in copper.

17. What is the number of occupied energy states per unit
volume in sodium for electrons with energies between 0.10
and 0.11 eV above the Fermi energy at room temperature
(293 K)?

The electrical conductivity of copper at room temperature is
$5.96 \times 10^{7}\ \Omega^{-1}\cdot\text{m}^{-1}$. Evaluate the average distance between
electron scatterings. How many lattice spacings does this
amount to?

At what temperature would the Fermi energy of Au be 
reduced by 1%? Compare this temperature with the melting 
point of Au (1337 K). Is it reasonable to assume the Fermi 
energy is a constant, independent of temperature? 

A copper wire of diameter 0.50mm carries a current of 
2.5 mA. What fraction of the copper electrons contributes to 
the electrical conduction? 

Use the Wiedemann-Franz ratio to calculate the thermal
conductivity of copper at room temperature. The electrical
conductivity is $5.96 \times 10^{7}\ \Omega^{-1}\cdot\text{m}^{-1}$.


Band Theory of Solids 


Estimate the ratio of the concentration of electrons in the 
conduction band of carbon (an insulator) and silicon (a semi- 
conductor) at room temperature (293 K). The energy gaps 
are 5.5eV for carbon and 1.1 eV for silicon. Assume the 
Fermi energy lies at the center of the gap. 


Estimate the ratio of the number of electrons in the conduction
bands of germanium ($E_g$ = 0.66 eV) and silicon
($E_g$ = 1.12 eV) at a temperature of 400 K. Assume the Fermi
energy is at the center of the gap.

The valence band in Si has a width of 12 eV. In a cube
of Si that measures 1.0 mm on each side, calculate (a) the
total number of states in the valence band, and (b) the average
spacing between the states. The density of silicon is
2.33 g/cm$^3$.


Superconductivity 


(a) Zirconium metal has a superconducting transition tem- 
perature of 0.61 K. Assuming the validity of the BCS theory, 
what is the energy gap for Zr? (b) If a beam of photons were 
incident on superconducting Zr, what wavelength of photons 
would be sufficient to break up the Cooper pairs? In what 
region of the electromagnetic spectrum are these photons?

When superconducting tantalum metal is illuminated with
a beam of photons, it is found that photon wavelengths of
up to 0.91 mm are sufficient to destroy the superconducting
state. According to the BCS theory, find the energy gap and
critical temperature for Ta.

Find the frequency of the current that results when a voltage
difference of 1.25 $\mu$V is applied across a Josephson junction.


Intrinsic and Impurity Semiconductors 


The temperature of a sample of intrinsic silicon is increased 
by 100K over room temperature (293K). Estimate the 
increase in conductivity that we would expect from this 
increase in temperature. The gap energy in silicon is 1.1 eV. 

(a) Estimate the fraction of the electrons in the valence band
of intrinsic silicon that can be excited to the conduction band 
at a temperature of 100K and at room temperature. Take 
the energy gap in silicon to be 1.1 eV. (b) By what factor 
would the conductivity of a metal change over this same 
temperature interval? 

Estimate the temperature at which the density of conduc- 
tion electrons in intrinsic germanium equals that of intrinsic 
silicon at room temperature (293 K). The gap energies are 
1.12 eV for Si and 0.66 eV for Ge. 

(a) When we replace an atom of silicon with an atom of 
phosphorus, the outer electron of phosphorus is screened 
so that the atom behaves like a single-electron atom with 
$Z_{\text{eff}} = 1$. Compute the energy of the electron, assuming
that silicon has a dielectric constant of 12 that effectively
reduces the electric field experienced by the electron.
(b) The additional electron in Si has an effective mass that is
0.43 times the free electron mass. How does this correction
factor change the electron energy?

Assuming the energy gap in intrinsic silicon is 1.1 eV and
that the Fermi energy lies at the middle of the gap, calculate
the occupation probability at 293 K of (a) a state at the
bottom of the conduction band and (b) a state at the top of
the valence band. 

In a sample of germanium at room temperature (293 K), 
what fraction of the Ge atoms must be replaced with donor 
atoms in order to increase the population of the conduction 
band by a factor of 3? Assume all donor atoms are ionized, 
and take the energy gap in Ge to be 0.66 eV. 


Semiconductor Devices 


(a) In a p-n junction at room temperature, what is the ratio 
between the current with a forward bias of 2.00 V to the 
current with a forward bias of 1.00 V? (b) Evaluate the same 
ratio at a temperature of 400 K. 

Under certain conditions, the current in a p-n junction at 
room temperature is observed to be 1.5 mA when a forward 
bias of 0.25 V is applied. If the same voltage were used to 
reverse bias the junction, what would be the current? 

Gallium phosphide ($E_g$ = 2.26 eV) and zinc selenide ($E_g$ =
2.87 eV) are commonly used to make LEDs. What is the
most prominent emission wavelength of these devices, and
what color is the corresponding light?

LEDs of varying colors can be made by mixing GaN
($E_g$ = 3.4 eV) and InN ($E_g$ = 0.7 eV) in different proportions.
Calculate the relative amounts of GaN and InN needed
to produce an LED that emits (a) green light (550 nm) and
(b) violet light (400 nm).

Estimate the fraction of spin-up electrons in sodium that flip
their spins (as in Figure 11.53) and thus contribute to the
Pauli paramagnetism. Assume a magnetic field of 1 T.

In copper sulfate, the Cu$^{2+}$ ions behave as if they have
$J = 1/2$, for which $g_J = 2$. In a magnetic field of 0.25 T, calculate
the relative numbers of copper ions in the $m_J = +1/2$
and $m_J = -1/2$ states at (a) 300 K and (b) 4.2 K.

Compute the Pauli paramagnetic susceptibilities for (a) Li
and (b) Ba. Compare with the experimental values, which
are respectively $14.2 \times 10^{-6}$ and $20.6 \times 10^{-6}$ (in cgs units
per mole).

(a) Calculate the expected paramagnetic susceptibility for
gold, assuming it to behave like a free electron gas.
(b) The observed cgs molar susceptibility at room temperature
is $-28.0 \times 10^{-6}$. Assuming the susceptibility consists
only of the diamagnetic and paramagnetic parts, find the
diamagnetic contribution to the susceptibility of gold.

(a) The experimental paramagnetic susceptibility of MnCl$_2$
at room temperature (293 K) in cgs units per mole is
$14350 \times 10^{-6}$. What is the value of $g_J^2 J(J+1)$ for the
Mn$^{2+}$ ion? (b) What is the electronic configuration expected
for the Mn$^{2+}$ ion? For this configuration, and assuming that
the electronic S corresponds to maximizing the total $M_S$,
what is the value of S for Mn$^{2+}$? For the arrangement
with that S, what is the corresponding total L? (c) With
$J = L + S$, what is the value of $g_J$ that follows from the
experimental susceptibility?

By summing the contributions for the attractive and repulsive
Coulomb potential energies, show that the Madelung
constant has the value 2 ln 2 for a one-dimensional “lattice”
of alternating positive and negative ions.

Plot the cohesive energies of ionic crystals (see Table 11.1 
and other data that you may find) against their melting points 
(see, for example, the Handbook of Chemistry and Physics). 
Is there a correlation between cohesive energies and melting 
points? 

Plot the cohesive energies of metallic crystals (see Table 11.3 
and other data that you may find) against their melting points 
(see, for example, the Handbook of Chemistry and Physics). 
Is there a correlation between cohesive energies and melting 
points? 

(a) By taking the derivative of the total potential energy of 
an ion in a lattice, find an expression for the force on the 
ion. (b) Suppose an ion is displaced from its equilibrium 
position by a small distance x, so that R = Ry +x. Show that 
for small values of x, the force can be written as F = —kx. 
Express k in terms of the other parameters of the crystal. 
(c) Find the value of k for NaCl and evaluate the oscillation 
frequency for a sodium ion. (d) Suppose a sodium ion in 
the lattice absorbed a photon of this frequency and began to 
oscillate. Find the wavelength of the photon. In what region 
of the electromagnetic spectrum is this photon? 

The electric field of a dipole is proportional to $1/r^3$. Assuming
that the induced dipole moment of molecule B is
proportional to the electric dipole field of molecule A,
show that the van der Waals force is proportional to $r^{-7}$.
(Hint: Show that the potential energy of dipole B in the
electric field caused by dipole A is proportional to $r^{-6}$.)

(a) Obtain the data for the heat capacity of aluminum
between 1 K and 100 K (see, for example, the Handbook
of Chemistry and Physics). Plot the data, and by trial and
error find the value of the Einstein temperature that gives the
best fit to the data. (b) Plot the data for temperatures below
10 K as C/T vs. $T^2$, determine the slope and intercept, and
deduce the Debye temperature and effective mass for Al
(using $E_F$ = 11.7 eV).

(a) Obtain the data for the heat capacity of gold between 1 K 
and 100K (see, for example, the Handbook of Chemistry 
and Physics). Plot the data, and by trial and error find the 
value of the Einstein temperature that gives the best fit to 
the data. (b) Plot the data for temperatures below 10 K as
C/T vs. $T^2$, determine the slope and intercept, and deduce
the Debye temperature and effective mass for Au (using
$E_F$ = 5.53 eV).

(a) A Cooper pair in a superconductor can be considered
to be a bound state with an energy uncertainty that is of
the order of the gap energy $E_g$. Assuming these pairs exist
close to the Fermi energy, find the uncertainty in the location
of a Cooper pair, which is a good estimate of its size.
(b) Estimate the size of a Cooper pair for aluminum ($E_F$ =
11.7 eV) and compare with the lattice spacing in aluminum
(0.286 nm).

When a material such as germanium is used as a photon
detector, an incoming photon makes many interactions and
excites many electrons across the gap between the valence
and the conduction band. (a) $^{137}$Cs emits a 662-keV gamma
ray. How many electrons are excited across the 0.66-eV
gap of germanium by the absorption of this gamma ray?
(b) The number N calculated in part (a) is subject to statistical
fluctuations of $\sqrt{N}$. Compute the variation in N and
the fractional variation in N. (c) What is the corresponding
variation in the measured energy of the gamma ray? This
result is the experimental resolution of the detector.

On a single graph, sketch the atomic paramagnetic susceptibility
as a function of inverse temperature for J = 1/2, 1,
and 3/2. Assume all other coefficients ($N/V$, $g_J$, $B_{\text{app}}$) to be
the same for these three cases. Plot the ratio of the susceptibility
to its maximum for that spin, so you can compare the
variation in the approach to saturation for these three spins.

The magnetic field at a distance r from a magnetic dipole
$\mu$ is $B = \mu_0 \mu / 2\pi r^3$. Show that the dipole-dipole interaction
energy is too small to account for the ferromagnetism of
iron at all but the lowest temperatures. Assume an effective
magnetic dipole moment of $2.2\,\mu_B$ per atom.

(a) Oxygen gas is observed to be paramagnetic at room temperature,
but nitrogen gas is diamagnetic. Explain this based
on the filling of the bonding and antibonding 2p orbitals in
O$_2$ and N$_2$. (Hint: See Example 9.1.) (b) Would you expect
NO gas to be paramagnetic or diamagnetic?

Enrico Fermi (1901-1954, Italy-United States). There is hardly a field
of modern physics to which he did not make contributions in theory or
experiment. He developed the statistical laws for spin-1/2 particles, and
in the 1930s he proposed a theory of beta decay that is still used today.
He was the first to demonstrate the transmutation of elements by neutron
bombardment (for which he received the 1938 Nobel Prize), and he directed
the construction of the first nuclear reactor.


The nucleus lies at the center of the atom, occupying only $10^{-15}$ of its volume
but providing the electrical force that holds the atom together. Within the nucleus 
there are Z positive charges. To keep these charges from flying apart, the nuclear 
force must supply an attraction that overcomes their electrical repulsion. This 
nuclear force is the strongest of the known forces; it provides nuclear binding 
energies that are millions of times stronger than atomic binding energies. 

There are many similarities between atomic structure and nuclear structure, 
which will make our study of the properties of the nucleus somewhat easier. 
Nuclei are subject to the laws of quantum physics. They have ground and excited 
states and emit photons in transitions between the excited states. Just like atomic 
states, nuclear states can be labeled by their angular momentum. 

There are, however, two major differences between the study of atomic and 
nuclear properties. In atomic physics, the electrons experience the force provided 
by an external agent, the nucleus; in nuclear physics, there is no such external 
agent. In contrast to atomic physics, in which we can often consider the interactions 
among the electrons as a perturbation to the primary interaction between electrons 
and nucleus, in nuclear physics the mutual interaction of the nuclear constituents 
is just what provides the nuclear force, so we cannot treat this complicated many-
body problem as a correction to a single-body problem. We therefore cannot avoid 
the mathematical difficulties in the nuclear case, as we did in the atomic case. 

The second difference between atomic and nuclear physics is that we cannot 
write the nuclear force in a simple form like the Coulomb force. There is no 
closed-form analytical expression that can be written to describe the mutual forces 
of the nuclear constituents. 

In spite of these difficulties, we can learn a great deal about the properties of 
the nucleus by studying the interactions between different nuclei, the radioactive 
decay of nuclei, and the properties of some nuclear constituents. In this chapter and 
the next we describe these studies and how we learn about the nucleus from them. 


12.1 NUCLEAR CONSTITUENTS 


The work of Rutherford, Bohr, and their contemporaries in the years between 
1911 and 1920 showed that the positive charge of the atom is confined in a very 
small nuclear region at the center of the atom, that the nucleus in an atom of 
atomic number Z has a charge of +Ze, and that the nucleus provides most (99.9%) 
of the atomic mass. It was also known that the masses of the atoms (measured in 
atomic mass units) were very nearly integers; a glance at Appendix D confirms 
this observation, usually to within about 0.1%. We call this integer A the mass 
number. It was therefore reasonable to suppose that nuclei are composed of a 
number A of more fundamental units whose mass is very close to 1 u. Because
the only particle known at that time with a mass close to 1 u was the proton
(the nucleus of hydrogen, with a mass of 1.0073 u and a charge of +e), it was 
postulated (incorrectly, as we shall see) that the nucleus of an atom of mass 
number A contained A protons. 

Such a nucleus would have a nuclear charge of Ae rather than Ze; because 
A > Z for all atoms heavier than hydrogen, this model gives too much positive 
charge to the nucleus. This difficulty was removed by the proton-electron model, 
in which it was postulated (again incorrectly) that the nucleus also contained 
(A − Z) electrons. Under these assumptions, the nuclear mass would be about
A times the mass of the proton (because the mass of the electrons is negligible)
and the nuclear electric charge would be A(+e) + (A − Z)(−e) = Ze, in
agreement with experiment. However, this model leads to several difficulties.
First, as we discovered in Chapter 4 (see Example 4.7), the presence of electrons 
in the nucleus is not consistent with the uncertainty principle, which would 
require those electrons to have unreasonably large (~ 19 MeV) kinetic energies. 

A more serious problem concerns the total intrinsic spin of the nucleus. From 
measurements of the very small effect of the nuclear magnetic moment on the 
atomic transitions (called the hyperfine splitting), we know that the proton has 
an intrinsic spin of 1/2, just like the electron. Consider an atom of deuterium, 
sometimes known as “heavy hydrogen.” It has a nuclear charge of +e, just like 
ordinary hydrogen, but a mass of two units, twice that of ordinary hydrogen. 
The proton-electron nuclear model would then require that the deuterium nucleus 
contain two protons and one electron, giving a net mass of two units and a net 
charge of one. Each of these three particles has a spin of 1/2, and the rules for adding
angular momenta in quantum mechanics would lead to a spin of deuterium of either
1/2 or 3/2. However, the measured total spin of deuterium is 1. For these and other
reasons, the hypothesis that electrons are a nuclear constituent must be discarded. 
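The spin argument above can be checked by enumerating the allowed couplings. The sketch below applies the standard rule that two angular momenta j1 and j2 combine to totals from |j1 − j2| to j1 + j2 in integer steps; the function names are illustrative only. The two-particle case is included purely for comparison.

```python
from fractions import Fraction


def couple(j1, j2):
    """Allowed totals when adding angular momenta j1 and j2:
    |j1 - j2|, |j1 - j2| + 1, ..., j1 + j2."""
    low, high = abs(j1 - j2), j1 + j2
    return {low + k for k in range(int(high - low) + 1)}


def total_spins(spins):
    """Sorted list of possible total spins for a list of individual spins."""
    totals = {spins[0]}
    for s in spins[1:]:
        totals = set().union(*(couple(t, s) for t in totals))
    return sorted(totals)


half = Fraction(1, 2)

# Proton-electron model of deuterium: two protons and one electron.
three_particles = total_spins([half, half, half])
# For comparison: any two spin-1/2 particles.
two_particles = total_spins([half, half])

print("three spin-1/2 particles:", [str(s) for s in three_particles])
print("two spin-1/2 particles:  ", [str(s) for s in two_particles])
print("spin 1 allowed for three particles?", Fraction(1) in three_particles)
```

Three spin-1/2 particles can only give half-integer totals, so the measured spin of 1 cannot be reached, whereas a pair of spin-1/2 particles can couple to 0 or 1.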

The resolution of this dilemma came in 1932 with the discovery of the neutron, 
a particle of roughly the same mass as the proton (actually about 0.1% more 
massive) but having no electric charge. According to the proton-neutron model, 
a nucleus consists of Z protons and (A — Z) neutrons, giving a total charge of Ze 
and a total mass of roughly A times the mass of the proton, because the proton 
and neutron masses are roughly the same. 

The proton and neutron are, except for their electric charges, very similar to 
one another, and so they are classified together as nucleons. Some properties of 
the two nucleons are listed in Table 12.1. 

The chemical properties of any element depend on its atomic number Z, but 
not on its mass number A. It is possible to have two different nuclei with the same 
Z but with different A (that is, with the same number of protons but different 
numbers of neutrons). Atoms of these nuclei are identical in all their chemical 
properties, differing only in mass and in those properties that depend on mass. 
Nuclei with the same Z but different A are called isotopes. Hydrogen, for example, 
has three isotopes: ordinary hydrogen (Z = 1, A = 1), deuterium (Z = 1, A = 2), 
and tritium (Z = 1, A = 3). All of these are indicated by the chemical symbol
H. When we discuss nuclear properties it is important to distinguish among the 
different isotopes. We do this by indicating, along with the chemical symbol, the 
atomic number Z, the mass number A, and the neutron number N = A — Z in the 
following format: 


$$^{A}_{Z}\mathrm{X}_{N}$$


where X is any chemical symbol. The chemical symbol and the atomic number Z
give the same information, so it is not necessary to include both of them in the
isotope label. Also, if we specify Z then we don’t need to specify both N and
A. It is sufficient to give only the chemical symbol and A. The three isotopes of
hydrogen would be indicated as $^{1}_{1}\mathrm{H}_{0}$, $^{2}_{1}\mathrm{H}_{1}$, and $^{3}_{1}\mathrm{H}_{2}$, or more compactly as $^{1}$H,
$^{2}$H, and $^{3}$H. In Appendix D you will find a list of isotopes and some of their
properties.


TABLE 12.1 Properties of the Nucleons

Name      Symbol   Charge   Mass          Rest Energy   Spin
Proton    p        +e       1.007276 u    938.28 MeV    1/2
Neutron   n        0        1.008665 u    939.57 MeV    1/2


Example 12.1

Give the symbol for the following: (a) the isotope of helium
with mass number 4; (b) the isotope of tin with 66 neutrons;
(c) an isotope with mass number 235 that contains
143 neutrons.

Solution
(a) From the periodic table, we find that helium has Z = 2.
With A = 4, we have N = A − Z = 2. Thus the symbol
would be $^{4}_{2}\mathrm{He}_{2}$ or $^{4}$He.

(b) Again from the periodic table, we know that for tin
(Sn), Z = 50. We are given N = 66, so A = Z + N = 116.
The symbol is $^{116}_{50}\mathrm{Sn}_{66}$ or $^{116}$Sn.

(c) Given that A = 235 and N = 143, we know that
Z = A − N = 92. From the periodic table, we find that
this element is uranium, and so the proper symbol for this
isotope is $^{235}_{92}\mathrm{U}_{143}$ or $^{235}$U.
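The bookkeeping in Example 12.1 (any two of Z, A, and N fix the third through A = Z + N) can be sketched as a small helper. The function name and the four-entry symbol table are illustrative assumptions; a real tool would need the full periodic table, and the label is written here in a plain-text `^A_Z X_N` style.

```python
# Illustrative lookup only; a complete periodic table is needed in practice.
SYMBOLS = {1: "H", 2: "He", 50: "Sn", 92: "U"}


def isotope_label(z=None, a=None, n=None):
    """Return (full, compact) isotope labels given at least two of
    Z (atomic number), A (mass number), N (neutron number)."""
    if z is None:
        z = a - n
    elif a is None:
        a = z + n
    elif n is None:
        n = a - z
    if a != z + n:
        raise ValueError("inconsistent values: A must equal Z + N")
    symbol = SYMBOLS[z]
    return f"^{a}_{z}{symbol}_{n}", f"^{a}{symbol}"


print(isotope_label(z=2, a=4))      # helium with mass number 4
print(isotope_label(z=50, n=66))    # tin with 66 neutrons
print(isotope_label(a=235, n=143))  # mass number 235 with 143 neutrons
```

The three calls reproduce parts (a), (b), and (c) of the example.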





12.2 NUCLEAR SIZES AND SHAPES 


Like atoms, nuclei lack a hard surface or an easily definable radius. In fact, 
different types of experiments can often reveal different values of the radius for 
the same nucleus. 

From a variety of experiments, we know some general features of the nuclear 
density. Its variation with the nuclear radius is shown in Figure 12.1. Because 
the nuclear force is the strongest of the forces, we might expect that this strong 
force would cause the protons and neutrons to congregate at the center of the 
nucleus, giving an increasing density in the central region. However, Figure 12.1 
shows that this is not the case—the density remains quite uniform. This gives 
some important clues about the short range of the nuclear force, as we discuss in 
Section 12.4. 

Another interesting feature of Figure 12.1 is that the density of a nucleus seems 
not to depend on the mass number A; very light nuclei, such as $^{12}$C, have roughly
the same central density as very heavy nuclei, such as $^{209}$Bi. Stated another way,
the number of protons and neutrons per unit volume is approximately constant 
over the entire range of nuclei: 





$$\frac{\text{number of neutrons and protons}}{\text{volume of nucleus}} = \frac{A}{\frac{4}{3}\pi R^{3}} \approx \text{constant}$$

assuming the nucleus to be a sphere of radius R. Thus $A \propto R^{3}$, which suggests
a proportionality between the nuclear radius R and the cube root of the mass
number: $R \propto A^{1/3}$ or, defining a constant of proportionality $R_0$,


$$R = R_0 A^{1/3} \qquad (12.1)$$

The constant $R_0$ must be determined by experiment, and a typical experiment
might be to scatter charged particles (alpha particles or electrons, for example)
from the nucleus and to infer the radius of the nucleus from the distribution
of scattered particles. From such experiments, we know the value of $R_0$ is
approximately $1.2 \times 10^{-15}$ m. (The exact value depends, as in the case of atomic
physics, on exactly how we define the radius, and values of $R_0$ usually range from
$1.0 \times 10^{-15}$ m to $1.5 \times 10^{-15}$ m.) The length $10^{-15}$ m is 1 femtometer (fm), but
physicists often refer to this length as one fermi, in honor of the Italian-American
physicist Enrico Fermi.

FIGURE 12.1 The radial dependence of the nuclear charge density.
(Axes: nuclear density versus r in fm.)


Example 12.2

Compute the approximate nuclear radius of carbon (A = 12),
germanium (A = 70), and bismuth (A = 209).

Solution
Using Eq. 12.1, we obtain:

Carbon: $R = R_0 A^{1/3} = (1.2\ \text{fm})(12)^{1/3} = 2.7$ fm
Germanium: $R = R_0 A^{1/3} = (1.2\ \text{fm})(70)^{1/3} = 4.9$ fm
Bismuth: $R = R_0 A^{1/3} = (1.2\ \text{fm})(209)^{1/3} = 7.1$ fm

As you can see from Figure 12.1, these values define the
mean radius, the point at which the density falls to half the
central value.
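The radius formula of Eq. 12.1 is easy to evaluate for any mass number. The short sketch below assumes $R_0$ = 1.2 fm, the value quoted in the text; the function name is illustrative.

```python
R0_FM = 1.2  # fm, value of R_0 quoted in the text


def nuclear_radius_fm(mass_number, r0_fm=R0_FM):
    """Nuclear radius R = R_0 * A**(1/3), in femtometers (Eq. 12.1)."""
    return r0_fm * mass_number ** (1.0 / 3.0)


for name, a in [("carbon", 12), ("germanium", 70), ("bismuth", 209)]:
    print(f"{name:10s} A = {a:3d}   R = {nuclear_radius_fm(a):.2f} fm")
```

Because R grows only as the cube root of A, a roughly seventeenfold increase in mass number from carbon to bismuth increases the radius by a factor of less than three.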





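Example 12.3

Compute the density of a typical nucleus, and find the
resultant mass if we could produce a nucleus with a radius
of 1 cm.

Solution
Making a rough estimate of the nuclear mass m as A times
the proton mass, we have

$$\rho = \frac{m}{V} = \frac{A m_p}{\frac{4}{3}\pi R^{3}} = \frac{A m_p}{\frac{4}{3}\pi R_0^{3} A} = \frac{1.67 \times 10^{-27}\ \text{kg}}{\frac{4}{3}\pi (1.2 \times 10^{-15}\ \text{m})^{3}} = 2 \times 10^{17}\ \text{kg/m}^{3}$$

The mass of a hypothetical nucleus with a 1-cm radius
would be

$$m' = \rho\left(\tfrac{4}{3}\pi R'^{3}\right) = (2 \times 10^{17}\ \text{kg/m}^{3})\left(\tfrac{4}{3}\pi\right)(0.01\ \text{m})^{3} = 8 \times 10^{11}\ \text{kg}$$

about the mass of a 1-km sphere of ordinary matter!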








The result of Example 12.3 shows the great density of what physicists call 
nuclear matter. Although examples of such nuclear matter in bulk are not found 
on Earth (a sample of nuclear matter the size of a large building would have a 
mass as great as that of the entire Earth), they are found in certain massive stars, in 
which the gravitational force causes protons and electrons to merge into neutrons, 
creating a neutron star (see Section 10.7) that is in effect a giant atomic nucleus! 
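The numbers in Example 12.3 and the "large building" comparison can be reproduced in a few lines. This is a rough sketch: the Earth's mass (about 5.97 × 10^24 kg) is a handbook value not given in the text, and the unrounded density gives a 1-cm mass slightly larger than the example, which rounds the density to one significant figure first.

```python
import math

M_PROTON = 1.67e-27  # kg, proton mass used in Example 12.3
R0 = 1.2e-15         # m, R_0 from Eq. 12.1
M_EARTH = 5.97e24    # kg, assumed handbook value (not from the text)


def sphere_volume(radius_m):
    """Volume of a sphere in cubic meters."""
    return 4.0 / 3.0 * math.pi * radius_m ** 3


# The mass number A cancels: rho = A*m_p / (4/3*pi*R0^3*A)
density = M_PROTON / sphere_volume(R0)

# Mass of a hypothetical nucleus with a 1-cm radius
mass_1cm = density * sphere_volume(0.01)

# Side of a cube of nuclear matter with the mass of the Earth
cube_side = (M_EARTH / density) ** (1.0 / 3.0)

print(f"nuclear density:         {density:.2e} kg/m^3")
print(f"mass of 1-cm nucleus:    {mass_1cm:.1e} kg")
print(f"Earth-mass cube of side: {cube_side:.0f} m")
```

Under these assumptions the cube comes out a few hundred meters on a side, which is the scale of a very large building.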

One way of measuring the size of a nucleus is to scatter charged particles, such 
as alpha particles, as in Rutherford scattering experiments. As long as the alpha 
particle is outside the nucleus, the Rutherford scattering formula holds, but when 
the distance of closest approach is less than the nuclear radius, deviations from the 
Rutherford formula occur. Figure 12.2 shows the results of a Rutherford scattering 
experiment in which such deviations are observed. (Problem 33 suggests how a 
value for the nuclear radius can be inferred from these data.) 
FIGURE 12.2 Deviations from the Rutherford formula in scattering from
$^{208}$Pb are observed for alpha-particle energies above 27 MeV.

FIGURE 12.3 Diffraction of 360-MeV and 420-MeV electrons by $^{12}$C
and $^{16}$O nuclei.


Other scattering experiments can also be used to measure the nuclear radius. 
Figure 12.3 shows a sort of “diffraction pattern” that results from the scattering 
of energetic electrons by a nucleus. In each case the first diffraction minimum 
is clearly visible. (The intensity at the minimum doesn’t fall to zero because 
the nuclear density doesn’t have a sharp edge, as illustrated in Figure 12.1.) For 
scattering of radiation of wavelength $\lambda$ by a circular disc of diameter $D$, the first
diffraction minimum should appear at an angle of $\theta = \sin^{-1}(1.22\lambda/D)$. (Review
Example 4.2 for another example of this calculation.) At an electron energy of
420 MeV, the observed minima for $^{16}$O and $^{12}$C give a radius of 2.6 fm for $^{16}$O
and 2.3 fm for $^{12}$C (see Problem 34), in reasonable agreement with the values 3.0 fm and 2.7
fm computed from Equation 12.1.
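The diffraction estimate can be sketched in a few lines. This is an illustration only: it assumes the electron is ultra-relativistic, so that its de Broglie wavelength is approximately hc/E with hc = 1240 MeV fm, and it treats the nucleus as a disc of diameter D = 2R. The function names are hypothetical.

```python
import math

HC_MEV_FM = 1240.0  # h*c in MeV*fm


def electron_wavelength_fm(energy_mev):
    """de Broglie wavelength, assuming pc is approximately E (ultra-relativistic)."""
    return HC_MEV_FM / energy_mev


def first_minimum_angle_deg(wavelength_fm, radius_fm):
    """Angle of the first diffraction minimum for a disc of diameter 2R."""
    s = 1.22 * wavelength_fm / (2.0 * radius_fm)
    if s > 1.0:
        raise ValueError("no minimum: wavelength too long for this radius")
    return math.degrees(math.asin(s))


def radius_from_minimum_fm(wavelength_fm, angle_deg):
    """Invert the relation: radius implied by an observed minimum angle."""
    return 1.22 * wavelength_fm / (2.0 * math.sin(math.radians(angle_deg)))


if __name__ == "__main__":
    lam = electron_wavelength_fm(420.0)
    theta = first_minimum_angle_deg(lam, 2.6)
    print(f"wavelength = {lam:.2f} fm, first minimum near {theta:.0f} degrees")
```

With a radius of 2.6 fm and 420-MeV electrons, the sketch places the first minimum at roughly 44 degrees; reading an angle off the measured pattern and calling `radius_from_minimum_fm` goes the other way.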


12.3 NUCLEAR MASSES AND BINDING ENERGIES 


Suppose we have a proton and an electron at rest separated by a large distance. 
The total energy of this system is the total rest energy of the two particles, 
$m_p c^2 + m_e c^2$. Now we let the two particles come together to form a hydrogen
atom in its ground state. In the process, several photons are emitted, the total
energy of which is 13.6 eV. The total energy of this system is the rest energy of
the hydrogen atom, $m(\text{H})c^2$, plus the total photon energy, 13.6 eV. Conservation
of energy demands that the total energy of the system of isolated particles must
equal the total energy of atom plus photons: $m_p c^2 + m_e c^2 = m(\text{H})c^2 + 13.6\ \text{eV}$,
which we write as

$$m_p c^2 + m_e c^2 - m(\text{H})c^2 = 13.6\ \text{eV}$$

That is, the rest energy of the combined system (the hydrogen atom) is less than
the rest energy of its constituents (an electron and a proton) by 13.6eV. This 
energy difference is the binding energy of the atom. We can regard the binding 
energy as either the “extra” energy we obtain when we assemble an atom from its 
components or else the energy we must supply to disassemble the atom into its 
components. 

Nuclear binding energies are calculated in a similar way. Consider, for example, 
the nucleus of deuterium, $^2_1\text{H}_1$, which is composed of one proton and one neutron.
The nuclear binding energy of deuterium is the difference between the total rest
energy of the constituents and the rest energy of their combination:

$$B = m_n c^2 + m_p c^2 - m_D c^2 \tag{12.2}$$

where $m_D$ is the mass of the deuterium nucleus. To finish the calculation, we
replace the nuclear masses $m_p$ and $m_D$ with the corresponding atomic masses:
$m(^1\text{H})c^2 = m_p c^2 + m_e c^2 - 13.6\ \text{eV}$ and $m(^2\text{H})c^2 = m_D c^2 + m_e c^2 - 13.6\ \text{eV}$.
Substituting into Eq. 12.2, we obtain

$$B = m_n c^2 + [m(^1\text{H})c^2 - m_e c^2 + 13.6\ \text{eV}] - [m(^2\text{H})c^2 - m_e c^2 + 13.6\ \text{eV}] = [m_n + m(^1\text{H}) - m(^2\text{H})]c^2$$

Notice that the electron mass cancels in this calculation. For deuterium, we then
have

$$B = (1.008665\ \text{u} + 1.007825\ \text{u} - 2.014102\ \text{u})(931.5\ \text{MeV/u}) = 2.224\ \text{MeV}$$

Here we use $c^2 = 931.5$ MeV/u to convert mass units to energy units.

Let’s generalize this process to calculate the binding energy of a nucleus X 
of mass number A with Z protons and N neutrons. Let $m_X$ represent the mass of
this nucleus. Then the binding energy of the nucleus is, by analogy with Eq. 12.2,
the difference between the nuclear rest energy and the total rest energy of its
constituents (N neutrons and Z protons):

$$B = N m_n c^2 + Z m_p c^2 - m_X c^2 \tag{12.3}$$

In order to use tabulated atomic masses to do this calculation, we must replace
the nuclear mass $m_X$ with its corresponding atomic mass: $m(^A_Z\text{X}_N)c^2 = m_X c^2 + Z m_e c^2 - B_e$,
where $B_e$ represents the total binding energy of all the electrons
in this atom. Nuclear rest energies are of the order of $10^9$ to $10^{11}$ eV, total
electron rest energies are of the order of $10^6$ to $10^8$ eV, and electron binding
energies are of the order of 1 to $10^5$ eV. Thus $B_e$ is very small compared with the
other two terms, and we can safely neglect it to the accuracy we need for these
calculations.

Substituting atomic masses for the nuclear masses $m_p$ and $m_X$, we obtain an
expression for the total binding energy of any nucleus $^A_Z\text{X}_N$:

$$B = [N m_n + Z m(^1_1\text{H}_0) - m(^A_Z\text{X}_N)]c^2 \tag{12.4}$$


The electron masses cancel in this equation, because on the right side we have 
the difference between the mass of Z hydrogen atoms (with a total of Z electrons) 
and the atom of atomic number Z (also with Z electrons). The masses that appear 
in Eq. 12.4 are atomic masses. 
Example 12.4

Find the total binding energy B and also the average binding energy per nucleon B/A for $^{56}_{26}\text{Fe}_{30}$ and $^{238}_{92}\text{U}_{146}$.

Solution

From Eq. 12.4, for $^{56}_{26}\text{Fe}_{30}$ with N = 30 and Z = 26,

$$B = [30(1.008665\ \text{u}) + 26(1.007825\ \text{u}) - 55.934937\ \text{u}](931.5\ \text{MeV/u}) = 492.3\ \text{MeV}$$

$$\frac{B}{A} = \frac{492.3\ \text{MeV}}{56} = 8.790\ \text{MeV per nucleon}$$

For $^{238}_{92}\text{U}_{146}$,

$$B = [146(1.008665\ \text{u}) + 92(1.007825\ \text{u}) - 238.050788\ \text{u}](931.5\ \text{MeV/u}) = 1802\ \text{MeV}$$

$$\frac{B}{A} = \frac{1802\ \text{MeV}}{238} = 7.570\ \text{MeV per nucleon}$$
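Equation 12.4 translates directly into a short function. The sketch below uses only the atomic masses quoted in the text; the function name and argument order are illustrative choices.

```python
U_TO_MEV = 931.5        # c^2 in MeV/u
M_NEUTRON_U = 1.008665  # neutron mass in u
M_H1_U = 1.007825       # atomic mass of 1H in u


def binding_energy_mev(Z, N, atomic_mass_u):
    """Total binding energy from Eq. 12.4, using atomic masses in u."""
    return (N * M_NEUTRON_U + Z * M_H1_U - atomic_mass_u) * U_TO_MEV


if __name__ == "__main__":
    # (label, Z, N, atomic mass in u), all taken from the text
    nuclides = [
        ("2H", 1, 1, 2.014102),
        ("56Fe", 26, 30, 55.934937),
        ("238U", 92, 146, 238.050788),
    ]
    for label, Z, N, mass in nuclides:
        B = binding_energy_mev(Z, N, mass)
        print(f"{label}: B = {B:.3f} MeV, B/A = {B / (Z + N):.3f} MeV per nucleon")
```

Using atomic rather than nuclear masses is what lets the electron masses cancel, so any table of atomic masses can be fed to this function unchanged.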





Example 12.4 gives us insight into an important aspect of nuclear structure. 
The values of B/A show that the nucleus $^{56}$Fe is relatively more tightly bound than
the nucleus $^{238}$U; the average binding energy per nucleon is greater for $^{56}$Fe than
for $^{238}$U. Alternatively, this calculation shows that, given a large supply of protons
and neutrons, we would release more energy by assembling those nucleons into
nuclei of $^{56}$Fe than we would by assembling them into nuclei of $^{238}$U.

Repeating this calculation for the entire range of nuclei, we obtain the results
shown in Figure 12.4. The binding energy per nucleon starts at small values
(0 for the proton and neutron, 1.11 MeV for deuterium), rises to a maximum of
8.795 MeV for $^{62}$Ni, and then falls to values of around 7.5 MeV for the heavy
nuclei.

FIGURE 12.4 The binding energy per nucleon.

The binding energy per nucleon is roughly constant over a fairly wide range of 
nuclei. From the region around A = 60 there is a sharp decrease for light nuclei, 
which is caused by their having an increasing relative fraction of loosely bound 
protons and neutrons on the surface. There is a gradual decrease for the more 
massive nuclei, due to the increasing Coulomb repulsion of the protons. 

Figure 12.4 suggests that we can liberate energy from the nucleus in two 
different ways. If we split a massive nucleus (say, A > 200) into two lighter 
nuclei, energy is released, because the binding energy per nucleon is greater for 
the two lighter fragments than it is for the original nucleus. This process is known 
as nuclear fission. Alternatively, we could combine two light nuclei (A < 10, for 
example) into a more massive nucleus; again, energy is released when the binding 
energy per nucleon is greater in the final nucleus than it is in the two original 
nuclei. This process is known as nuclear fusion. We consider fission and fusion 
in greater detail in Chapter 13. 


Proton and Neutron Separation Energies 


If we add the ionization energy $E_i$ (13.6 eV) to a hydrogen atom, we obtain a
hydrogen ion H$^+$ and a free electron. In terms of the rest energies of the particles,
we can write this process as $E_i + m(\text{H})c^2 = m(\text{H}^+)c^2 + m_e c^2$. If we generalize to
an arbitrary element X, this becomes $E_i + m(\text{X})c^2 = m(\text{X}^+)c^2 + m_e c^2$, or

$$\text{X} \to \text{X}^+ + e^-: \quad E_i = m(\text{X}^+)c^2 + m_e c^2 - m(\text{X})c^2 = [m(\text{X}^+) + m_e - m(\text{X})]c^2$$


In the case of element X, the ionization energy gives the smallest amount of 
energy necessary to remove an electron from an atom, and we saw in Figure 8.8 
how the ionization energy provides important information about the properties of 
atoms. 

For nuclei, a process similar to ionization consists of removing the least tightly 
bound proton or neutron from the nucleus. The energy required to remove the 
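least tightly bound proton is called the proton separation energy $S_p$. If we add
energy $S_p$ to a nucleus $^A_Z\text{X}_N$, we obtain the nucleus $^{A-1}_{Z-1}\text{X}'_N$ and a free proton. In
analogy with the atomic case, we can write the separation energy as

$$^A_Z\text{X}_N \to\ ^{A-1}_{Z-1}\text{X}'_N + p: \quad S_p = [m(^{A-1}_{Z-1}\text{X}'_N) + m(^1\text{H}) - m(^A_Z\text{X}_N)]c^2 \tag{12.5}$$

using atomic masses. Similarly, if we add the neutron separation energy $S_n$ to
nucleus $^A_Z\text{X}_N$, we obtain the nucleus $^{A-1}_Z\text{X}_{N-1}$ and a free neutron:

$$^A_Z\text{X}_N \to\ ^{A-1}_Z\text{X}_{N-1} + n: \quad S_n = [m(^{A-1}_Z\text{X}_{N-1}) + m_n - m(^A_Z\text{X}_N)]c^2 \tag{12.6}$$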


Proton and neutron separation energies are typically in the range of 5-10 MeV. 
It is no coincidence that this energy is about the same as the average binding 
energy per nucleon. The total binding energy B of a nucleus is the energy needed 
to take it apart into Z free protons and N free neutrons. This energy is the sum of 
A proton and neutron separation energies. 
Example 12.5

Find the proton separation energy and the neutron separation energy of $^{125}$Te.

Solution

To separate a proton, $^{125}\text{Te} \to\ ^{124}\text{Sb} + p$. Using Eq. 12.5, the proton separation energy is

$$S_p = [m(^{124}\text{Sb}) + m(^1\text{H}) - m(^{125}\text{Te})]c^2 = (123.905936\ \text{u} + 1.007825\ \text{u} - 124.904431\ \text{u})(931.50\ \text{MeV/u}) = 8.691\ \text{MeV}$$

For neutron separation, $^{125}\text{Te} \to\ ^{124}\text{Te} + n$. The neutron separation energy is (from Eq. 12.6)

$$S_n = [m(^{124}\text{Te}) + m_n - m(^{125}\text{Te})]c^2 = (123.902818\ \text{u} + 1.008665\ \text{u} - 124.904431\ \text{u})(931.50\ \text{MeV/u}) = 6.569\ \text{MeV}$$
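Both separation energies have the same structure (residual mass plus removed-particle mass minus parent mass), so a single helper covers Eqs. 12.5 and 12.6. This is a minimal sketch using the atomic masses quoted in the example; the function name is an illustrative choice.

```python
U_TO_MEV = 931.5        # c^2 in MeV/u
M_NEUTRON_U = 1.008665  # neutron mass in u
M_H1_U = 1.007825       # atomic mass of 1H in u (used for proton removal)


def separation_energy_mev(residual_mass_u, removed_mass_u, parent_mass_u):
    """Energy needed to remove one nucleon, from atomic masses in u."""
    return (residual_mass_u + removed_mass_u - parent_mass_u) * U_TO_MEV


if __name__ == "__main__":
    m_te125 = 124.904431
    s_p = separation_energy_mev(123.905936, M_H1_U, m_te125)       # 125Te -> 124Sb + p
    s_n = separation_energy_mev(123.902818, M_NEUTRON_U, m_te125)  # 125Te -> 124Te + n
    print(f"S_p = {s_p:.3f} MeV, S_n = {s_n:.3f} MeV")
```

For proton removal the hydrogen atomic mass is passed in, not the bare proton mass, so that the electron count balances on both sides.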





The proton and neutron separation energies play a role in nuclei similar to 
that of the ionization energy in atoms. Figure 12.5 shows a plot of the neutron 
separation energies of nuclei with a “valence” neutron (and no valence proton) 
from Z = 36 to Z = 62. As we add neutrons, the neutron separation energy 
decreases smoothly except near N = 50 and N = 82, where there are more 
sudden decreases in the separation energy. In analogy with atomic physics (see 
Figure 8.8), these sudden decreases are associated with the filling of shells. The 
motions of neutrons and protons in the nucleus are described in terms of a shell 
structure that is similar to that of atomic shells, and when a neutron or proton is 
placed into a new shell it is less tightly bound and its separation energy decreases. 
The neutron separation data indicate that there are closed neutron shells at N = 50
and N = 82. Relationships such as Figure 12.5 provide important information
about the shell structure of nuclei.

FIGURE 12.5 The neutron separation energy. The lines connect isotopes
of the same element that have an odd neutron, starting on the left at
Z = 36 and ending on the right at Z = 62.


12.4 THE NUCLEAR FORCE 


Our successful experience with using the simplest atom, hydrogen, to gain insights 
into atomic structure suggests that we should begin our study of the nuclear force 
by looking at the simplest system in which that force operates: the deuterium
nucleus, which consists of one proton and one neutron. For example, we might
hope to learn something about the nuclear force from the photons emitted in 
transitions between the excited states of this nucleus. Unfortunately, this strategy 
does not work—deuterium has no nuclear excited states. When we bring a proton 
and an electron together to form a hydrogen atom, many photons may be emitted 
as the electron drops into its ground state; from this spectrum we learn the energies 
of the excited states. When we bring a proton and a neutron together to form 
a deuterium nucleus, only one photon (of energy 2.224 MeV) is emitted as the 
system drops directly into its ground state. 

Even though we can’t use the excited states of deuterium, we can learn about the 
nuclear force in the proton-neutron system by scattering neutrons from protons as 
well as by doing a variety of different experiments with heavier nuclei. From these 
experiments we have learned the following characteristics of the nuclear force: 


1. The nuclear force is the strongest of the known forces, and so it is sometimes 
called the strong force. For two adjacent protons in a nucleus, the nuclear 
interaction is 10—100 times stronger than the electromagnetic interaction. 

2. The strong nuclear force has a very short range: the distance over which
the force acts is limited to about $10^{-15}$ m. This conclusion follows from the
constant central density of nuclear matter (Figure 12.1). As we add nucleons 
to a nucleus, each added nucleon feels a force only from its nearest neighbors, 
and not from all the other nucleons in the nucleus. In this respect, a nucleus 
behaves somewhat like a crystal, in which each atom interacts primarily with 
its nearest neighbors, and additional atoms make the crystal larger but don’t 
change its density. Another piece of evidence for the short range comes from 
Figure 12.4. Because the binding energy per nucleon is roughly constant, total 
nuclear binding energies are roughly proportional to A. For a force with long 
range (such as the gravitational and electrostatic forces, which have infinite 
range) the binding energy is roughly proportional to the square of the number of 
interacting particles. (For example, because each of the Z protons in a nucleus 
feels the repulsion of the other Z — 1 protons, the total electrostatic energy of 
the nucleus is proportional to $Z(Z - 1)$, which is roughly $Z^2$ for large Z.)

Figure 12.6 illustrates the dependence of the nuclear binding energy on the 
separation distance between the nucleons. The binding energy is relatively 
constant for separation distances less than about 1 fm, and it is zero for
separation distances much greater than 1 fm.

3. The nuclear force between any two nucleons does not depend on whether the
nucleons are protons or neutrons: the n-p nuclear force is the same as the n-n
nuclear force, which is in turn the same as the nuclear portion of the p-p force.


A successful model for the origin of this short-range force is the exchange 
force. Suppose we have a neutron and a proton next to one another in the nucleus. 
The neutron emits a particle, on which it exerts a strong attractive force. The 
proton also exerts a strong force on the particle, perhaps strong enough to absorb 
the particle. The proton then emits a particle that can be absorbed by the neutron. 
The proton and neutron each exert a strong force on the exchanged particle, and 
thus they appear to exert a strong force on each other. The situation is similar to 
that shown in Figure 12.7, in which two people play catch with a ball to which 
each is attached by a spring. Each player exerts a force on the ball, and the effect 
is as if each exerted a force on the other. 

How can a neutron of rest energy $m_n c^2$ emit a particle of rest energy
$mc^2$ and still remain a neutron, without violating conservation of energy? The
answer to this question can be found from the uncertainty principle, $\Delta E\,\Delta t \sim \hbar$.

FIGURE 12.6 Dependence of nuclear binding energy on the separation distance of nucleons.

FIGURE 12.7 An attractive exchange force.
We don’t know that energy has been conserved unless we measure it, and we can’t
measure it more accurately than the uncertainty $\Delta E$ in a time interval $\Delta t$. We
can therefore “violate” energy conservation by an amount $\Delta E$ for a time interval
of at most $\Delta t = \hbar/\Delta E$. The amount by which energy conservation is violated in
our exchange force model is $mc^2$, the rest energy of the exchanged particle. This
particle can thus exist only for a time interval (in the laboratory frame) of at most

$$\Delta t = \frac{\hbar}{mc^2} \tag{12.7}$$

The longest distance this particle can possibly travel in the time $\Delta t$ is $x = c\,\Delta t$,
since it can’t move faster than the speed of light. With $x = c\,\Delta t = c\hbar/mc^2$, we
then have a relationship between the maximum range of the exchange force and
the rest energy of the exchanged particle:

$$mc^2 = \frac{\hbar c}{x} \tag{12.8}$$

Inserting into this expression an estimate for the range of the nuclear force of
$10^{-15}$ m or 1 fm, we can estimate the rest energy of the exchanged particle:

$$mc^2 = \frac{\hbar c}{x} = \frac{200\ \text{MeV}\cdot\text{fm}}{1\ \text{fm}} = 200\ \text{MeV}$$




The exchanged particle cannot be observed in the laboratory during the exchange, 
for to do so would violate energy conservation. However, if we provide energy to 
the nucleons from an external source (for example, by causing a nucleus to absorb 
a photon), the “borrowed” energy can be repaid and the particle can be observed. 
When we carry out this experiment, the nucleus is found to emit pi mesons 
(pions), which have a rest energy of 140 MeV, remarkably close to our estimate of 
200 MeV. Many observable properties of the nuclear force have been successfully 
explained by a model based on the exchange of pions. We discuss the properties 
of pions in Chapter 14. Other exchanged particles contribute to different aspects 
of the nuclear force. For example, an exchanged particle is responsible for the 
repulsive part of the force at very short range, which keeps the nucleons from all 
collapsing toward the center of the nucleus (see Problem 10). 
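The range estimate of Eq. 12.8 can be run in both directions: from an assumed range to a rest energy, or from a known rest energy to a maximum range. The sketch below uses the standard value of 197.3 MeV fm for the product of the reduced Planck constant and c, where the text rounds to 200 MeV fm; the function names are illustrative.

```python
HBAR_C_MEV_FM = 197.3  # hbar*c in MeV*fm (the text rounds this to 200)


def exchanged_rest_energy_mev(range_fm):
    """Rest energy of the exchanged particle for a given force range (Eq. 12.8)."""
    return HBAR_C_MEV_FM / range_fm


def max_range_fm(rest_energy_mev):
    """Maximum range of a force carried by a particle of given rest energy."""
    return HBAR_C_MEV_FM / rest_energy_mev


if __name__ == "__main__":
    print(f"range 1 fm -> mc^2 of about {exchanged_rest_energy_mev(1.0):.0f} MeV")
    print(f"pion (140 MeV) -> range of about {max_range_fm(140.0):.1f} fm")
```

Run in reverse with the pion rest energy of 140 MeV, the same relation gives a range of about 1.4 fm, consistent with the 1-fm scale assumed at the start.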


12.5 QUANTUM STATES IN NUCLEI 


Ideally we would like to solve the Schrödinger equation using the nuclear potential
energy. This process, if it were possible, would give us a set of energy levels for 
the protons and neutrons that we could then compare with experiment (just as we 
did for the energy levels of electrons in atoms). Unfortunately, we cannot carry 
through with this program for several reasons: the nuclear potential energy cannot 
be expressed in a convenient analytical form, and it is not possible to solve the 
nuclear many-body problem except by approximation. 

Nevertheless, we can make some simplifications that allow us to analyze the 
structure and properties of nuclei by using techniques already introduced in this
book. We’ll represent the nuclear potential energy as a finite potential well of
radius R equal to the nuclear radius. That confines the nucleons to a nucleus-sized 
region and allows them to move freely inside that region. 

Let’s consider $^{125}$Te (a nucleus very close to the center of the range of nuclei),
which we analyzed in Example 12.5. The width of the potential energy well is equal
to the nuclear radius, which we find from Eq. 12.1: $R = (1.2\ \text{fm})(125)^{1/3} = 6.0\ \text{fm}$.
The second quantity we need to know is the depth of the potential energy well.
We’ll consider the neutrons and protons separately. The 73 neutrons in $^{125}$Te will
fill a series of energy levels in the potential energy well. The top of the well is at
E = 0 (above which the neutrons would become free). The bottom of the well is
at a negative energy $-U_0$. The neutrons fill the levels in the well starting at $-U_0$
and ending not at energy zero but at energy $-6.6$ MeV, as we found in Example
12.5. That is, we must add a minimum of 6.6 MeV to raise the least tightly bound 
neutron out of the well and turn it into a free neutron. 

To find the energy difference between the bottom of the well and the highest 
filled state, we can consider the nucleus to be a “gas” of neutrons and protons 
whose energies are described by the Fermi-Dirac distribution (Chapter 10). A 
statistical distribution is intended to describe systems with large numbers of 
particles, but it should be a reasonable rough approximation for our “gas” of 73 
neutrons. To find the energy of the highest filled state, we need the Fermi energy
of the neutrons (using Eq. 10.50 with $V = \frac{4}{3}\pi R^3 = 900\ \text{fm}^3$ as the volume of the
$^{125}$Te nucleus):

$$E_F = \frac{h^2}{2m}\left(\frac{3N}{8\pi V}\right)^{2/3} = \frac{h^2c^2}{2mc^2}\left(\frac{3N}{8\pi V}\right)^{2/3} = \frac{(1240\ \text{MeV}\cdot\text{fm})^2}{2(940\ \text{MeV})}\left(\frac{3(73)}{8\pi(900\ \text{fm}^3)}\right)^{2/3} = 37.0\ \text{MeV}$$

Figure 12.8 shows the resulting potential energy well for neutrons. The depth of
the well is the sum of the neutron separation energy $S_n$ and the Fermi energy:
$U_0 = S_n + E_F = 6.6\ \text{MeV} + 37.0\ \text{MeV} = 43.6\ \text{MeV}$.

A similar calculation for the 52 protons in $^{125}$Te gives $E_F = 29.5$ MeV. For the
protons, $S_p + E_F = 8.7\ \text{MeV} + 29.5\ \text{MeV} = 38.2\ \text{MeV}$, much less than the depth
we determined for the neutron well. The difference between the depths of the 
neutron and proton wells is due to the Coulomb repulsion energy of the protons, 
which makes the protons less tightly bound than the neutrons. Figure 12.9 gives 
a representation of the proton states in their potential energy well. 
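The well-depth estimate for neutrons and protons can be reproduced with a short calculation. This is a sketch under the same assumptions as the text: a Fermi gas confined to a sphere of radius 6.0 fm, hc = 1240 MeV fm, and a nucleon rest energy of 940 MeV for both species. The function names are illustrative.

```python
import math

HC_MEV_FM = 1240.0       # h*c in MeV*fm
NUCLEON_MC2_MEV = 940.0  # nucleon rest energy used in the text


def fermi_energy_mev(n_particles, volume_fm3, rest_energy_mev=NUCLEON_MC2_MEV):
    """Fermi energy E_F = (hc)^2 / (2 mc^2) * (3N / (8 pi V))^(2/3)."""
    prefactor = HC_MEV_FM ** 2 / (2.0 * rest_energy_mev)
    return prefactor * (3.0 * n_particles / (8.0 * math.pi * volume_fm3)) ** (2.0 / 3.0)


def well_depth_mev(separation_energy_mev, fermi_mev):
    """Depth of the well: highest filled level sits S below E = 0."""
    return separation_energy_mev + fermi_mev


if __name__ == "__main__":
    radius_fm = 6.0  # 125Te, from R = (1.2 fm) A^(1/3)
    volume = (4.0 / 3.0) * math.pi * radius_fm ** 3
    ef_n = fermi_energy_mev(73, volume)
    ef_p = fermi_energy_mev(52, volume)
    print(f"neutrons: E_F = {ef_n:.1f} MeV, depth = {well_depth_mev(6.6, ef_n):.1f} MeV")
    print(f"protons:  E_F = {ef_p:.1f} MeV, depth = {well_depth_mev(8.7, ef_p):.1f} MeV")
```

The result is mildly sensitive to the volume: rounding V to 900 fm^3 before evaluating shifts the Fermi energies by about 0.1 MeV.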


Quantum States and Radioactive Decay 


Figure 12.10 shows the protons and neutrons near the top of their potential energy 
wells. Note that we can add energy to the nucleus that is less than the proton or 
neutron separation energies. In the region between $E = -S_n$ or $-S_p$ and $E = 0$
are the nuclear excited states in which a proton or a neutron can absorb energy 
and move from its ground state to one of the unoccupied higher states. As was 
the case with atoms, the nucleus can make transitions from excited states to lower 
excited states or to the ground state by photon emission. In the case of nuclei, 
those photons are called gamma rays and have typical energies of 0.1 MeV to a 
few MeV. 
FIGURE 12.8 Neutron states in a potential energy well for the 73 neutrons of $^{125}$Te.

FIGURE 12.9 Proton states in a potential energy well for the 52 protons of $^{125}$Te.

It is also possible to have other nuclear transformations that can be represented
in Figure 12.10. It is clearly not possible for this nucleus spontaneously to emit
a proton or a neutron: we have seen that it takes many MeV to boost a bound
proton or neutron to a free state. However, it is possible simultaneously to boost
two protons and two neutrons and form them into an alpha particle ($^4_2\text{He}_2$). If the
energy gained in the formation of the alpha particle (its binding energy, 28.3 MeV)
is greater than the sum of the four separation energies, there will be a net energy
gain in the process; this energy can appear as the kinetic energy of the alpha
particle that is emitted by the nucleus. This process is called nuclear alpha decay.
You can see from the neutron and proton separation energies that this process
does not occur for $^{125}$Te.
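The energy bookkeeping for alpha emission can be sketched as a rough check. Note the simplifying assumption, which is ours and not the text's: the separation energy of the second proton (or neutron) is taken to be the same as that of the first, whereas the true successive separation energies differ somewhat. The function name is hypothetical.

```python
ALPHA_BINDING_MEV = 28.3  # binding energy of the alpha particle, from the text


def alpha_energy_estimate_mev(s_p_mev, s_n_mev):
    """Rough net energy for alpha emission.

    Assumes (for illustration only) that both protons cost s_p and both
    neutrons cost s_n to remove. A negative result means no net energy gain.
    """
    return ALPHA_BINDING_MEV - 2.0 * (s_p_mev + s_n_mev)


if __name__ == "__main__":
    # Separation energies of 125Te quoted earlier in the chapter
    net = alpha_energy_estimate_mev(8.691, 6.569)
    verdict = "possible" if net > 0 else "not possible"
    print(f"net energy = {net:.1f} MeV, so alpha emission is {verdict} in this estimate")
```

For $^{125}$Te the estimate is negative by about 2 MeV, in line with the statement that alpha decay does not occur for this nucleus.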

Another type of transformation occurs under certain circumstances when a 
neutron changes into a proton and drops into one of the empty proton states at 
lower energy. Under other circumstances, in which the proton levels are higher 
and the neutron levels are lower, a proton can transform into a neutron and drop 
into one of the empty neutron states. This process is called nuclear beta decay. It 
is not always obvious from diagrams such as Figure 12.10 whether this type of 
transformation will occur, because changing a neutron to a proton increases the net 
Coulomb energy of the nucleus and thus increases the energy of all of the proton 
states. Neither the neutron-to-proton nor the proton-to-neutron transformation can 
occur for $^{125}$Te.


12.6 RADIOACTIVE DECAY 


Figure 12.11 shows a plot of all the known nuclei, with stable nuclei indicated by 
dark shading. For the lighter stable nuclei, the neutron and proton numbers are 
roughly equal. However, for the heavy stable nuclei, the factor Z(Z — 1) in the 
Coulomb repulsion energy grows rapidly, so extra neutrons are required to supply 
the additional binding energy needed for stability. For this reason, all heavy stable 
nuclei have N > Z. 


FIGURE 12.11 Stable nuclei are shown in color; known radioactive nuclei are in light shading. (Axes: proton number Z, from 0 to 100, versus neutron number N, from 0 to 150.)


Most of the nuclei represented in Figure 12.11 are unstable, which means that 
they transform themselves into more stable nuclei by changing their Z and N 
through alpha decay (emission of $^4$He) or beta decay (changing a neutron to a
proton or a proton to a neutron). Nuclei in excited states are also unstable; they
make transitions to their ground states through gamma decay (emission of photons). The
three decay processes (alpha, beta, and gamma decay) are examples of the general 
subject of radioactive decay. In the remainder of this section, we establish some 
of the basic properties of radioactive decay, and in the following sections we treat 
alpha, beta, and gamma decay separately. 

The rate at which unstable radioactive nuclei decay in a sample of material is 
called the activity of the sample. The greater the activity, the more nuclear decays 
per second. (The activity has nothing to do with the kind of decays or of radiations 
emitted by the sample, or with the energy of the emitted radiations. The activity 
is determined only by the number of decays per second.) 

The basic unit for measuring activity is the curie.* Originally, the curie was 
defined as the activity of one gram of radium; that definition has since been 
replaced by a more convenient one: 


$$1\ \text{curie (Ci)} = 3.7 \times 10^{10}\ \text{decays/s}$$

One curie is quite a large activity, and so we work more often with units of
millicurie (mCi), equal to $10^{-3}$ Ci, and microcurie ($\mu$Ci), equal to $10^{-6}$ Ci.

*The SI unit of activity is the becquerel (Bq), named for Henri Becquerel, the French scientist who
discovered radioactivity in 1896. One becquerel equals one decay/s, so 1 Ci = $3.7 \times 10^{10}$ Bq.




Marie Curie (1867-1934, Poland-France). Her pioneering studies of the
natural radioactivity of radium and other elements earned her two Nobel
Prizes, the physics prize in 1903 for the discovery of radioactivity (shared
with Henri Becquerel and with her husband, Pierre) and the unshared
chemistry prize in 1911 for the isolation of pure radium. She established
the Institute of Radium at the University of Paris, where she continued to
pursue research in the medical applications of radioactive materials. Her
daughter Irene was awarded the 1935 Nobel Prize in chemistry for the
discovery of artificial radioactivity.





Consider a sample with a mass of a few grams, containing on the order of $10^{23}$ atoms.
If the activity were as large as 1 Ci, about $10^{10}$ of the nuclei in the sample would
decay every second. We could also say that for any one nucleus, the probability of
decaying during each second is about $10^{10}/10^{23}$ or $10^{-13}$. This quantity, the decay
probability per nucleus per second, is called the decay constant (represented by
the symbol $\lambda$). We assume that $\lambda$ is a small number, and that it is constant in time
for any particular material: the probability of any one nucleus decaying doesn’t
depend on the age of the sample. The activity $a$ depends on the number $N$ of
radioactive nuclei in the sample and also on the probability $\lambda$ for each nucleus to
decay:

$$a = \lambda N \tag{12.9}$$


which is equivalent to decays/s = decays/s per nucleus x number of nuclei. 

Both $a$ and $N$ are functions of the time $t$. As our sample decays, $N$ certainly
decreases: there are fewer radioactive nuclei left. If $N$ decreases and $\lambda$ is
constant, then $a$ must also decrease with time, and so the number of decays per
second becomes smaller with increasing time.

We can regard $a$ as the change in the number of radioactive nuclei per unit
time: the more nuclei decay per second, the larger is $a$.

$$a = -\frac{dN}{dt} \tag{12.10}$$

(A minus sign must be present because $dN/dt$ is negative ($N$ is decreasing with
time), and we want $a$ to be a positive number.) Combining Eqs. 12.9 and 12.10
we have $dN/dt = -\lambda N$, or

$$\frac{dN}{N} = -\lambda\,dt \tag{12.11}$$

This equation can be integrated directly to yield

$$N = N_0 e^{-\lambda t} \tag{12.12}$$

where $N_0$ represents the number of radioactive nuclei originally present at $t = 0$.
Equation 12.12 is the exponential law of radioactive decay, which tells us how
the number of radioactive nuclei in a sample decreases with time. We can’t easily
measure $N$, but we can put this equation in a more useful form by multiplying on
both sides by $\lambda$, which gives

$$a = a_0 e^{-\lambda t} \tag{12.13}$$

where $a_0$ is the original activity ($a_0 = \lambda N_0$).

Suppose we count the number of decays of our sample in one second (by 
counting for one second the radiations resulting from the decays). Repeating the 
measurement, we could then plot the activity a as a function of time, as shown in 
Figure 12.12. This plot shows the exponential dependence expected on the basis 
of Eq. 12.13. 

It is often more useful to plot $a$ as a function of $t$ on a semilogarithmic scale,
as shown in Figure 12.13. On this kind of plot, Eq. 12.13 gives a straight line of
slope $-\lambda$.
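The straight-line behavior on a semilog plot suggests a simple way to extract the decay constant from measured activities: fit a line to ln(a) versus t and take the negative of the slope. The sketch below does this with an ordinary least-squares fit; the data are synthetic and noise-free, generated from assumed illustrative values of the decay constant and initial activity, and the function name is hypothetical.

```python
import math


def fit_decay_constant(times_s, activities):
    """Least-squares line through (t, ln a); returns (decay constant, a_0)."""
    ys = [math.log(a) for a in activities]
    n = len(times_s)
    t_mean = sum(times_s) / n
    y_mean = sum(ys) / n
    sxx = sum((t - t_mean) ** 2 for t in times_s)
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_s, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * t_mean
    return -slope, math.exp(intercept)


if __name__ == "__main__":
    # Synthetic data: one reading per day for a week (assumed values)
    true_lambda = 2.97e-6  # s^-1
    true_a0 = 9.03e9       # Bq
    times = [day * 86400.0 for day in range(8)]
    readings = [true_a0 * math.exp(-true_lambda * t) for t in times]

    lam, a0 = fit_decay_constant(times, readings)
    print(f"fitted decay constant = {lam:.3e} s^-1, a_0 = {a0:.3e} Bq")
```

With real counting data the points scatter about the line, and the same fit then gives a best estimate of the decay constant instead of an exact recovery.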
FIGURE 12.12 Activity of a radioactive sample as a function of time.

FIGURE 12.13 Semilog plot of activity versus time.


The half-life, $t_{1/2}$, of the decay is the time that it takes for the activity to be
reduced by half, as shown in Figure 12.12. That is, when $t = t_{1/2}$, $a = \frac{1}{2}a_0 = a_0 e^{-\lambda t_{1/2}}$,
from which we find

$$t_{1/2} = \frac{1}{\lambda}\ln 2 = \frac{0.693}{\lambda} \tag{12.14}$$

Another useful parameter is the mean lifetime $\tau$ (see Problem 37):

$$\tau = \frac{1}{\lambda} \tag{12.15}$$

When $t = \tau$, $a = a_0 e^{-1} = 0.37a_0$.





Example 12.6

The half-life of $^{198}$Au is 2.70 days. (a) What is the decay
constant of $^{198}$Au? (b) What is the probability that any
$^{198}$Au nucleus will decay in one second? (c) Suppose
we had a 1.00-$\mu$g sample of $^{198}$Au. What is its activity?
(d) How many decays per second occur when the sample
is one week old?

Solution
(a)

$$\lambda = \frac{0.693}{t_{1/2}} = \frac{0.693}{2.70\ \text{d}} \times \frac{1\ \text{d}}{24\ \text{h}} \times \frac{1\ \text{h}}{3600\ \text{s}} = 2.97 \times 10^{-6}\ \text{s}^{-1}$$

(b) The decay probability per second is just the decay
constant, so the probability of any $^{198}$Au nucleus decaying
in one second is $2.97 \times 10^{-6}$.

(c) The number of atoms in the sample is determined from
the Avogadro constant $N_A$ and the molar mass $M$:

$$N = \frac{m N_A}{M} = \frac{(1.00 \times 10^{-6}\ \text{g})(6.02 \times 10^{23}\ \text{atoms/mole})}{198\ \text{g/mole}} = 3.04 \times 10^{15}\ \text{atoms}$$

$$a = \lambda N = (2.97 \times 10^{-6}\ \text{s}^{-1})(3.04 \times 10^{15}) = 9.03 \times 10^{9}\ \text{Bq} = 0.244\ \text{Ci}$$

(d) The activity decays according to Eq. 12.13:

$$a = a_0 e^{-\lambda t} = (9.03 \times 10^{9}\ \text{Bq})\,e^{-(2.97 \times 10^{-6}\ \text{s}^{-1})(7\ \text{d})(86{,}400\ \text{s/d})} = 1.50 \times 10^{9}\ \text{Bq}$$
Example 12.7

The half-life of $^{235}$U is $7.04 \times 10^{8}$ y. A sample of rock,
which solidified with the Earth $4.55 \times 10^{9}$ years ago, contains
N atoms of $^{235}$U. How many $^{235}$U atoms did the same
rock have at the time it solidified?

Solution
The age of the rock corresponds to

$$\frac{4.55 \times 10^{9}\ \text{y}}{7.04 \times 10^{8}\ \text{y}} = 6.46\ \text{half-lives}$$

Each half-life reduces N by a factor of 2, so the overall
reduction in N has been $2^{6.46} = 88.2$. The original rock
therefore contained 88.2N atoms of $^{235}$U.
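The decay-law arithmetic in the two preceding worked problems follows a common pattern, sketched here in Python. The numerical inputs are the ones quoted in the text; the function names are illustrative, and the Avogadro constant is rounded to 6.02 x 10^23 per mole as in the solution.

```python
import math

SECONDS_PER_DAY = 86400.0
AVOGADRO = 6.02e23      # atoms per mole (rounded as in the text)
BQ_PER_CURIE = 3.7e10   # decays/s in one curie


def decay_constant(half_life):
    """Decay constant from the half-life (Eq. 12.14); units follow the input."""
    return math.log(2.0) / half_life


def number_of_atoms(mass_g, molar_mass_g):
    """Number of atoms in a sample of the given mass."""
    return mass_g * AVOGADRO / molar_mass_g


def remaining_fraction(elapsed, half_life):
    """Fraction of nuclei (or activity) left after the elapsed time."""
    return 0.5 ** (elapsed / half_life)


if __name__ == "__main__":
    # 198Au: half-life 2.70 d, 1.00-microgram sample
    lam = decay_constant(2.70 * SECONDS_PER_DAY)
    a0 = lam * number_of_atoms(1.00e-6, 198.0)
    a_week = a0 * remaining_fraction(7.0, 2.70)
    print(f"lambda = {lam:.3e} s^-1")
    print(f"a0 = {a0:.3e} Bq = {a0 / BQ_PER_CURIE:.3f} Ci")
    print(f"activity after one week = {a_week:.3e} Bq")

    # 235U: factor by which N has dropped since the rock solidified
    factor = 1.0 / remaining_fraction(4.55e9, 7.04e8)
    print(f"original 235U content = {factor:.1f} N")
```

Working with `remaining_fraction` in units of half-lives avoids converting the uranium half-life to seconds, since only the ratio of elapsed time to half-life matters.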





Conservation Laws in Radioactive Decays 


Our study of radioactive decays and nuclear reactions reveals that nature is not 
arbitrary in selecting the outcome of decays or reactions, but rather that certain 
laws limit the possible outcomes. We call these laws conservation laws, and 
we believe these laws give us important insight into the fundamental workings 
of nature. Several of these conservation laws are applied to radioactive decay 
processes. 


1. Conservation of Energy Perhaps the most important of the conservation 
laws, conservation of energy tells us which decays are energetically possible and 
enables us to calculate rest energies or kinetic energies of decay products. A 
nucleus X will decay into a lighter nucleus X′, with the emission of one or more
particles we call collectively x, only if the rest energy of X is greater than the total
rest energy of X′ + x. The excess rest energy is known as the Q value of the decay
X → X′ + x:

$$Q = [m_X - (m_{X'} + m_x)]c^2 \tag{12.16}$$

where the m’s represent the nuclear masses. The decay is possible only if this
Q value is positive. The excess energy Q appears as kinetic energy of the decay
products (assuming X is initially at rest):

$$Q = K_{X'} + K_x \tag{12.17}$$

2. Conservation of Linear Momentum If the initially decaying nucleus is
at rest, then the total linear momentum of all of the decay products must sum to
zero:

$$\vec{p}_{X'} + \vec{p}_x = 0 \tag{12.18}$$

Usually the emitted particle or particles x are much less massive than the residual
nucleus X′, and the recoil momentum $p_{X'}$ yields a very small kinetic energy $K_{X'}$.

If there is only one emitted particle x, Eqs. 12.17 and 12.18 can be solved
simultaneously for $K_{X'}$ and $K_x$. If x represents two or more particles, we have
more unknowns than we have equations, and no unique solution is possible. In 
this case, a range of values from some minimum to some maximum is permitted 
for the decay products. 


3. Conservation of Angular Momentum The total spin angular momentum 
of the initial particle before the decay must equal the total angular momentum 
(spin plus orbital) of all of the product particles after the decay. For example, the 
decay of a neutron (spin angular momentum = 1/2) into a proton plus an electron is
forbidden by conservation of angular momentum, because the spins of the proton 
and electron, each equal to 1/2, can be combined to give a total of either 0 or 1, 
neither of which is equal to the initial angular momentum of the neutron. Adding 
integer units of orbital angular momentum to the electron does not restore angular 
momentum conservation in this decay process. 


4. Conservation of Electric Charge This is such a fundamental part of all 
decay and reaction processes that it hardly needs elaborating. The total net electric 
charge before the decay must equal the net electric charge after the decay. 


5. Conservation of Nucleon Number In some decay processes, we can 
create particles (photons or electrons, for example) which did not exist before the 
decay occurred. (This of course must be done out of the available energy—that 
is, it takes 0.511 MeV of energy to create an electron.) However, nature does not 
permit us to create or destroy protons and neutrons, although in certain decay 
processes we can convert neutrons into protons or protons into neutrons. The total 
nucleon number A does not change in decay or reaction processes. In some decay 
processes, A remains constant because both Z and N remain unchanged; in other 
processes Z and N both change in such a way as to keep their sum constant. 


12.7 ALPHA DECAY 


In alpha decay, an unstable nucleus disintegrates into a lighter nucleus and an 
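alpha particle (a nucleus of $^{4}$He), according to


$$^{A}_{Z}\mathrm{X}_{N} \rightarrow {}^{A-4}_{Z-2}\mathrm{X}'_{N-2} + {}^{4}_{2}\mathrm{He}_{2}$$ (12.19)


where X and X’ represent different nuclei. For example,
$^{226}_{88}\mathrm{Ra}_{138} \rightarrow {}^{222}_{86}\mathrm{Rn}_{136} + {}^{4}_{2}\mathrm{He}_{2}$.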

Decay processes release energy, because the decay products are more tightly
bound than the initial nucleus. The energy released, which appears as the kinetic
energy of the alpha particle and the “daughter” nucleus X’, can be found from the
masses of the nuclei involved according to Eq. 12.16:


$$Q = [m(\mathrm{X}) - m(\mathrm{X}') - m(^{4}\mathrm{He})]c^2$$ (12.20)

[Figure 12.14: a nucleus X alpha decays, resulting in a nucleus X’ and an alpha particle.]


As we did in our calculations of binding energy, we can show that the electron 
masses cancel in Eq. 12.20, and so we can use atomic masses. This energy Q 
appears as kinetic energy of the decay products: 


$$Q = K_{X'} + K_\alpha$$ (12.21)


assuming we choose a reference frame in which the original atom X is at
rest. Linear momentum is also conserved in the decay process, as shown in
Figure 12.14, so that


$$p_\alpha = p_{X'}$$ (12.22)


From Eqs. 12.21 and 12.22 we eliminate $p_{X'}$ and $K_{X'}$, because we normally don’t
observe the daughter nucleus in the laboratory. Typical alpha decay energies are a
few MeV; thus the kinetic energies of the alpha particle and the nucleus are much
smaller than their corresponding rest energies, and so we can use nonrelativistic
mechanics to find

$$K_\alpha = \frac{A-4}{A}\,Q$$ (12.23)





Find the kinetic energy of the alpha particle emitted in the
alpha decay process $^{226}$Ra → $^{222}$Rn + $^{4}$He.


Solution
From Eq. 12.20 the Q value is

$$Q = [m(^{226}\mathrm{Ra}) - m(^{222}\mathrm{Rn}) - m(^{4}\mathrm{He})]c^2$$
$$= (226.025410\ \text{u} - 222.017578\ \text{u} - 4.002603\ \text{u})(931.5\ \text{MeV/u})$$
$$= 4.871\ \text{MeV}$$

The kinetic energy is given by Eq. 12.23:

$$K_\alpha = \frac{A-4}{A}\,Q = \frac{222}{226}\,(4.871\ \text{MeV}) = 4.785\ \text{MeV}$$
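The arithmetic of this example can be reproduced in a few lines. This is a minimal sketch: the function names are illustrative (they do not come from the text), and the inputs are the atomic masses quoted in the example.

```python
U_TO_MEV = 931.5  # energy equivalent of 1 u in MeV, as used in the text


def q_value(m_parent, m_daughter, m_emitted):
    """Q value in MeV from atomic masses in u (form of Eq. 12.20)."""
    return (m_parent - m_daughter - m_emitted) * U_TO_MEV


def alpha_kinetic_energy(q, mass_number):
    """Nonrelativistic alpha kinetic energy in MeV (form of Eq. 12.23)."""
    return (mass_number - 4) / mass_number * q


q_ra226 = q_value(226.025410, 222.017578, 4.002603)
k_alpha = alpha_kinetic_energy(q_ra226, 226)

print(f"Q = {q_ra226:.3f} MeV, K_alpha = {k_alpha:.3f} MeV")
```

The small difference between Q and the alpha kinetic energy is the recoil kinetic energy of the daughter nucleus.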





Table 12.2 shows some sample alpha decays and their half-lives. You can see 
from the table that small changes in the decay energy (about a factor of 2) result in 
enormous changes in the half-life (24 orders of magnitude)! For example, for the 
isotopes $^{232}$Th and $^{230}$Th (which have the same Z and therefore the same Coulomb
interaction between the alpha particle and the product nucleus) the kinetic energy 
changes by only 0.68 MeV (about 15%), while the half-life changes by about five 
orders of magnitude. Any successful calculation of the alpha decay probabilities 
must account for this sensitivity to the decay energy. 


TABLE 12.2 Some Alpha Decay Energies and Half-Lives

Isotope       $K_\alpha$ (MeV)    $t_{1/2}$                 $\lambda$ (s$^{-1}$)

$^{232}$Th    4.01                $1.4 \times 10^{10}$ y    $1.6 \times 10^{-18}$
$^{230}$Th    4.69                $7.5 \times 10^{4}$ y     $2.9 \times 10^{-13}$
$^{220}$Rn    6.29                56 s                      $1.2 \times 10^{-2}$
$^{218}$Th    9.85                0.11 μs                   $6.3 \times 10^{6}$




















Quantum Theory of Alpha Decay 


Alpha decay is an example of quantum-mechanical barrier penetration, as we 
discussed in Chapter 5. Suppose it is energetically possible for two neutrons and 
two protons to form an alpha particle, as represented in Figure 12.10. The alpha 
particle is trapped inside the nucleus by a barrier due to the Coulomb energy. The 
height of this barrier $U_B$ is the Coulomb potential energy of the alpha particle and
daughter nucleus at the radius R:


$$U_B = \frac{1}{4\pi\varepsilon_0}\,\frac{q_1 q_2}{R} = \frac{2(Z-2)e^2}{4\pi\varepsilon_0 R}$$ (12.24)

which gives 30 to 40 MeV for a typical heavy nucleus. Here $q_1 = 2e$ is the electric
charge of the alpha particle, and $q_2 = (Z - 2)e$ is the electric charge of the nucleus
after the decay, which is responsible for the Coulomb force.
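As a quick check of the 30 to 40 MeV figure, the barrier height of Eq. 12.24 can be evaluated directly. The sketch below makes two assumptions that are not stated in this passage: the standard value $e^2/4\pi\varepsilon_0 = 1.440$ MeV·fm, and a nuclear radius $R = 1.2\,A^{1/3}$ fm. The function names are illustrative.

```python
COULOMB_MEV_FM = 1.440  # e^2 / (4 pi epsilon_0) in MeV fm (standard value)
R0_FM = 1.2             # assumed radius parameter, R = R0 * A**(1/3)


def nuclear_radius_fm(mass_number):
    """Assumed nuclear radius in fm."""
    return R0_FM * mass_number ** (1 / 3)


def coulomb_barrier_mev(z_parent, radius_fm):
    """Barrier height of Eq. 12.24 for an alpha particle, in MeV."""
    return 2 * (z_parent - 2) * COULOMB_MEV_FM / radius_fm


# Radium-226 (Z = 88, A = 226) as a representative heavy nucleus.
radius = nuclear_radius_fm(226)
barrier = coulomb_barrier_mev(88, radius)

print(f"R = {radius:.2f} fm, U_B = {barrier:.1f} MeV")
```

A different radius convention shifts the result by a few MeV but leaves it far above the 4 to 8 MeV typical of emitted alpha particles.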

Figure 12.15 shows the potential energy barrier encountered by the alpha 
particle as it tries to leave the interior of the nucleus (r < R). The energy of the 
alpha particle is typically in the range of 4—8 MeV, and so it is impossible for the 
alpha particle to surmount the barrier; the only way the alpha particle can escape 
is to “tunnel” through the barrier. A representation of the alpha particle wave 
function as it tunnels through the barrier is shown in Figure 12.15. 

The probability per unit time λ for the alpha particle to appear in the laboratory
is the probability of its penetrating the barrier multiplied by the number of times
per second the alpha particle strikes the barrier in its attempt to escape. If the
alpha particle is moving at speed v inside a nucleus of radius R, it will strike the
barrier as it bounces back and forth inside the nucleus at time intervals of 2R/v.
In a heavy nucleus with R ~ 6 fm, the alpha particle strikes the “wall” of the nucleus
about $10^{21}$ times per second!

FIGURE 12.15 (a) The potential energy barrier for an alpha particle. (b)
A representation of the wave function of the alpha particle.

FIGURE 12.16 Replacing the Coulomb barrier for alpha decay with a
flat barrier of height $U_B$.

The probability for the alpha particle to penetrate the barrier can be found by
solving the Schrödinger equation for the potential energy shown in Figure 12.15.
To simplify this calculation, we can replace the Coulomb barrier with a “flat”
barrier, as shown in Figure 12.16. As we discussed in Chapter 5, the probability to
penetrate a potential energy barrier is determined by the exponential factor $e^{-2kL}$,
where L is the thickness of the barrier and where $k = \sqrt{(2m/\hbar^2)(U_B - E)}$ for a
barrier of height $U_B$ and a particle of energy E. The decay probability can then be
estimated as


$$\lambda = \frac{v}{2R}\, e^{-2kL}$$ (12.25)
which includes both the rate at which the particle strikes the barrier and its 
probability to penetrate it. By making suitable rough estimates for the thickness 
and height of the barrier (see Problem 41), you should be able roughly to reproduce 
the range of values for the decay probabilities given in Table 12.2. 
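The flat-barrier estimate can be sketched numerically as follows. This is only an illustration under loudly stated assumptions, none of which are prescribed by the text: the flat barrier is given a height halfway between the alpha energy and the Coulomb barrier height, a thickness equal to half the distance from R to the classical turning point, and the speed inside the nucleus is taken from the alpha kinetic energy. The constants $\hbar c = 197.327$ MeV·fm and $m_\alpha c^2 = 3727.38$ MeV are standard values, and all function names are hypothetical.

```python
import math

HBAR_C = 197.327        # MeV fm
ALPHA_MC2 = 3727.38     # MeV, rest energy of the alpha particle
C_FM_PER_S = 2.998e23   # speed of light in fm/s
COULOMB_MEV_FM = 1.440  # e^2 / (4 pi epsilon_0) in MeV fm


def rough_flat_barrier(energy, z_daughter, radius_fm):
    """Assumed (height, thickness) of a flat barrier replacing the Coulomb barrier."""
    u_top = 2 * z_daughter * COULOMB_MEV_FM / radius_fm
    turning_point = radius_fm * u_top / energy   # where Coulomb energy equals E
    height = 0.5 * (u_top + energy)              # assumption: mid-height
    thickness = 0.5 * (turning_point - radius_fm)  # assumption: half-width
    return height, thickness


def decay_probability(energy, height, thickness_fm, radius_fm):
    """lambda = (v / 2R) * exp(-2 k L), in s^-1."""
    k = math.sqrt(2 * ALPHA_MC2 * (height - energy)) / HBAR_C  # fm^-1
    speed = math.sqrt(2 * energy / ALPHA_MC2) * C_FM_PER_S     # fm/s
    return speed / (2 * radius_fm) * math.exp(-2 * k * thickness_fm)


radius = 7.4  # fm, assumed radius of a thorium-like nucleus
estimates = {}
for k_alpha in (4.01, 9.85):  # lowest and highest energies in Table 12.2
    height, thickness = rough_flat_barrier(k_alpha, 88, radius)
    estimates[k_alpha] = decay_probability(k_alpha, height, thickness, radius)
    print(f"K = {k_alpha} MeV: lambda ~ {estimates[k_alpha]:.1e} per second")
```

The absolute values depend strongly on the assumed height and thickness; the point of the sketch is the enormous spread in λ produced by a modest change in energy.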

An exact calculation of the decay probability can be done by replacing the 
Coulomb barrier with a series of thin, flat barriers that are chosen to fit the 
Coulomb barrier as closely as possible. This calculation was first done in 1928 by 
George Gamow and was one of the first successful applications of the quantum 
theory. 

Some nuclei can be unstable to the emission of other particles or collections 
of particles. Nuclei that have a large abundance of protons (those at the left-hand 
boundary of the light shaded region of Figure 12.11) may emit protons in a rare 
process similar to alpha decay. In this way they reduce their proton excess and 
move closer to stability. An example of this process is
$^{151}_{71}\mathrm{Lu}_{80} \rightarrow {}^{150}_{70}\mathrm{Yb}_{80} + p$.

Other nuclei have recently been shown to emit clusters of particles such as
$^{12}$C, $^{14}$C, or $^{20}$Ne. The following example illustrates this process.







The nucleus $^{226}$Ra decays by alpha emission with a half-life
of 1600 y. It also decays by emitting $^{14}$C. Find the Q value
for $^{14}$C emission and compare with that for alpha emission
(see Example 12.8).


Solution

If $^{226}$Ra emits $^{14}$C, which contains 6 protons and 8 neutrons,
the resulting nucleus is $^{212}$Pb, so the decay process
is $^{226}$Ra → $^{212}$Pb + $^{14}$C. The Q value can be found from
Eq. 12.16, where we can again use atomic masses because
the electron masses cancel.


$$Q = [m(^{226}\mathrm{Ra}) - m(^{212}\mathrm{Pb}) - m(^{14}\mathrm{C})]c^2$$
$$= (226.025410\ \text{u} - 211.991898\ \text{u} - 14.003242\ \text{u})(931.5\ \text{MeV/u})$$
$$= 28.197\ \text{MeV}$$


Even though the Q value far exceeds the Q value for alpha
decay (4.871 MeV), the Coulomb barrier for $^{14}$C decay is
roughly 3 times higher and thicker than it is for alpha decay
[change $q_1 q_2$ to $6(Z - 6)e^2$ in Eq. 12.24]. As a result, the
probability for $^{14}$C decay turns out to be only about $10^{-9}$
of the probability for alpha decay; that is, $^{226}$Ra emits one
$^{14}$C for every $10^{9}$ alpha particles. See Problem 42 for a
calculation of the relative decay probabilities.
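The comparison in this example can be checked with a short calculation. The sketch below uses the masses quoted in the example; the ratio of charge products is offered only as a rough indicator of the relative barrier heights (it ignores the difference in the radius at which the two barriers are evaluated), and the function name is illustrative.

```python
U_TO_MEV = 931.5  # MeV per u, as used in the text


def q_value(m_parent, m_daughter, m_emitted):
    """Q value in MeV from atomic masses in u (form of Eq. 12.16)."""
    return (m_parent - m_daughter - m_emitted) * U_TO_MEV


q_c14 = q_value(226.025410, 211.991898, 14.003242)
q_alpha = q_value(226.025410, 222.017578, 4.002603)

# Charge products q1*q2 in units of e^2 for a parent with Z = 88.
z_parent = 88
product_c14 = 6 * (z_parent - 6)
product_alpha = 2 * (z_parent - 2)
charge_ratio = product_c14 / product_alpha

print(f"Q(14C) = {q_c14:.3f} MeV, Q(alpha) = {q_alpha:.3f} MeV")
print(f"q1*q2 ratio (14C / alpha) = {charge_ratio:.2f}")
```

The charge-product ratio comes out close to 3, consistent with the statement that the barrier for carbon emission is roughly three times higher.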
12.8 BETA DECAY


In beta decay a neutron in the nucleus changes into a proton (or a proton into a 
neutron); Z and N each change by one unit, but A doesn’t change. The emitted 
particles, which were called beta particles when first observed in 1898, were soon 
identified as electrons. In the most basic beta decay process, a free neutron decays 
into a proton and an electron: $n \rightarrow p + e^-$ (plus a third particle, as we discuss later).

The emitted electron is not one of the orbital electrons of the atom. It also is 
not an electron that was previously present within the nucleus, for as we have seen 
(Example 4.7) the uncertainty principle forbids electrons of the observed energies 
to exist inside the nucleus. The electron is “manufactured” by the nucleus out of 
the available energy. If the rest energy difference between the nuclei is at least 
$m_e c^2$, this will be possible.

In the 1910s and 1920s, beta decay experiments revealed two difficulties.
First, the decay $n \rightarrow p + e^-$ appears to violate the law of conservation of
angular momentum, as we discussed in Section 12.6. Second, measurements
of the energy of the emitted electrons showed that the energy spectrum of the
electrons is continuous, from zero up to some maximum value $K_{\max}$, as shown
in Figure 12.17. This implies an apparent violation of conservation of energy,
because all electrons should emerge from the decay $n \rightarrow p + e^-$ with precisely
the same energy. Instead, all electrons emerge with less energy, but in varying
amounts.

For example, in the decay $n \rightarrow p + e^-$, the Q value is


$$Q = (m_n - m_p - m_e)c^2 = 0.782\ \text{MeV}$$ (12.26)


Except for a very small correction, which accounts for the recoil energy of the
proton, all of this energy should appear as kinetic energy of the electron, and all
emitted electrons should have exactly this energy. However, experiments in the
1920s showed that all the emitted electrons have less than this energy; they have
a continuous range of energies from 0 to 0.782 MeV.

The problem of this “missing” energy was very puzzling until 1930 when 
Wolfgang Pauli found the ingenious solution to both the apparent violations of 
conservation of angular momentum and energy—he suggested that there is a 
third particle emitted in beta decay. Electric charge is already conserved by the 
proton and electron, so this new particle cannot have electric charge. If it has spin 
1/2, it will satisfy conservation of angular momentum, because we can combine 
the spins of the three decay particles to give 1/2, which matches the spin of the
original decaying neutron. The “missing” energy is the energy carried away by 
this third particle, and the observed fact that the energy spectrum extends all the 
way to the value 0.782 MeV suggests that this particle has a very small mass. 

This new particle is called the neutrino (“little neutral one” in Italian) and has
the symbol ν. As we discuss in Chapter 14, every particle has an antiparticle, and
the antiparticle of the neutrino is the antineutrino $\bar{\nu}$. It is, in fact, the antineutrino
that is emitted in neutron beta decay. The complete decay process is thus


$$n \rightarrow p + e^- + \bar{\nu}$$ (12.27)

FIGURE 12.17 Spectrum of electrons emitted in beta decay. (Vertical axis: number
of electrons. Horizontal axis: kinetic energy of electrons, from 0 to $K_{\max}$.)
Neutron decay can also occur in a nucleus, in which a nucleus with Z protons
and N neutrons decays to a nucleus with Z + 1 protons and N − 1 neutrons:


$$^{A}_{Z}\mathrm{X}_{N} \rightarrow {}^{A}_{Z+1}\mathrm{X}'_{N-1} + e^- + \bar{\nu}$$ (12.28)

The Q value for this decay is

$$Q = [m(^{A}\mathrm{X}) - m(^{A}\mathrm{X}')]c^2$$ (12.29)


It can be shown (Problem 23) that the electron masses cancel in calculating Q, so
it is atomic masses that appear in Eq. 12.29. The antineutrino does not appear in
the calculation of the Q value because its mass is negligibly small (of the order of
eV/$c^2$, compared with the atomic masses measured in units of $10^{3}$ MeV/$c^2$).

The energy released in the decay (the Q value) appears as the energy $E_\nu$ of the
antineutrino, the kinetic energy $K_e$ of the electron, and a small (usually negligible)
recoil kinetic energy of the nucleus X’:


$$Q = E_\nu + K_e + K_{X'} \cong E_\nu + K_e$$ (12.30)


The electron (which must be treated relativistically, because its kinetic energy is 
not small compared with its rest energy) has its maximum kinetic energy when 
the antineutrino has a negligibly small energy. Figure 12.17 shows the energy 
distribution of electrons emitted in a typical negative beta decay. The electron 
and neutrino share the decay energy Q; the kinetic energy of the electron (equal 
to $Q - E_\nu$) ranges from 0 (when the neutrino has its maximum energy, $E_\nu = Q$)
to Q (when $E_\nu = 0$).

Another beta decay process is


$$p \rightarrow n + e^+ + \nu$$ (12.31)


in which a positive electron, or positron, is emitted. The positron is the antiparticle
of the electron; it has the same mass as the electron but the opposite electric
charge. The neutrino emitted in this process is similarly the antiparticle of the
antineutrino that is emitted in neutron beta decay.

Proton beta decay has a negative Q value, and so it is never observed in nature
for free protons. (This is indeed fortunate—if the free proton were unstable to 
beta decay, stable hydrogen atoms, the basic material of the universe, could not 
exist!) However, protons in some nuclei can undergo this decay process: 


$$^{A}_{Z}\mathrm{X}_{N} \rightarrow {}^{A}_{Z-1}\mathrm{X}'_{N+1} + e^+ + \nu$$ (12.32)

The Q value for this process is (Problem 23)


$$Q = [m(^{A}\mathrm{X}) - m(^{A}\mathrm{X}') - 2m_e]c^2$$ (12.33)


in which the masses are atomic masses. In this case, the positron and neutrino 
share the decay energy Q (again neglecting the small recoil energy of the nucleus 
X’). Figure 12.18 shows the energy distribution of positrons emitted in a typical 
positive beta decay. 

A nuclear decay process that competes with positron emission is electron 
capture; the basic electron capture process is 


$$p + e^- \rightarrow n + \nu$$ (12.34)


in which a proton captures an atomic electron from its orbit and converts into a 
neutron plus a neutrino. The electron necessary for this process is one of the inner 
orbital electrons in an atom, and we identify the capture process by the shell from 
which the captured electron comes: K-shell capture, L-shell capture, and so forth. 
(Electrons in the orbits that come closest to, or even penetrate, the nucleus have the
highest probability of being captured.) In nuclei the process is


$$^{A}_{Z}\mathrm{X}_{N} + e^- \rightarrow {}^{A}_{Z-1}\mathrm{X}'_{N+1} + \nu$$ (12.35)

and the Q value, using atomic masses, is

$$Q = [m(^{A}\mathrm{X}) - m(^{A}\mathrm{X}')]c^2$$ (12.36)


In this case, neglecting the small initial kinetic energy of the electron and the 
recoil energy of the nucleus, the neutrino takes all of the available final energy: 


$$E_\nu = Q$$ (12.37)


In contrast to other beta-decay processes, a monoenergetic neutrino is emitted in 
electron capture. 

Table 12.3 gives some typical beta decay processes, along with their Q values
and half-lives.


TABLE 12.3 Typical Beta Decay Processes

Decay                                                      Type         Q (MeV)    $t_{1/2}$

$^{19}$O → $^{19}$F + $e^-$ + $\bar{\nu}$                  $\beta^-$    4.82       27 s
$^{176}$Lu → $^{176}$Hf + $e^-$ + $\bar{\nu}$              $\beta^-$    1.19       $3.6 \times 10^{10}$ y
$^{25}$Al → $^{25}$Mg + $e^+$ + $\nu$                      $\beta^+$    3.26       7.2 s
$^{124}$I → $^{124}$Te + $e^+$ + $\nu$                     $\beta^+$    2.14       4.2 d
$^{15}$O + $e^-$ → $^{15}$N + $\nu$                        EC           2.75       122 s
$^{170}$Tm + $e^-$ → $^{170}$Er + $\nu$                    EC           0.31       129 d

FIGURE 12.18 Spectrum of positrons emitted in beta decay.

Example 12.10





$^{23}$Ne decays to $^{23}$Na by negative beta emission. What is
the maximum kinetic energy of the emitted electrons?


Solution

This decay is of the form given by Eq. 12.28,
$^{23}$Ne → $^{23}$Na + $e^-$ + $\bar{\nu}$, and the Q value is found from Eq. 12.29,
using atomic masses:


$$Q = [m(^{23}\mathrm{Ne}) - m(^{23}\mathrm{Na})]c^2$$
$$= (22.994467\ \text{u} - 22.989769\ \text{u})(931.5\ \text{MeV/u})$$
$$= 4.376\ \text{MeV}$$


Neglecting the small correction for the kinetic energy of
the recoiling nucleus, the maximum kinetic energy of the
electrons is equal to this value (which occurs when the
neutrino has a negligible energy).


$^{40}$K is an unusual isotope, in that it decays by negative
beta emission, positive beta emission, and electron capture.
Find the Q values for these decays.


Solution

The process for negative beta decay is given by Eq. 12.28,
$^{40}$K → $^{40}$Ca + $e^-$ + $\bar{\nu}$, and the Q value is found from
Eq. 12.29 using atomic masses:


$$Q_{\beta^-} = [m(^{40}\mathrm{K}) - m(^{40}\mathrm{Ca})]c^2$$
$$= (39.963998\ \text{u} - 39.962591\ \text{u})(931.5\ \text{MeV/u})$$
$$= 1.311\ \text{MeV}$$


Equation 12.32 gives the decay process for positive beta
emission, $^{40}$K → $^{40}$Ar + $e^+$ + $\nu$, and the Q value is given
by Eq. 12.33:


$$Q_{\beta^+} = [m(^{40}\mathrm{K}) - m(^{40}\mathrm{Ar}) - 2m_e]c^2$$
$$= [39.963998\ \text{u} - 39.962383\ \text{u} - 2(0.000549\ \text{u})](931.5\ \text{MeV/u})$$
$$= 0.482\ \text{MeV}$$


For electron capture, $^{40}$K + $e^-$ → $^{40}$Ar + $\nu$, and from
Eq. 12.36:

$$Q_{\mathrm{ec}} = [m(^{40}\mathrm{K}) - m(^{40}\mathrm{Ar})]c^2$$
$$= (39.963998\ \text{u} - 39.962383\ \text{u})(931.5\ \text{MeV/u})$$
$$= 1.504\ \text{MeV}$$
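The three Q values for potassium-40 differ only in which mass formula is applied, which makes them easy to tabulate in code. The following is a minimal sketch using the atomic masses quoted above; the function names are illustrative and do not come from the text.

```python
U_TO_MEV = 931.5           # MeV per u, as used in the text
ELECTRON_MASS_U = 0.000549  # electron mass in u, as used in the text


def q_beta_minus(m_parent, m_daughter):
    """Negative beta decay, form of Eq. 12.29 (atomic masses in u)."""
    return (m_parent - m_daughter) * U_TO_MEV


def q_beta_plus(m_parent, m_daughter):
    """Positron emission, form of Eq. 12.33: two electron masses are subtracted."""
    return (m_parent - m_daughter - 2 * ELECTRON_MASS_U) * U_TO_MEV


def q_electron_capture(m_parent, m_daughter):
    """Electron capture, form of Eq. 12.36 (same mass difference as Eq. 12.29)."""
    return (m_parent - m_daughter) * U_TO_MEV


M_K40, M_CA40, M_AR40 = 39.963998, 39.962591, 39.962383

q_minus = q_beta_minus(M_K40, M_CA40)
q_plus = q_beta_plus(M_K40, M_AR40)
q_ec = q_electron_capture(M_K40, M_AR40)

print(f"beta-minus: {q_minus:.3f} MeV")
print(f"beta-plus:  {q_plus:.3f} MeV")
print(f"capture:    {q_ec:.3f} MeV")
```

Electron capture always has 2 m_e c^2 (about 1.022 MeV) more energy available than positron emission between the same pair of atoms, which is why capture can occur in cases where positron emission is energetically forbidden.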


12.9 GAMMA DECAY AND NUCLEAR EXCITED STATES





Following alpha or beta decay, the final nucleus may be left in an excited state. 
Just as an atom does, the nucleus will reach its ground state after emitting one 
or more photons, known as nuclear gamma rays. The energy of each photon is 
the energy difference between the initial and final nuclear states, less a negligibly 
small correction for the recoil kinetic energy of the nucleus. The energies of 
emitted gamma rays are typically in the range of 100 keV to a few MeV. Nuclei 
can likewise be excited from the ground state to an excited state by absorbing a 
photon of the appropriate energy, in a process similar to the resonant absorption
by atomic states.

Figure 12.19 shows a typical energy-level diagram of excited nuclear states
and some of the gamma-ray transitions that can be emitted. Typical values for the
half-lives of the excited states are $10^{-9}$ to $10^{-12}$ s, although there are occasional
cases of excited states with half-lives of hours, days, or even years.

When a gamma-ray photon is emitted, the nucleus must recoil to conserve
momentum. The photon has energy $E_\gamma$ and momentum $p_\gamma = E_\gamma/c$. The nucleus
recoils with momentum $p_R$. If the nucleus is initially at rest, then momentum
conservation requires that $p_R = p_\gamma$ in magnitude (and that the nucleus recoil in
a direction opposite to that of the gamma ray). The recoil kinetic energy $K_R$ is
small, so that nonrelativistic equations can be used for the nucleus (of mass M):

$$K_R = \frac{p_R^2}{2M} = \frac{p_\gamma^2}{2M} = \frac{E_\gamma^2}{2Mc^2}$$ (12.38)

For a medium-mass nucleus of A = 100 and a large gamma-ray energy of 1 MeV,
the recoil kinetic energy is only 5 eV. Suppose the gamma ray is emitted when
the nucleus jumps from an initial state with energy $E_i$ to a final state with energy
$E_f$. Conservation of energy then gives $E_i = E_f + E_\gamma + K_R$, so the energy of the
emitted gamma ray is

$$E_\gamma = E_i - E_f - K_R \cong E_i - E_f$$ (12.39)
The gamma-ray energy is equal to the difference between the initial and final 
energy states, because the recoil kinetic energy of the nucleus is negligibly small. 
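The estimate of a 5 eV recoil for A = 100 and a 1 MeV gamma ray follows directly from the recoil formula. The sketch below assumes that the nuclear mass can be approximated as A atomic mass units (an approximation introduced here, not stated in the text); the function name is illustrative.

```python
U_TO_EV = 931.5e6  # energy equivalent of 1 u in eV


def recoil_energy_ev(gamma_energy_ev, mass_number):
    """Recoil kinetic energy K_R = E_gamma**2 / (2 M c**2), with M c**2 ~ A * 931.5 MeV."""
    return gamma_energy_ev ** 2 / (2 * mass_number * U_TO_EV)


k_recoil = recoil_energy_ev(1.0e6, 100)
fraction = k_recoil / 1.0e6

print(f"K_R = {k_recoil:.1f} eV ({fraction:.1e} of the photon energy)")
```

The recoil is a few parts per million of the photon energy, which is why it can be neglected in Eq. 12.39 but not in resonance experiments, where the level widths are far smaller.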

In calculating the energies of alpha and beta particles emitted in radioactive 
decays, we have assumed that no gamma rays are emitted. If there are gamma 
rays emitted, the available energy (Q value) must be shared between the other
particles and the gamma ray, as the following example shows.

FIGURE 12.19 Some gamma rays emitted following beta decay.


$^{12}$N beta decays to an excited state of $^{12}$C, which subsequently
decays to the ground state with the emission of
a 4.43-MeV gamma ray. What is the maximum kinetic
energy of the emitted beta particle?


Solution

To determine the Q value for this decay, we first need to
find the mass of the product nucleus $^{12}$C in its excited state.
In the ground state, $^{12}$C has a mass of 12.000000 u, so its
mass in the excited state (indicated by $^{12}$C*) is


$$m(^{12}\mathrm{C}^*) = 12.000000\ \text{u} + \frac{4.43\ \text{MeV}}{931.5\ \text{MeV/u}} = 12.004756\ \text{u}$$

In this decay, a proton is converted to a neutron, so it must
be an example of positron decay. The Q value is, according
to Eq. 12.33,


$$Q = [m(^{12}\mathrm{N}) - m(^{12}\mathrm{C}^*) - 2m_e]c^2$$
$$= [12.018613\ \text{u} - 12.004756\ \text{u} - 2(0.000549\ \text{u})](931.5\ \text{MeV/u})$$
$$= 11.89\ \text{MeV}$$


(Notice that we could have just as easily found the Q value
by first finding the Q value for decay to the ground state,
16.32 MeV, and then subtracting the excitation energy of
4.43 MeV, because the decay to the excited state has that
much less available energy.)

Neglecting the small correction for the recoil kinetic
energy of the $^{12}$C nucleus, the maximum positron kinetic
energy is 11.89 MeV.

FIGURE 12.20 Nuclear vibrational states in the nucleus $^{120}$Te. The states
are labeled with the vibrational quantum number N. Note that the states are
nearly equally spaced, as is expected for vibrations.


Nuclear Excited States 


The study of nuclear gamma emission is an important tool of the nuclear physicist; 
the energies of the gamma rays can be measured with great precision, and they 
provide a powerful means of deducing the energies of the excited states of nuclei. 
This type of nuclear spectroscopy is very similar to the methods of molecular 
spectroscopy discussed in Chapter 9. In fact, the nuclear excited states can be 
formed in ways that are similar to molecular excited states: 


1. Proton or Neutron Excitation Nuclear excited states can be formed when 
a proton or a neutron is excited from a filled state to one of the empty states 
shown in Figure 12.8 or 12.9, just as in molecules we can form an excited state 
by promoting an electron from a lower state to one of the empty molecular 
orbitals. When a proton or neutron drops from an excited state to a lower state, 
a gamma-ray photon is emitted. The energy of the photon is equal to the energy 
difference between the states (neglecting the small recoil kinetic energy of the 
nucleus). To estimate the average energy for this type of excitation, we note 
from Figure 12.8 that 73 neutrons occupy an energy of 37.0 MeV, so the average 
spacing of the filled levels is (37.0 MeV)/73 = 0.5 MeV. The spacing between the 
empty states, among which gamma rays are emitted, should be about the same. 


2. Nuclear Vibrations The nucleus can vibrate like a jiggling water droplet. 
The vibrational excited states are equally spaced, just like the molecular vibrational 
states shown in Figure 9.22. Unlike a molecule, a nucleus vibrates like an 
incompressible fluid—for example, if the “equator” bulges outward, the “poles” 
must move inward to keep the density constant. The separation of the equally 
spaced vibrational states is about 0.5—1 MeV. Figure 12.20 shows an example of 
some vibrational nuclear excited states. Although the selection rules for photon 
emission in nuclei are not as strongly restrictive as they are in molecules, nuclei 
in higher vibrating states usually jump to lower vibrating states by changing the 
vibrational quantum number by one unit and emitting a gamma-ray photon in the 
process. 


3. Nuclear Rotations The nucleus can rotate, showing the same L(L+1) 
spacing as a molecule (see Figure 9.25). Figure 12.21 shows an example of 
rotational nuclear excited states. The spacing between the rotational ground state 
and the first rotational excited state is typically 0.05—0.1 MeV. (Note that in 
nuclei, as in molecules, the rotational spacing is generally much smaller than the 
vibrational spacing.) Nuclei in higher rotational states can jump to lower rotational 
states by emitting gamma-ray photons; in the case of nuclei, the selection rule 
restricting the change in the rotational quantum number to one unit, which was 
strongly followed by molecules, does not strongly apply to nuclei. In nuclei, 
the rotational quantum number generally changes by one or two units when 
gamma-ray photons are emitted in transitions between the nuclear rotational 
states. 


*Nuclear Resonance


*This is an optional section that may be skipped without loss of continuity.

One way of studying atomic systems is to do resonance experiments. In such
experiments, radiation from a collection of atoms in an excited state is incident
on a collection of identical atoms in their ground state. The ground-state atoms
can absorb the photons and jump to the corresponding excited state. However,
as we have seen, the emitted photon energy is less than the transition energy
by the recoil kinetic energy $K_R$; moreover, it is less than the photon energy
required for resonance by $2K_R$, because the absorbing atom must recoil also.
The absorption experiment is still possible, because the excited states don’t have
“exact” energies: a state with a mean lifetime τ has an energy uncertainty ΔE
that is given by the uncertainty relationship $\Delta E\,\tau \sim \hbar$. That is, the state lives on
the average for a time τ, and during that time we can’t determine its energy to a
precision better than ΔE. For typical atomic states, $\tau \sim 10^{-8}$ s, so $\Delta E \sim 10^{-7}$ eV.
Because $K_R$, which is of the order of $10^{-10}$ eV, is much less than the width ΔE,
the “shift” caused by the recoil is not large, and the widths of the emitting and
absorbing atomic states cause sufficient overlap for the absorption process to
occur. Figure 12.22 illustrates this case.

The situation is different for nuclear gamma rays. A typical lifetime might be
$10^{-10}$ s, and so the widths are of the order of $\Delta E \sim 10^{-5}$ eV. The photon energies
are typically 100 keV = $10^{5}$ eV, and so $K_R$ is of order 1 eV. This situation is
depicted in Figure 12.23, and you can immediately see that because $K_R$ is so
much larger than the width ΔE, no overlap of emitter and absorber is possible, so
resonance absorption cannot occur.
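The contrast between the atomic and nuclear cases comes down to the ratio of the recoil energy to the level width. The sketch below evaluates the width from the uncertainty relationship, using the standard value $\hbar = 6.582 \times 10^{-16}$ eV·s, and compares it with the order-of-magnitude recoil energies quoted in the text. The function name and the dictionary layout are illustrative choices.

```python
HBAR_EV_S = 6.582e-16  # reduced Planck constant in eV s


def level_width_ev(mean_lifetime_s):
    """Energy width from Delta E * tau ~ hbar."""
    return HBAR_EV_S / mean_lifetime_s


# (mean lifetime in s, recoil energy in eV), orders of magnitude from the text
cases = {
    "atomic": (1e-8, 1e-10),
    "nuclear": (1e-10, 1.0),
}

ratios = {}
for name, (lifetime, recoil) in cases.items():
    width = level_width_ev(lifetime)
    ratios[name] = recoil / width
    print(f"{name}: width ~ {width:.1e} eV, recoil/width ~ {ratios[name]:.1e}")
```

A ratio far below 1 means the emission and absorption lines still overlap; a ratio far above 1 means the recoil shifts them completely apart.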

In 1958, it was discovered that the overlap of the emitter and absorber can be
restored by placing the radioactive nuclei and the absorbing nuclei in crystals.
The crystalline binding energies are large compared with $K_R$, so the individual
atoms are held tightly to their positions in the crystal lattice and are not free to
recoil; if any recoil is to occur, it must be the whole crystal that recoils. The
effect is to make the mass M that appears in Eq. 12.38 not the mass of an atom,
but the mass of the entire crystal, perhaps $10^{20}$ times larger than an atomic mass.
(As an analogy, imagine the difference between striking a brick with a baseball
bat, and striking a brick wall!) Once again the recoil kinetic energy is made small,
and resonant absorption can occur (Figure 12.24). For this discovery, Rudolf
Mössbauer was awarded the 1961 Nobel Prize in physics, and the process of
achieving nuclear resonance by embedding the emitting and absorbing nuclei in
crystal lattices is now known as the Mössbauer effect.

FIGURE 12.21 An example of nuclear rotational states in the nucleus
$^{165}$Ho. The states are labeled with the rotational quantum number L. The
energies closely follow the expected L(L + 1) spacing.

FIGURE 12.25 Mössbauer effect apparatus. A source of gamma rays
is made movable, in order to Doppler-shift the photon energies. The intensity
of radiations transmitted through the absorber is measured as a function of
the speed of the source.

FIGURE 12.26 Typical results in a Mössbauer effect experiment. A
velocity of 2 cm/s Doppler shifts the gamma rays enough to move the
emission and absorption energies off resonance.


The small remaining difference between the emission and absorption energies
can be eliminated to obtain complete overlap by Doppler-shifting either the
emission or absorption energies. The Doppler-shifted frequency when a source
moves toward the observer at speed v is given by Eq. 2.22, $f' = f(1 + v/c)$,
where we ignore the $\sqrt{1 - v^2/c^2}$ term because $v \ll c$. Using $E = hf$ for the
photon energy, we have


$$E' = E(1 + v/c)$$ (12.40)

If we take the width ΔE as a representative estimate of how far we would
like to Doppler-shift the photon energy, then $E' \sim E + \Delta E$, and so
$E + \Delta E = E + E(v/c)$. Solving for v, we find


$$v = c\,\frac{\Delta E}{E}$$ (12.41)


Estimating $\Delta E \sim 10^{-5}$ eV (the width of the state) and $E \sim 100$ keV (the energy
of the photon), we have

$$v = (3 \times 10^{8}\ \text{m/s})\,\frac{10^{-5}\ \text{eV}}{10^{5}\ \text{eV}} = 3\ \text{cm/s}$$

Such low speeds can be easily and accurately produced in the laboratory.
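The source speed needed to sweep across a resonance follows from the ratio of the level width to the photon energy. The following minimal sketch evaluates that ratio for the values quoted above; the function name is an illustrative choice.

```python
SPEED_OF_LIGHT = 3.0e8  # m/s, rounded as in the text


def doppler_speed(width_ev, photon_energy_ev):
    """Source speed (m/s) that shifts the photon energy by one level width: v = c * dE / E."""
    return SPEED_OF_LIGHT * width_ev / photon_energy_ev


v_source = doppler_speed(1e-5, 1e5)
print(f"v = {v_source * 100:.1f} cm/s")
```

Because only the ratio of the two energies enters, the same function applies to any transition once its width and energy are known.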

Figure 12.25 shows a diagram of the apparatus to measure the Mössbauer
effect. The resonant absorption is observed by looking for decreases in the 
number of gamma rays that are transmitted through the absorber. At resonance, 
more gamma rays are absorbed and so the transmitted intensity decreases. Typical 
results are shown in Figure 12.26. 

The Mössbauer effect is an extremely precise method for measuring small
changes in the energies of photons. In one particular application, the Zeeman
splitting of nuclear (not atomic) states can be observed. When a nucleus is placed
in a magnetic field, the Zeeman effect causes an energy splitting of the nuclear m
states, similar to the atomic case. However, nuclear magnetic moments are about
2000 times smaller than atomic magnetic moments, and a typical energy splitting
would be about $10^{-6}$ eV. To observe such an effect directly we would need to
measure photon energies to 1 part in $10^{11}$ (a photon energy of $10^{5}$ eV is shifted
by $10^{-6}$ eV), but using the Mössbauer effect, this is not difficult.

In Chapter 15 we discuss another application of this extremely precise tech- 
nique, in which the energy gained when a photon “falls” through several meters 
of the Earth’s gravitational field is measured in order to test one prediction of 
Einstein’s general theory of relativity. 


12.10 NATURAL RADIOACTIVITY 


All of the elements beyond the very lightest (hydrogen and helium) were produced 
by nuclear reactions in the interiors of stars. These reactions produce not only 
stable elements, but radioactive ones as well. Most radioactive elements have 
half-lives that are much smaller than the age of the Earth (about $4.5 \times 10^{9}$ y), so
those radioactive elements that may have been present when the Earth was formed
have decayed to stable elements. However, a few of the radioactive elements
created long ago have half-lives that are as large as or even greater than the age of
the Earth. These elements can still be observed to undergo radioactive decay and 
account for part of the background of natural radioactivity that surrounds us. 

Radioactive decay processes either change the mass number A of a nucleus 
by four units (alpha decay) or don’t change A at all (beta or gamma decay). 
A radioactive decay process can be part of a sequence or series of decays if a 
radioactive element of mass number A decays to another radioactive element 
of mass number $A$ or $A - 4$. Such a series of processes will continue until a
stable element is reached. A hypothetical such series is illustrated in Figure 12.27. 
Because gamma decays don’t change Z or A, they are not shown; however, most 
of the alpha and beta decays are accompanied by gamma-ray emissions. 

The A values of the members of such decay chains differ by a multiple of 
4 (including zero as a possible multiple) and so we expect four possible decay 
chains, with A values that can be expressed as $4n$, $4n + 1$, $4n + 2$, and $4n + 3$,
where $n$ is an integer. One of the four naturally occurring radioactive series is
illustrated in Figure 12.28. Each series begins with a relatively long-lived member,
proceeds through many $\alpha$ and $\beta$ decays, which may have very short half-lives,
and finally ends with a stable isotope. Three of these series begin with isotopes
having half-lives comparable to the age of the Earth, and so are still observed
today. The neptunium series ($4n + 1$) begins with $^{237}$Np, which has a half-life of
“only” $2.1 \times 10^{6}$ y, much less than the $4.5 \times 10^{9}$ y since the formation of the
Earth. Thus all of the $^{237}$Np that was originally present has long since decayed to
$^{209}$Bi.
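
The grouping of nuclides into four chains follows directly from the fact that alpha decay changes $A$ by 4 and beta and gamma decay leave $A$ unchanged. The sketch below illustrates this with the remainder of $A$ on division by 4; the helper name `decay_series` is hypothetical and only the $4n + k$ labels come from the text.

```python
# Classify a nuclide into one of the four decay series by its mass number A.
# Alpha decay lowers A by 4 and beta/gamma decay leave A unchanged, so the
# remainder A mod 4 is the same for every member of a chain.
SERIES_LABELS = ["4n", "4n + 1", "4n + 2", "4n + 3"]

def decay_series(mass_number: int) -> str:
    """Return the series label (4n, 4n + 1, ...) for mass number A."""
    return SERIES_LABELS[mass_number % 4]

for name, a in [("238U", 238), ("206Pb", 206), ("237Np", 237), ("209Bi", 209)]:
    print(name, "belongs to the", decay_series(a), "series")
```

Both ends of a chain must carry the same label, which is a quick consistency check on any proposed parent and end product.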
FIGURE 12.28 The *3°U decay chain. The diagonal lines represent $\alpha$
decays, and the horizontal lines show $\beta$ decays.
FIGURE 12.27 An example of a
hypothetical radioactive decay chain.

Example 12.13





Compute the $Q$ value for the $^{238}$U $\to$ $^{206}$Pb decay chain,
and find the rate of energy production per gram of uranium.


Solution 

Because $A$ changes by 32, there must be 8 alpha decays in
the chain. These 8 alpha decays would decrease $Z$ by 16
units, from 92 to 76. However, the final $Z$ must be 82, so
there must also be 6 beta decays in the chain. We recall
that for $\beta^-$ decays, the electron masses combine with the
nuclear masses in the computation of the $Q$ value and we
can therefore use atomic masses. Thus for the entire decay
chain,

$$\begin{aligned}
Q &= [m(^{238}\mathrm{U}) - m(^{206}\mathrm{Pb}) - 8m(^{4}\mathrm{He})]c^2 \\
  &= [238.050788\ \mathrm{u} - 205.974465\ \mathrm{u} - 8(4.002603\ \mathrm{u})](931.5\ \mathrm{MeV/u}) \\
  &= 51.7\ \mathrm{MeV}
\end{aligned}$$


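The half-life of the decay is $4.5 \times 10^{9}$ y, so $\lambda$, the decay
probability per atom, is

$$\lambda = \frac{\ln 2}{t_{1/2}} = \frac{0.693}{(4.5 \times 10^{9}\ \mathrm{y})(3.16 \times 10^{7}\ \mathrm{s/y})} = 4.9 \times 10^{-18}\ \mathrm{s}^{-1}$$

One gram of $^{238}$U is $\frac{1}{238}$ mole and therefore contains
$\frac{1}{238} \times 6 \times 10^{23}$ atoms. The decay rate (activity) of the $^{238}$U
is given by the decay probability per atom per unit time
multiplied by the number of atoms:

$$\begin{aligned}
a &= \lambda N \\
  &= \left(4.9 \times 10^{-18}\ \frac{\text{decays}}{\text{atom} \cdot \text{s}}\right)\left(\frac{1}{238} \times 6 \times 10^{23}\ \text{atoms}\right) \\
  &= 12{,}000\ \text{decays/s}
\end{aligned}$$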


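Each decay releases 51.7 MeV, and so the rate of energy
production is

$$12{,}000\ \frac{\text{decays}}{\text{s}} \times 51.7\ \frac{\text{MeV}}{\text{decay}} \times 10^{6}\ \frac{\text{eV}}{\text{MeV}} \times 1.6 \times 10^{-19}\ \frac{\text{J}}{\text{eV}} = 1.0 \times 10^{-7}\ \mathrm{W}$$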
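
The chain of arithmetic in Example 12.13 can be reproduced in a few lines. This is a sketch only, using the rounded constants quoted in the example (931.5 MeV/u, $3.16 \times 10^{7}$ s/y, $6 \times 10^{23}$ atoms per mole, $1.6 \times 10^{-19}$ J/eV); the variable names are chosen here for illustration.

```python
# Reproduce Example 12.13: Q value of the 238U -> 206Pb chain and the
# power released by 1 g of 238U, using the rounded constants of the text.
U_TO_MEV = 931.5          # MeV per atomic mass unit
SECONDS_PER_YEAR = 3.16e7
AVOGADRO = 6e23           # atoms per mole, rounded as in the text
J_PER_EV = 1.6e-19

m_u238, m_pb206, m_he4 = 238.050788, 205.974465, 4.002603  # atomic masses in u

# Eight alpha particles are emitted; electron masses cancel when atomic masses are used.
q_mev = (m_u238 - m_pb206 - 8 * m_he4) * U_TO_MEV

decay_const = 0.693 / (4.5e9 * SECONDS_PER_YEAR)   # lambda in 1/s
n_atoms = AVOGADRO / 238                            # atoms in 1 g of 238U
activity = decay_const * n_atoms                    # decays per second
power_w = activity * q_mev * 1e6 * J_PER_EV         # watts

print(f"Q = {q_mev:.1f} MeV")
print(f"lambda = {decay_const:.2e} 1/s")
print(f"activity = {activity:.0f} decays/s")
print(f"power = {power_w:.2e} W")
```

Carrying unrounded intermediate values shifts the last digit slightly relative to the worked example, which rounds the activity to 12,000 decays/s before multiplying.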





This may seem like a very small rate of energy release, but if the energy were
to appear as thermal energy and were not dissipated by some means (radiation or
conduction to other matter, for example), the 1-g sample of $^{238}$U would increase
in temperature by 25°C per year and would be melted and vaporized in the order
of one century! This calculation suggests that we can perhaps account for some
of the internal heat of planets through natural radioactive processes.

If we examine a sample of uranium-bearing rock, we can find the ratio of
$^{238}$U atoms to $^{206}$Pb atoms. If we assume that all of the $^{206}$Pb was produced
by the uranium decay and that none was present when the rock was originally
formed (assumptions that must be examined with care both theoretically and
experimentally), then this ratio can be used to find the age of the sample, as shown
in the following example.


Example 12.14

Three different rock samples have ratios of numbers of
$^{238}$U atoms to $^{206}$Pb atoms of 0.5, 1.0, and 2.0. Compute
the ages of the three rocks.


Solution

Because all of the other members of the uranium series
have half-lives that are much shorter than the half-life of
$^{238}$U ($4.5 \times 10^{9}$ y), we ignore the intervening decays and
consider only the $^{238}$U decay. Let $N_0$ be the original number
of $^{238}$U atoms, so that $N_0 e^{-\lambda t}$ is the number that are still
present today, and $N_0 - N_0 e^{-\lambda t}$ is the number that have
decayed and are presently observed as $^{206}$Pb. The ratio $R$
of $^{238}$U to $^{206}$Pb is thus

$$R = \frac{\text{number of }^{238}\mathrm{U}}{\text{number of }^{206}\mathrm{Pb}} = \frac{N_0 e^{-\lambda t}}{N_0 - N_0 e^{-\lambda t}} = \frac{1}{e^{\lambda t} - 1}$$

Solving for $t$ and recalling that $\lambda = 0.693/t_{1/2}$, we find:

$$t = \frac{t_{1/2}}{0.693}\,\ln\!\left(\frac{1}{R} + 1\right) \qquad (12.42)$$

We can then obtain the values of $t$ corresponding to the
three values of $R$:

$R = 0.5$    $t = 7.1 \times 10^{9}$ y
$R = 1.0$    $t = 4.5 \times 10^{9}$ y
$R = 2.0$    $t = 2.6 \times 10^{9}$ y

The oldest rocks on Earth, dated by similar means, have
ages of about $4.5 \times 10^{9}$ y. The age of the first rock analyzed
above, $7.1 \times 10^{9}$ y, suggests either that the rock had
an extraterrestrial origin, or else that our assumption of
no initial $^{206}$Pb was incorrect. The age of the third rock
suggests that it solidified only $2.6 \times 10^{9}$ y ago; previous to
that time it was molten and the decay product $^{206}$Pb may
have “boiled away” from the $^{238}$U.
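
Equation 12.42 is easy to evaluate for any measured ratio. The sketch below assumes, as the example does, that no $^{206}$Pb was present initially and that intermediate members of the chain can be ignored; the function name `uranium_lead_age` is illustrative.

```python
import math

# Age of a rock from its 238U/206Pb atom ratio R (Eq. 12.42).
# Assumes no initial 206Pb and neglects the short-lived intermediate decays.
T_HALF_U238 = 4.5e9  # half-life of 238U in years, as used in the text

def uranium_lead_age(ratio: float, half_life: float = T_HALF_U238) -> float:
    """Return t = (t_half / 0.693) * ln(1/R + 1) in years."""
    return half_life / 0.693 * math.log(1.0 / ratio + 1.0)

for r in (0.5, 1.0, 2.0):
    print(f"R = {r:.1f}  ->  t = {uranium_lead_age(r):.2e} y")
```

A ratio of exactly 1.0 returns one half-life, as expected: half of the original uranium has decayed to lead.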





There are a number of other naturally occurring radioactive isotopes that are 
not part of the decay chain of the heavy elements. A partial list is given in 
Table 12.4; some of these can also be used for radioactive dating. 

TABLE 12.4 Some Naturally Occurring Radioactive Isotopes

Isotope        $t_{1/2}$
$^{40}$K       $1.25 \times 10^{9}$ y
$^{87}$Rb      $4.8 \times 10^{10}$ y
$^{92}$Nb      $3.2 \times 10^{7}$ y
$^{113}$Cd     $9 \times 10^{15}$ y
$^{115}$In     $5 \times 10^{14}$ y
$^{138}$La     $1.1 \times 10^{11}$ y
$^{176}$Lu     $3.6 \times 10^{10}$ y
$^{187}$Re     $4 \times 10^{10}$ y
$^{232}$Th     $1.41 \times 10^{10}$ y

Other radioactive elements are being produced continuously in the Earth’s
atmosphere as a result of nuclear reactions between air molecules and the
high-energy particles known as “cosmic rays.” The most notable and useful of
these is $^{14}$C, which beta decays with a half-life of 5730 y. When a living plant
absorbs CO$_2$ from the atmosphere, a small fraction (about 1 in $10^{12}$) of the carbon
atoms is $^{14}$C, and the remainder is stable $^{12}$C (99%) and $^{13}$C (1%). When the plant
dies, its intake of $^{14}$C stops, and the $^{14}$C decays. If we assume that the composition
of the Earth’s atmosphere and the flux of cosmic rays have not changed
significantly in the last few thousand years, we can find the age of specimens of
organic material by comparing their $^{14}$C/$^{12}$C ratios to those of living plants. The
following example shows how this radiocarbon dating technique is used.


Example 12.15

(a) A sample of carbon dioxide gas from the atmosphere fills
a vessel of volume 200.0 cm$^3$ to a pressure of $2.00 \times 10^{4}$ Pa
(1 Pa = 1 N/m$^2$, about $10^{-5}$ atm) at a temperature of 295
K. Assuming that all of the $^{14}$C beta decays were counted,
how many counts would be accumulated in one week?
(b) An old sample of wood is burned, and the resulting
carbon dioxide is placed in an identical vessel at the same
pressure and temperature. After one week, 1420 counts
have been accumulated. What is the age of the sample?


Solution
(a) We first find the number of atoms present in the vessel,
using the ideal gas law:

$$N = \frac{PV}{kT} = \frac{(2.00 \times 10^{4}\ \mathrm{N/m^2})(2.00 \times 10^{-4}\ \mathrm{m^3})}{(1.38 \times 10^{-23}\ \mathrm{J/K})(295\ \mathrm{K})} = 9.82 \times 10^{20}\ \text{atoms}$$

If the fraction of $^{14}$C atoms is $10^{-12}$, there are $9.82 \times 10^{8}$
atoms of $^{14}$C present. The activity is

$$a = \lambda N = \frac{0.693}{(5730\ \mathrm{y})(3.16 \times 10^{7}\ \mathrm{s/y})}\,(9.82 \times 10^{8}) = 3.76 \times 10^{-3}\ \text{decays/s}$$

In one week the number of decays is 2280.

(b) An identical sample that gives only 1420 counts must
be old enough for only 1420/2280 of its original activity to
remain. With $1420 = 2280 e^{-\lambda t}$, we have

$$t = \frac{1}{\lambda}\ln\!\left(\frac{2280}{1420}\right) = \frac{5730\ \mathrm{y}}{0.693}\ln\!\left(\frac{2280}{1420}\right) = 3920\ \mathrm{y}$$
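
The two parts of the radiocarbon example can be sketched as a short calculation. The constants are those quoted in the example; the function names `weekly_counts` and `radiocarbon_age` are hypothetical, and the $^{14}$C fraction of $10^{-12}$ is the approximate value assumed in the text.

```python
import math

K_BOLTZMANN = 1.38e-23      # J/K
SECONDS_PER_YEAR = 3.16e7
SECONDS_PER_WEEK = 7 * 24 * 3600
T_HALF_C14 = 5730.0         # years
C14_FRACTION = 1e-12        # approximate fraction of carbon atoms that are 14C

def weekly_counts(pressure_pa: float, volume_m3: float, temperature_k: float) -> float:
    """Expected 14C decays in one week for a CO2 sample treated as an ideal gas."""
    n_carbon = pressure_pa * volume_m3 / (K_BOLTZMANN * temperature_k)
    n_c14 = n_carbon * C14_FRACTION
    decay_const = 0.693 / (T_HALF_C14 * SECONDS_PER_YEAR)   # 1/s
    return decay_const * n_c14 * SECONDS_PER_WEEK

def radiocarbon_age(counts_fresh: float, counts_old: float) -> float:
    """Age in years from the ratio of fresh to old counts over equal counting times."""
    return T_HALF_C14 / 0.693 * math.log(counts_fresh / counts_old)

print(f"fresh sample: {weekly_counts(2.00e4, 2.00e-4, 295):.0f} counts per week")
print(f"age of old sample: {radiocarbon_age(2280, 1420):.0f} y")
```

The age depends only on the ratio of the two count totals, so the vessel size and pressure drop out as long as both samples are measured under identical conditions.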








402 Chapter 12 | Nuclear Structure and Radioactivity 








Quantity                           Formula                                                                        Section
Nuclear radius                     $R = R_0 A^{1/3}$, $R_0 = 1.2$ fm                                              12.2
Nuclear binding energy             $B = [N m_n + Z m(^{1}_{1}\mathrm{H}_0) - m(^{A}_{Z}X_N)]c^2$                  12.3
Proton separation energy           $S_p = [m(^{A-1}_{Z-1}X'_N) + m(^{1}\mathrm{H}) - m(^{A}_{Z}X_N)]c^2$          12.3
Neutron separation energy          $S_n = [m(^{A-1}_{Z}X_{N-1}) + m_n - m(^{A}_{Z}X_N)]c^2$                       12.3
Range of exchanged particle        $mc^2 = \hbar c/x$                                                             12.4
Activity                           $a = \lambda N$, $\lambda = \ln 2/t_{1/2} = 0.693/t_{1/2}$                     12.5
Radioactive decay law              $N = N_0 e^{-\lambda t}$, $a = a_0 e^{-\lambda t}$                             12.5
Q value of decay $X \to X' + x$    $Q = [m_X - (m_{X'} + m_x)]c^2$                                                12.6
Q value of alpha decay             $Q = [m(X) - m(X') - m(^{4}\mathrm{He})]c^2$                                   12.7
Kinetic energy of alpha particle   $K_\alpha = Q(A - 4)/A$                                                        12.7
Q values of beta decay             $Q_{\beta^-} = [m(^{A}X) - m(^{A}X')]c^2$,                                     12.8
                                   $Q_{\beta^+} = [m(^{A}X) - m(^{A}X') - 2m_e]c^2$
Recoil in gamma decay              $K_R = E_\gamma^2/2Mc^2$                                                       12.9
decay law 


The magnetic dipole moment of a deuterium nucleus is about 
0.0005 Bohr magneton. What does this imply about the pres- 
ence of an electron in the nucleus, as the proton-electron 
model requires? 

Suppose we have a supply of 20 protons and 20 neutrons. Do 
we liberate more energy if we assemble them into a single 
$^{40}$Ca nucleus or into two $^{20}$Ne nuclei?

Atomic masses are usually given to a precision of about the 
sixth decimal place in atomic mass units (u). This is true for 
both stable and radioactive nuclei, even though the uncertainty
principle requires that an atom with a lifetime $\Delta t$ has
a rest energy uncertain by $\hbar/\Delta t$. Based on the typical lifetimes
given for nuclear decays, are we justified in expressing
atomic masses to such precision? At what lifetimes would 
such precision not be justified? 

Only two stable nuclei have Z>N. (a) What are these 
nuclei? (b) Why don’t more nuclei have Z > N? 

In a deuterium nucleus, the proton and neutron spins can be 
either parallel or antiparallel. What are the possible values of 
the total spin of the deuterium nucleus? (It is not necessary 
to consider any orbital angular momentum.) The magnetic 
dipole moment of the deuterium nucleus is measured to be 
nonzero. Which of the possible spins is eliminated by this 
measured value? 

Why is the binding energy per nucleon relatively constant? 
Why does it deviate from a constant value for low mass 
numbers? For high mass numbers? 
A neutron, which has no electric charge, has a magnetic
dipole moment. How is this possible? 

The electromagnetic interaction can be interpreted as an 
exchange force, in which photons are the exchanged parti- 
cle. What does Eq. 12.8 imply about the range of such a 
force? Is this consistent with the conventional interpretation 
of the electromagnetic force? What would you expect for 
the rest energy of the exchanged particle that carries the 
gravitational force? 

What is meant by assuming that the decay constant $\lambda$ is
a constant, independent of time? Is this a requirement of
theory, an axiom, or an experimental conclusion? Under
what circumstances might $\lambda$ change with time?

If we focus our attention on a specific nucleus in a radioac- 
tive sample, can we know exactly how long that nucleus 
will live before it decays? Can we predict which half of the 
nuclei in a sample will decay during one half-life? What part 
of quantum physics is responsible for this? 

A certain radioactive sample is observed to undergo 10,000
decays in 10 s. Can we conclude that $a = 1000$ decays/s if
(a) $t_{1/2} \gg 10$ s; (b) $t_{1/2} = 10$ s; (c) $t_{1/2} \ll 10$ s?

Suppose we wish to do radioactive dating of a sample whose
age we guess to be $t$. Should we choose an isotope whose
half-life is (a) $\gg t$; (b) $\sim t$; or (c) $\ll t$?

The alpha particle is a particularly tightly bound nucleus. 
Based on this fact, explain why heavy nuclei alpha decay 
and light nuclei don’t. 
Can you suggest a possible origin for the helium gas that is
part of the Earth’s atmosphere? 

Estimate the recoil kinetic energy of the residual nucleus 
following alpha decay. (This energy is large enough to drive 
the residual nucleus out of certain radioactive sources; if 
the residual nucleus is itself radioactive, there is the chance 
of spread of radioactive material. A thin coating over the 
source is necessary to prevent this.) 

Why does the electron energy spectrum (Figure 12.17) look 
different from the positron energy spectrum (Figure 12.18) 
at low energies? 

Will electron capture always be energetically possible when 
positron beta decay is possible? Will positron beta decay 
always be energetically possible when electron capture is 
possible? 

All three beta decay processes involve the emission of neu- 
trinos (or antineutrinos). In which processes do the neutrinos 
have a continuous energy spectrum? In which is the neutrino 
monoenergetic? 

Neutrinos always accompany electron capture decays. What 
other kind of radiation always accompanies electron cap- 
ture? (Hint: It is not nuclear radiation.) What other kind of 
nonnuclear radiation might accompany $\beta^-$ or $\beta^+$ decays in
bulk samples? 

The positron decay of $^{15}$O goes directly to the ground state
of $^{15}$N; no excited states of $^{15}$N are populated and no $\gamma$ rays
follow the beta decay. Yet a source of $^{15}$O is found to emit $\gamma$
rays of energy 0.51 MeV. Explain the origin of these $\gamma$ rays.

Would $^{92}$Nb be a convenient isotope to use for determining
the age of the Earth by radioactive dating? (See Table 12.4.)
What about $^{113}$Cd?

The natural decay chain 735U $\to$ °98Pb consists of several
alpha decays, which decrease $A$ by 4 and $Z$ by 2, and negative
beta decays, which increase $Z$ by 1. (See Example
12.13.) As shown in Figures 12.27 and 12.28, sometimes a
decay chain can proceed through different branches. Does
the number of alpha decays and beta decays in the chain
depend on this branching?

It has been observed that there is an increased level of radon
gas ($Z = 86$) in the air just before an earthquake. Where
does the radon come from? How is it produced? How is it
released? How is it detected?

Which of the decay processes discussed in this chapter
would you expect to be most sensitive to the chemical state
of the radioactive sample?

In Figure 12.26, only 1% of the gamma intensity is absorbed,
even at resonance. For complete resonance, we would
expect 100% absorption. What factors might contribute to
this small absorption?


Nuclear Constituents


Give the proper isotopic symbols for: (a) the isotope of
fluorine with mass number 19; (b) an isotope of gold with
120 neutrons; (c) an isotope of mass number 107 with 60
neutrons.

Tin has more stable isotopes than any other element; they
have mass numbers 114, 115, 116, 117, 118, 119, 120, 122,
124. Give the symbols for these isotopes.


Nuclear Sizes and Shapes


(a) Compute the Coulomb repulsion energy between two
nuclei of !°O that just touch at their surfaces. (b) Do the
same for two nuclei of $^{238}$U.

Find the nuclear radius of (a) $^{197}$Au; (b) $^{4}$He; (c) °Ne.


Nuclear Masses and Binding Energies


Find the total binding energy, and the binding energy per
nucleon, for (a) ?°°Pb; (b) $^{133}$Cs; (c) Zr; (d) °°Co.

Find the total binding energy, and the binding energy per
nucleon, for (a) $^{4}$He; (b) $^{20}$Ne; (c) $^{40}$Ca; (d) $^{55}$Mn.

Calculate the total nuclear binding energy of $^{3}$He and $^{3}$H.
Account for any difference by considering the Coulomb
interaction of the extra proton of $^{3}$He.

Find the neutron separation energy of: (a) $^{17}$O; (b) "Li;
(c) $^{57}$Fe.

Find the proton separation energy of: (a) He; (b) °C;
(c) Ca.


The Nuclear Force 


The nuclear attractive force must turn into a repulsion at very 
small distances to keep the nucleons from crowding too close 
together. What is the mass of an exchanged particle that will 
contribute to the repulsion at separations of 0.25 fm? 

The weak interaction (the force responsible for beta decay) 
is produced by an exchanged particle with a mass of roughly 
80 GeV. What is the range of this force? 


Quantum States in Nuclei 


Determine the depth of the proton and neutron potential 
energy wells for (a) !°O; (b) 5U. 

The two-neutron separation energies of $^{160}$Dy and $^{164}$Dy are,
respectively, 15.4 MeV and 13.9 MeV, and the two-proton
separation energies of $^{158}$Dy and $^{162}$Dy are, respectively,
12.4 MeV and 14.8 MeV. From these data alone, determine
whether alpha decay is energetically allowed for $^{160}$Dy and
$^{164}$Dy.
Radioactive Decay


What fraction of the original number of nuclei present in a 
sample will remain after (a) 2 half-lives; (b) 4 half-lives; 
(c) 10 half-lives? 

A certain sample of a radioactive material decays at a rate 
of 548 per second at ¢ = 0. At t = 48 minutes, the counting 
rate has fallen to 213 per second. (a) What is the half-life of 
the radioactivity? (6) What is its decay constant? (c) What 
will be the decay rate at t = 125 minutes? 

What is the decay probability per second per nucleus of a 
substance with a half-life of 5.0 hours? 

Tritium, the hydrogen isotope of mass 3, has a half-life 
of 12.3 y. What fraction of the tritium atoms remains in a 
sample after 50.0 y? 

Suppose we have a sample containing 2.00 mCi of radioactive
$^{131}$I ($t_{1/2} = 8.04$ d). (a) How many decays per second
occur in the sample? (b) How many decays per second will 
occur in the sample after four weeks? 

Ordinary potassium contains 0.012 percent of the naturally
occurring radioactive isotope $^{40}$K, which has a half-life of
$1.3 \times 10^{9}$ y. (a) What is the activity of 1.0 kg of potassium?
(b) What would have been the fraction of $^{40}$K in natural
potassium $4.5 \times 10^{9}$ y ago?


Alpha Decay 


Derive Eq. 12.23 from Eqs. 12.21 and 12.22. 

For which of the following nuclei is alpha decay permitted? 
(a) !°Bi; (b) 7 Hg; (c) At. 

Find the kinetic energy of the alpha particle emitted in the 
decay of 4U. 


Beta Decay 


Derive Eqs. 12.29, 12.33, and 12.36. 

Find the maximum kinetic energy of the electrons emitted 
in the negative beta decay of $^{11}$Be.

™Se decays by electron capture to As. Find the energy of 
the emitted neutrino. 

$^{15}$O decays to $^{15}$N by positron beta decay. (a) What is the
$Q$ value for this decay? (b) What is the maximum kinetic
energy of the positrons? 


Gamma Decay and Nuclear Excited States 


The nucleus $^{198}$Hg has excited states at 0.412 and
1.088 MeV. Following the beta decay of $^{198}$Au to $^{198}$Hg,
three gamma rays are emitted. Find the energies of these
three gamma rays. 

Compare the recoil energy of a nucleus of mass 200 that 
emits (a) a 5.0-MeV alpha particle, and (b) a 5.0-MeV
gamma ray. 

A certain nucleus has the following sequence of rotational
states $E_L$ (energies in keV): $E_0 = 0$, $E_1 = 100.1$, $E_2 =
300.9$, $E_3 = 603.6$, and $E_4 = 1010.0$. Assuming that the
emitted gamma rays occur only from changes of one or
two units in the rotational quantum number, find all possible
photon energies that can be emitted from these states. Sketch
the excited states, showing the allowed transitions.


12.10 Natural Radioactivity 
The radioactive decay of $^{232}$Th leads eventually to stable
$^{208}$Pb. A certain rock is examined and found to contain 3.65
g of $^{232}$Th and 0.75 g of $^{208}$Pb. Assuming all of the Pb was
produced in the decay of Th, what is the age of the rock?

The $4n$ radioactive decay series begins with $^{232}$Th and
ends with $^{208}$Pb. (a) How many alpha decays are in the
chain? (See Question 22.) (b) How many beta decays?
(c) How much energy is released in the complete chain?
(d) What is the radioactive power produced by 1.00 kg of
$^{232}$Th ($t_{1/2} = 1.40 \times 10^{10}$ y)?

A piece of wood from a recently cut tree shows 12.4 $^{14}$C
decays per minute. A sample of the same size from a tree cut 
thousands of years ago shows 3.5 decays per minute. What 
is the age of this sample? 


General Problems 


Figure 12.2 suggests that the Rutherford scattering formula 
fails for 60° scattering when K is about 28 MeV. Use the 
results derived in Chapter 6 to find the closest distance 
between alpha particle and nucleus for this case, and compare
with the nuclear radius of $^{208}$Pb. Suggest a possible
reason for any discrepancy. 

Assuming the nucleus to diffract like a circular disk, use 
the data shown in Figure 12.3 to find the nuclear radius 
for $^{12}$C and $^{16}$O. How does changing the electron energy
from 360 MeV to 420 MeV affect the deduced radius for
$^{16}$O? (Hint: Use the extreme relativistic approximation from
Chapter 2 to relate the electron’s energy and momentum to 
find its de Broglie wavelength.) 

A radiation detector is in the form of a circular disc of 
diameter 3.0 cm. It is held 25 cm from a source of radiation, 
where it records 1250 counts per second. Assuming that the 
detector records every radiation incident upon it, find the 
activity of the sample (in curies). 

What is the activity of a container holding 125 cm$^3$ of tritium
($^{3}$H, $t_{1/2} = 12.3$ y) at a pressure of $5.0 \times 10^{5}$ Pa (about
5 atm) at $T = 300$ K?

With a radioactive sample originally of $N_0$ atoms, we could
measure the mean, or average, lifetime $\tau$ of a nucleus by
measuring the number $N_1$ that live for a time $t_1$ and then
decay, the number $N_2$ that decay after $t_2$, and so on:

$$\tau = \frac{1}{N_0}\,(N_1 t_1 + N_2 t_2 + \cdots)$$

(a) Show that this is equivalent to $\tau = \lambda \int_0^\infty t\,e^{-\lambda t}\,dt$.
(b) Show that $\tau = 1/\lambda$. (c) Is $\tau$ longer or shorter than $t_{1/2}$?
Complete the following decays:

(a) 77Si > 77AL+ 

(b) “As > ™4Se+ 

(c) US a+ 

(d) ?Mo+e7 > 

(e) 1317 > 131Xe + 

$^{239}$Pu decays by alpha emission with a half-life of
$2.41 \times 10^{4}$ y. Compute the power output, in watts, that
could be obtained from 1.00 gram of $^{239}$Pu.

$^{228}$Th alpha decays to an excited state of $^{224}$Ra, which in turn
decays to the ground state with the emission of a 217-keV 
photon. Find the kinetic energy of the alpha particle. 

By replacing the Coulomb barrier in alpha decay with a 
flat barrier (see Figure 12.16) of thickness $L = \frac{1}{2}(R' - R)$,
equal to half the thickness of the Coulomb barrier that the
alpha particle must penetrate, and height $U_0 = \frac{1}{2}(U_B + K_\alpha)$,
equal to half the height of the Coulomb barrier above the
energy of the alpha particle, estimate the decay half-lives 
for °*Th and *!Th and compare with the measured values 
given in Table 12.2. (Hint: In calculating the speed of the 
alpha particle inside the nucleus, assume that the well depth 
is 30 MeV.) Although the results of this rough calculation 
do not agree well with the measured values, the calculation 
does indicate how barrier penetration is responsible for the 
enormous range of observed half-lives. How would you 
refine the calculation to obtain better agreement with the 
measured values? 
(a) Using the same replacements described in Problem 41,
estimate the decay probability of 7*°Ra for alpha emis- 
sion and for '*C emission. (See Examples 12.8 and 12.9.) 
(b) Using the results of part (a), estimate the number of !4C 
emitted relative to the number of alpha particles emitted by 
a source of 7°Ra. 

Compute the recoil proton kinetic energy in neutron beta 
decay (a) when the electron has its maximum energy; 
(b) when the neutrino has its maximum energy.

In the beta decay of 24Na, an electron is observed with 
a kinetic energy of 2.15 MeV. What is the energy of the 
accompanying neutrino? 

The first excited state of $^{57}$Fe decays to the ground state
with the emission of a 14.4-keV photon in a mean lifetime
of 141 ns. (a) What is the width $\Delta E$ of the state? (b) What
is the recoil kinetic energy of an atom of $^{57}$Fe that emits
a 14.4-keV photon? (c) If the kinetic energy of recoil is 
made negligible by placing the atoms in a solid lattice, res- 
onant absorptions will occur. What velocity is required to 
Doppler-shift the emitted photon so that resonance does not 
occur? 

What is the probability of a $^{14}$C atom in atmospheric CO$_2$
decaying in your lungs during a single breath? The atmosphere
is about 0.03% CO$_2$. Assume you take in about 0.5
L of air in each breath and exhale it 3.5 s later. 





NUCLEAR REACTIONS AND 
APPLICATIONS 
radiation exposure affects various materials. They also produce rare radioisotopes that can
be used for medicine and applications in industry. The photo shows the core of a nuclear 
reactor, which is submerged in water that acts as a neutron moderator. The glow comes 
from Cerenkov radiation, which is emitted when electrons from radioactive decays move at 
speeds greater than the speed of light in water. 
The knowledge of the nucleus that we can obtain from studying radioactive
decays is limited, because only certain radioactive processes occur in nature, only 
certain isotopes are made in those processes, and only certain excited states of 
nuclei (those that happen to follow radioactive decays) can be studied. Nuclear 
reactions, however, give us a controllable way to study any nuclear species, and 
to select any excited states of that species. 

In this chapter we discuss some of the different nuclear reactions that can 
occur, and we study the properties of those reactions. Two nuclear reactions are 
of particular importance: fission and fusion. We pay special attention to those 
processes and we discuss how they are useful as sources of energy (or, more 
correctly, as converters of nuclear energy into thermal or electrical energy). 

We conclude our study of nuclear physics with an introduction to some of the 
ways that methods of nuclear physics can be applied to problems in a variety of 
different areas. 


13.1 TYPES OF NUCLEAR REACTIONS 


In a typical nuclear reaction laboratory experiment, a beam of particles of type x 
is incident on a target containing nuclei of type X. After the reaction, an outgoing 
particle y is observed in the laboratory, leaving a residual nucleus Y. Symbolically, 
we write the reaction as 


$$x + X \to y + Y$$


For example,

$$^{2}_{1}\mathrm{H}_{1} + {}^{63}_{29}\mathrm{Cu}_{34} \to n + {}^{64}_{30}\mathrm{Zn}_{34}$$


Like a chemical reaction, a nuclear reaction must be balanced—the total number 
of protons must be the same before and after the reaction, and also the total 
number of neutrons must remain the same. In the example above, there are 30 
protons on each side and 35 neutrons on each side. (The forces responsible for 
nuclear beta decay can change neutrons into protons or protons into neutrons, 
but these forces act on a typical time scale of at least $10^{-10}$ s. The projectile and
target nuclei are within the range of one another’s nuclear forces for an interval
of at most $10^{-20}$ s, so there is not enough time for this type of proton-neutron
conversion to take place.) The protons and neutrons can be rearranged among the 
reacting nuclei, but their numbers cannot change. 
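
The balancing rule can be expressed as a simple bookkeeping check. The sketch below is illustrative only: the `Nuclide` container and the `is_balanced` helper are hypothetical names, and the proton and neutron numbers are those of the deuteron-on-copper example in the text.

```python
from typing import NamedTuple, Sequence

class Nuclide(NamedTuple):
    """A particle described by its proton number Z and neutron number N."""
    name: str
    Z: int
    N: int

def is_balanced(initial: Sequence[Nuclide], final: Sequence[Nuclide]) -> bool:
    """True if protons and neutrons are separately conserved in the reaction."""
    same_protons = sum(p.Z for p in initial) == sum(p.Z for p in final)
    same_neutrons = sum(p.N for p in initial) == sum(p.N for p in final)
    return same_protons and same_neutrons

deuteron = Nuclide("2H", 1, 1)
cu63 = Nuclide("63Cu", 29, 34)
neutron = Nuclide("n", 0, 1)
zn64 = Nuclide("64Zn", 30, 34)

# 30 protons and 35 neutrons appear on each side of the reaction.
print(is_balanced([deuteron, cu63], [neutron, zn64]))
```

Protons and neutrons are counted separately here because, as noted above, the reaction is too fast for beta-decay processes to convert one into the other.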

A nuclear reaction takes place under the influence of forces internal to the 
system of projectile and target. The absence of external forces means that the 
reaction conserves energy, linear momentum, and angular momentum. 

In most experiments, we observe only the outgoing light particle y; the heavy 
residual nucleus Y usually loses all its kinetic energy (by collisions with other 
atoms) and therefore stops within the target. 

We assume that we produce the reaction by bombarding target nuclei X, 
initially at rest, with projectiles x of kinetic energy K,. The product particles then 
share this kinetic energy, plus or minus any additional energy from the rest energy 
difference of the initial and final nuclei. (We consider energy in nuclear reactions 
in Section 13.3.) 
The bombarding particles x can be either charged particles, supplied by a
suitable nuclear accelerator, or neutrons, whose source may be a nuclear reactor. 
Accelerators for charged particles, illustrated in Figures 13.1 and 13.2, are of two 
basic types. In a cyclotron, a particle is held in a circular orbit by a magnetic 
field and receives a small “kick” by an electric field twice each time it travels 
around the circle; a particle may make perhaps 100 orbits before finally emerging 
with a kinetic energy of the order of 10 to 20 MeV per unit of electric charge. In 
the Van de Graaff accelerator, a particle is accelerated only once from a single 
high-voltage terminal, which may be at a potential of as much as 25 million volts; 
the kinetic energy of the particle is then about 25 MeV per unit of charge. 
FIGURE 13.1 (a) Schematic diagram of a cyclotron accelerator. Charged particles
are bent in a circular path by a magnetic field and are accelerated by an electric field 
each time they cross the gap. (b) A cyclotron accelerator. The magnets are in the large 
cylinders at the top and the bottom. The beam of particles is visible as it collides with 
air molecules after leaving the cyclotron. 
FIGURE 13.2 (a) Diagram of a Van de Graaff accelerator. Particles from the ion
source are accelerated from the high-voltage terminal to ground. (b) A typical Van de 
Graaff accelerator laboratory. The ion source and high-voltage terminal are inside the 
large pressure tank. 
In nuclear reaction experiments, we usually measure two basic properties of
the particle y: its energy, and its probability to emerge at a certain angle with a 
certain energy. We look briefly at these two types of measurements. 


1. Measuring the particle energy If neither the residual nucleus Y nor the 
outgoing particle y had excited states, then by using conservation of energy and 
momentum, we could calculate exactly the energy of y when measured at a certain 
angle. If the nucleus Y is left in an excited state, then the kinetic energy of y is 
reduced by (approximately) the energy of the excited state above the ground state, 
because the two particles Y and y must still share the same amount of total energy. 
Each higher excited state of the nucleus Y corresponds to a certain reduced energy 
of the particle y, and a measurement of the different energies of the particle y 
tells us about the excited states of the nucleus Y. Figure 13.3 shows an example 
of a typical set of experimental results and the corresponding deduced excited 
states of the residual nucleus. Each peak in Figure 13.3 corresponds to a specific 
energy of y, and therefore to a specific excited state of Y; that is, when particles 
with energy 9.0 MeV are observed, the nucleus Y is left in the excited state with 
energy 1.0 MeV. 


2. Measuring the reaction probability Notice that the different peaks in 
Figure 13.3 have different heights. This feature of the results of our experiment 
tells us that it is more probable for the reaction to lead to one excited state 
than to another. This is an example of the reaction probability, the second 
of the properties of y that we can determine. For example, Figure 13.3 shows that 
the probability of leaving Y in its second excited state (1.0 MeV) is about twice 
the probability of leaving Y in its first excited state. If it were possible to solve the 
Schrödinger equation with the nuclear potential energy, we could calculate these 
reaction probabilities and compare them with experiment. Unfortunately we can’t 
solve this many-body problem, so we must work backward by measuring the 
reaction probabilities and then trying to infer some properties of the nuclear force. 

FIGURE 13.3 A sample spectrum of energies of the outgoing particle y, and the 
corresponding excited states of Y. 

The Reaction Cross Section 


Reaction probabilities are usually expressed in terms of the cross section, which 
is a sort of effective area presented by the target nucleus to that projectile for a 
specific reaction, for all possible energies and directions of travel of the outgoing 
particle y. The larger the reaction probability, the larger is the corresponding 
cross section. In general, the cross section depends on the energy of the incident 
particle, $K_x$. 

The cross section $\sigma$ is expressed in units of area, but the area is a very small 
one, of the order of $10^{-28}$ m$^2$. Nuclear physicists use this as a convenient unit of 
measure for cross sections, and it is known as one barn (b): 1 barn = $10^{-28}$ m$^2$. 
Notice that the area of the disc of a single nucleus of medium weight is about 1 
barn; however, reaction cross sections often can be very much greater or less than 
one barn. For example, consider the cross section for these reactions involving 
certain isotopes of the neighboring elements iodine and xenon: 


I + n → I + n (inelastic scattering) $\sigma = 4$ b 
Xe + n → Xe + n (inelastic scattering) $\sigma = 4$ b 

I + n → I + $\gamma$ (neutron capture) $\sigma = 7$ b 
Xe + n → Xe + $\gamma$ (neutron capture) $\sigma = 10^6$ b 


You can see that, although the neutron inelastic scattering cross sections of I 
and Xe are similar, the neutron capture cross sections are very different. These 
measurements are therefore telling us something interesting and unusual about 
the properties of the nucleus Xe. 

Suppose a beam of particles is incident on a thin target of area S, which contains 
a total of N nuclei. The effective area of each nucleus is the cross section $\sigma$, and 
so the total effective area of all the nuclei in the target is (ignoring shadowing 
effects) $\sigma N$. The fraction of the target area that this represents is $\sigma N/S$, and as 
long as this ratio is small, shadowing effects are negligible. This fraction is the 
probability for the reaction to occur. 

Suppose the incident particles strike the target at a rate of $I_0$ particles per second, 
and suppose the outgoing particles y are emitted at a rate of R per second. (This is 
also the rate at which the product nucleus Y is formed.) Then the reaction probability 
can also be expressed as the rate of y divided by the rate of x, or $R/I_0$. Setting 
the two expressions for the reaction probability equal to each other, we obtain 
$\sigma N/S = R/I_0$, or 

$$R = \frac{\sigma N}{S} I_0 \tag{13.1}$$

This gives a relationship between the reaction cross section and the rate of 
emission of y. 

In a reactor, the intensity of neutrons is usually expressed in terms of the rate 
at which neutrons cross a unit area perpendicular to the beam, or neutron flux 
$\phi$ (neutrons/cm$^2$/s). The cross section is $\sigma$ (square centimeters per nucleus per 
incident neutron). The rate R also depends on the number of target nuclei. Suppose 
the mass of the target is m; the number of target nuclei is then $N = (m/M)N_A$, 
where M is the molar mass, and $N_A$ is Avogadro’s constant ($6.02 \times 10^{23}$ atoms 
per mole). Thus, for neutron-induced reactions, using Eq. 13.1 with $\phi = I_0/S$ we obtain 


$$R = \phi \sigma N = \phi \sigma \frac{m}{M} N_A \tag{13.2}$$

Example 13.1 

For a certain incident proton energy the reaction p + $^{56}$Fe → n + $^{56}$Co has a 
cross section of 0.40 b. If we bombard a target in the form of a 1.0-cm-square, 
1.0-μm-thick iron foil with a beam of protons equivalent to a current of 3.0 μA, 
and if the beam is spread uniformly over the entire surface of the target, at what 
rate are the neutrons produced? 

Solution 
We first calculate the number of nuclei in the target. The volume of the target is 
$V = (1.0\ \text{cm})^2(1.0\ \mu\text{m}) = 1.0 \times 10^{-4}\ \text{cm}^3$, and (using the density of iron 
of 7.9 g/cm$^3$) its mass is $m = \rho V = (7.9\ \text{g/cm}^3)(1.0 \times 10^{-4}\ \text{cm}^3) = 7.9 \times 10^{-4}$ g. 
The number of atoms (or nuclei) is then 

$$N = \frac{m N_A}{M} = \frac{(7.9 \times 10^{-4}\ \text{g})(6.02 \times 10^{23}\ \text{atoms/mole})}{56\ \text{g/mole}} = 8.5 \times 10^{18}\ \text{atoms}$$

Next we need to find the number of particles per second in the incident beam. We 
are given that the current is $3.0 \times 10^{-6}$ A = $3.0 \times 10^{-6}$ C/s, and with each proton 
having a charge of $1.6 \times 10^{-19}$ C, the beam intensity is 

$$I_0 = \frac{3.0 \times 10^{-6}\ \text{C/s}}{1.6 \times 10^{-19}\ \text{C/particle}} = 1.9 \times 10^{13}\ \text{particles/s}$$

From Eq. 13.1 we can now find R: 

$$R = \frac{N \sigma I_0}{S} = \frac{(8.5 \times 10^{18}\ \text{nuclei})(0.40 \times 10^{-24}\ \text{cm}^2/\text{nucleus})(1.9 \times 10^{13}\ \text{particles/s})}{(1\ \text{cm})^2} = 6.5 \times 10^{7}\ \text{particles/s}$$

About $10^8$ neutrons per second are emitted from the target. 
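
The arithmetic of Example 13.1 can be checked with a few lines of Python. This is only a sketch: the function names are illustrative (they do not come from the text), and the constants are the rounded values used in the example.

```python
# Sketch of Eq. 13.1 for a charged-particle beam on a thin foil.
# Function names are hypothetical; constants are the rounded values used in the text.
AVOGADRO = 6.02e23            # atoms per mole
ELEMENTARY_CHARGE = 1.6e-19   # coulombs
BARN_IN_CM2 = 1.0e-24         # 1 b = 1e-28 m^2 = 1e-24 cm^2


def number_of_nuclei(mass_g, molar_mass_g_per_mole):
    """Number of target nuclei, N = (m / M) N_A."""
    return mass_g / molar_mass_g_per_mole * AVOGADRO


def beam_particle_rate(current_a, charge_number=1):
    """Beam intensity I_0 (particles per second) for a given beam current."""
    return current_a / (charge_number * ELEMENTARY_CHARGE)


def reaction_rate(sigma_b, n_nuclei, beam_rate, area_cm2):
    """Eq. 13.1: R = sigma N I_0 / S, with sigma given in barns."""
    return sigma_b * BARN_IN_CM2 * n_nuclei * beam_rate / area_cm2


# Numbers from Example 13.1: 1.0 cm x 1.0 cm x 1.0 um iron foil, 3.0 uA of protons.
foil_area = 1.0 * 1.0               # cm^2
foil_volume = foil_area * 1.0e-4    # cm^3 (1.0 um = 1.0e-4 cm)
foil_mass = 7.9 * foil_volume       # g, using the density 7.9 g/cm^3
n_fe = number_of_nuclei(foil_mass, 56.0)
i0 = beam_particle_rate(3.0e-6)
rate = reaction_rate(0.40, n_fe, i0, foil_area)
print(f"N = {n_fe:.2e} nuclei, I0 = {i0:.2e} /s, R = {rate:.2e} /s")
```

Carrying the intermediate values without rounding gives a rate of about 6.4 × 10^7 per second, slightly below the 6.5 × 10^7 obtained in the example from the rounded values of N and I_0; the conclusion (roughly 10^8 neutrons per second) is unchanged.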





13.2 RADIOISOTOPE PRODUCTION IN NUCLEAR REACTIONS 





Often we use nuclear reactions to produce radioactive isotopes. In this procedure, 
a stable (nonradioactive) isotope X is irradiated with the particle x to form 
the radioactive isotope Y; the outgoing particle y is of no interest and is not 
observed. In this case we don’t observe the individual particles Y as they are 
produced in the reaction; instead, we irradiate the target to produce some number 
of radioactive Y nuclei that remain within the target. After the irradiation we 
observe the radioactive decay of the nuclei Y. 

We would like now to calculate the activity of the isotope Y that is produced 
from a given exposure to a certain quantity of the particle x for a certain time t. 
Let R represent the constant rate at which Y is produced; this quantity is related 
to the cross section and to the intensity of the beam of x, as given in Eq. 13.1. 
In a time interval dt, the number of Y nuclei produced is R dt. The isotope Y is 
radioactive, so the number of nuclei of Y that decay in the interval dt is $\lambda N\,dt$, 
where $\lambda$ is the decay constant ($\lambda = 0.693/t_{1/2}$) and N is the number of Y nuclei 
present. The net change dN in the number of Y nuclei is 

$$dN = R\,dt - \lambda N\,dt \tag{13.3}$$

or 

$$\frac{dN}{dt} = R - \lambda N \tag{13.4}$$

The solution to this differential equation is 
$$N(t) = \frac{R}{\lambda}\left(1 - e^{-\lambda t}\right) \tag{13.5}$$

and the activity is 

$$a(t) = \lambda N = R\left(1 - e^{-\lambda t}\right) \tag{13.6}$$

Notice that, as expected, a = 0 at t = 0 (there are no nuclei of type Y present 
at the start). For large irradiation times $t \gg t_{1/2}$, this expression approaches the 
constant value R. When t is small compared with the half-life $t_{1/2}$, the activity 
increases linearly with time: 

$$a(t) \cong R\left[1 - (1 - \lambda t + \cdots)\right] = R\lambda t \qquad (t \ll t_{1/2}) \tag{13.7}$$

Figure 13.4 shows the relationship between a(t) and t, with the irradiation time 
measured in units of the half-life. As you can see, not much 
activity is gained by irradiating for more than about two half-lives. 
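
The saturation behavior described by Eq. 13.6 can be tabulated directly. The sketch below uses illustrative function names (they are not from the text) and expresses the activity as a fraction of its limiting value R.

```python
import math


def activity(rate, half_life, t):
    """Eq. 13.6: a(t) = R (1 - exp(-lambda t)); t and half_life share the same unit."""
    decay_constant = math.log(2) / half_life
    return rate * (1.0 - math.exp(-decay_constant * t))


def saturation_fraction(n_half_lives):
    """a/R after irradiating for a given number of half-lives: 1 - (1/2)**n."""
    return 1.0 - 0.5 ** n_half_lives


for n in range(1, 6):
    print(f"{n} half-lives: a/R = {saturation_fraction(n):.4f}")
```

After one half-life the activity is half of its saturation value, after two it is three quarters, and each further half-life closes only half of the remaining gap, which is why long irradiations bring diminishing returns.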


FIGURE 13.4 Formation of activity in a nuclear reaction. 

Example 13.2 

Thirty milligrams of gold are exposed to a neutron flux of $3.0 \times 10^{12}$ neutrons/cm$^2$/s 
for 1.0 minute. The neutron capture cross section of gold is 99 b. Find the resultant 
activity of $^{198}$Au. 

Solution 
From Appendix D we find that the stable isotope of gold has a mass number of 
A = 197, and that radioactive $^{198}$Au has a half-life of 2.70 d = $3.88 \times 10^{3}$ min. 
Thus, using Eq. 13.2, 

$$R = \phi \sigma \frac{m}{M} N_A = \left(3.0 \times 10^{12}\ \frac{\text{neutrons}}{\text{cm}^2 \cdot \text{s}}\right)\left(99 \times 10^{-24}\ \frac{\text{cm}^2}{\text{neutron} \cdot \text{nucleus}}\right)\left(\frac{0.030\ \text{g}}{197\ \text{g/mole}}\right)(6.02 \times 10^{23}\ \text{atoms/mole}) = 2.7 \times 10^{10}\ \text{s}^{-1}$$

In this case $t \ll t_{1/2}$, so we can use Eq. 13.7: 

$$a = R \lambda t = (2.7 \times 10^{10}\ \text{s}^{-1})\left(\frac{0.693}{3.88 \times 10^{3}\ \text{min}}\right)(1.0\ \text{min}) = 4.8 \times 10^{6}\ \text{s}^{-1} = 130\ \mu\text{Ci}$$
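
As a check on the use of the linear approximation in Example 13.2, the sketch below evaluates both Eq. 13.7 and the full expression of Eq. 13.6 for the same numbers. The function name is illustrative, and the conversion 1 Ci = 3.7 × 10^10 decays/s is the standard definition of the curie.

```python
import math

AVOGADRO = 6.02e23       # atoms per mole (value used in the text)
BARN_IN_CM2 = 1.0e-24    # 1 b in cm^2
CURIE = 3.7e10           # decays per second in 1 Ci


def production_rate(flux, sigma_b, mass_g, molar_mass_g_per_mole):
    """Eq. 13.2: R = phi sigma (m / M) N_A, with flux in neutrons/cm^2/s."""
    return flux * sigma_b * BARN_IN_CM2 * (mass_g / molar_mass_g_per_mole) * AVOGADRO


rate = production_rate(3.0e12, 99.0, 0.030, 197.0)   # Au-198 nuclei formed per second
decay_constant = 0.693 / 3.88e3                      # per minute
t = 1.0                                              # irradiation time in minutes

a_linear = rate * decay_constant * t                   # Eq. 13.7
a_exact = -rate * math.expm1(-decay_constant * t)      # Eq. 13.6, R (1 - exp(-lambda t))

print(f"R = {rate:.2e} /s")
print(f"linear: {a_linear:.3e} /s = {a_linear / CURIE * 1e6:.0f} uCi")
print(f"exact:  {a_exact:.3e} /s")
```

For an irradiation time of one minute against a half-life of 2.70 d, the two results differ by roughly one part in 10^4, so the linear form is more than adequate here.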





Example 13.3 

The radioactive isotope $^{61}$Cu ($t_{1/2}$ = 3.41 h) is to be produced by alpha particle 
reactions on a target of $^{59}$Co. A foil of cobalt, measuring 1.5 cm × 1.5 cm in area 
and 2.5 μm in thickness, is placed in a 12.0-μA beam of alpha particles; the beam 
uniformly covers the target. For the alpha energy selected, the reaction has a cross 
section of 0.640 b. (a) At what rate is the $^{61}$Cu produced? (b) What is the resulting 
activity of $^{61}$Cu after 2.0 h of irradiation? 

Solution 
(a) The reaction is $^{59}$Co + $^{4}$He → $^{61}$Cu + 2n. The mass of the target is 
$m = \rho V = (8.9\ \text{g/cm}^3)(1.5\ \text{cm})^2(2.5 \times 10^{-4}\ \text{cm}) = 5.0 \times 10^{-3}$ g and the 
number of target atoms is 

$$N = \frac{m N_A}{M} = \frac{(5.0 \times 10^{-3}\ \text{g})(6.02 \times 10^{23}\ \text{atoms/mole})}{58.9\ \text{g/mole}} = 5.12 \times 10^{19}\ \text{atoms}$$

The rate at which the beam strikes the target is 

$$I_0 = \frac{12.0 \times 10^{-6}\ \text{A}}{2 \times 1.60 \times 10^{-19}\ \text{C/particle}} = 3.75 \times 10^{13}\ \text{particles/s}$$

The rate at which the $^{61}$Cu is produced is, using Eq. 13.1, 

$$R = \frac{N \sigma I_0}{S} = \frac{(5.12 \times 10^{19}\ \text{atoms})(0.640 \times 10^{-24}\ \text{cm}^2)(3.75 \times 10^{13}\ \text{s}^{-1})}{(1.5\ \text{cm})^2} = 5.5 \times 10^{8}\ \text{s}^{-1}$$

(b) The activity is determined from Eq. 13.6: 

$$a = R\left(1 - e^{-\lambda t}\right) = (5.5 \times 10^{8}\ \text{s}^{-1})\left(1 - e^{-(0.693)(2.0\ \text{h})/(3.41\ \text{h})}\right) = 1.8 \times 10^{8}\ \text{s}^{-1} = 4.9\ \text{mCi}$$

FIGURE 13.5 Momenta of particles before (top) and after (bottom) the 
reaction. 
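
Returning to Example 13.3, a question that often arises in practice is the reverse one: how long must the target be irradiated to reach a desired activity? Solving Eq. 13.6 for t gives t = -ln(1 - a/R)/λ. This rearrangement is not written out in the text; the sketch below, with illustrative function names, applies it to the numbers of Example 13.3.

```python
import math


def activity(rate, half_life, t):
    """Eq. 13.6: a(t) = R (1 - exp(-lambda t)); t and half_life share the same unit."""
    decay_constant = math.log(2) / half_life
    return -rate * math.expm1(-decay_constant * t)


def irradiation_time(rate, half_life, target_activity):
    """Eq. 13.6 solved for t; the target must lie below the saturation value R."""
    if not 0.0 <= target_activity < rate:
        raise ValueError("target activity must satisfy 0 <= a < R")
    decay_constant = math.log(2) / half_life
    return -math.log1p(-target_activity / rate) / decay_constant


# Example 13.3: R = 5.5e8 /s, half-life 3.41 h, quoted activity 1.8e8 /s.
t_needed = irradiation_time(5.5e8, 3.41, 1.8e8)
print(f"time to reach 1.8e8 decays/s: {t_needed:.2f} h")
```

The result is close to the 2.0 h of the example; the small difference comes from the rounding of R and a to two significant figures. Asking for an activity at or above R raises an error, because Eq. 13.6 never exceeds its saturation value.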


13.3 LOW-ENERGY REACTION KINEMATICS 


We assume for this discussion that the velocities of the nuclear particles are 
sufficiently small that we can use nonrelativistic kinematics. We consider a 
projectile x moving with momentum $p_x$ and kinetic energy $K_x$. The target is at 
rest, and the reaction products have momenta $p_y$ and $p_Y$ and kinetic energies $K_y$ 
and $K_Y$. The particles y and Y are emitted at angles $\theta_y$ and $\theta_Y$ with respect to the 
direction of the incident beam. Figure 13.5 illustrates this reaction. We assume 
that the resultant nucleus Y is not observed in the laboratory (if it is a heavy 
nucleus, moving relatively slowly, it generally stops within the target). 

As we did in the case of radioactive decay, we use energy conservation to 
compute the Q value for this reaction (assuming X is initially at rest): 

initial energy = final energy 

$$m_N(x)c^2 + K_x + m_N(X)c^2 = m_N(y)c^2 + K_y + m_N(Y)c^2 + K_Y \tag{13.8}$$

The m’s in Eq. 13.8 represent the nuclear masses of the reacting particles. 
However, as we have discussed, the number of protons must be balanced in a 
nuclear reaction: 

$$Z_x + Z_X = Z_y + Z_Y \tag{13.9}$$

We can therefore add equal numbers of electron masses to each side of Eq. 13.8 
and, neglecting as usual the electron binding energy, the nuclear masses become 
atomic masses with no additional corrections needed. Rewriting Eq. 13.8, we 
obtain 

$$[m(x) + m(X) - m(y) - m(Y)]c^2 = K_y + K_Y - K_x \tag{13.10}$$

The rest energy difference between the initial particles and final particles is defined 
to be the Q value of the reaction 

$$Q = (m_i - m_f)c^2 = [m(x) + m(X) - m(y) - m(Y)]c^2 \tag{13.11}$$

and, combining Eqs. 13.10 and 13.11, we see that the Q value is equal to the 
difference in kinetic energy between the final particles and initial particle: 


$$Q = K_y + K_Y - K_x \tag{13.12}$$

Example 13.4 

(a) Compute the Q value for the reaction $^{2}$H + $^{63}$Cu → n + $^{64}$Zn. (b) Deuterons of 
energy 12.00 MeV are incident on a $^{63}$Cu target, and neutrons are observed with 
16.85 MeV of kinetic energy. Find the kinetic energy of the $^{64}$Zn. 

Solution 
(a) The Q value can be found using Eq. 13.11 with masses from Appendix D: 

$$Q = [m(^{2}\text{H}) + m(^{63}\text{Cu}) - m(\text{n}) - m(^{64}\text{Zn})]c^2 = (2.014102\ \text{u} + 62.929597\ \text{u} - 1.008665\ \text{u} - 63.929142\ \text{u})(931.5\ \text{MeV/u}) = 5.488\ \text{MeV}$$

(b) From Eq. 13.12, we find 

$$K_Y = Q + K_x - K_y = 5.488\ \text{MeV} + 12.00\ \text{MeV} - 16.85\ \text{MeV} = 0.64\ \text{MeV}$$
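
The bookkeeping of Eqs. 13.11 and 13.12 is easy to automate. The sketch below uses illustrative function names and the atomic masses quoted in Example 13.4; it is a check of the arithmetic rather than a general-purpose tool.

```python
U_TO_MEV = 931.5   # MeV per atomic mass unit (value used in the text)


def q_value(m_projectile, m_target, m_outgoing, m_residual):
    """Eq. 13.11: Q = [m(x) + m(X) - m(y) - m(Y)] c^2, masses in u, result in MeV."""
    return (m_projectile + m_target - m_outgoing - m_residual) * U_TO_MEV


def residual_kinetic_energy(q, k_projectile, k_outgoing):
    """Eq. 13.12 rearranged: K_Y = Q + K_x - K_y (all in MeV)."""
    return q + k_projectile - k_outgoing


# Example 13.4: 2H + 63Cu -> n + 64Zn, with atomic masses in u.
q = q_value(2.014102, 62.929597, 1.008665, 63.929142)
k_zn = residual_kinetic_energy(q, 12.00, 16.85)
print(f"Q = {q:.3f} MeV, K(64Zn) = {k_zn:.2f} MeV")
```

A positive Q, as here, means that the products carry more kinetic energy than the projectile brought in.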





Reactions for which Q > 0 convert nuclear energy to kinetic energy of y and Y. 
They are called exothermic or exoergic reactions. Reactions with Q < 0 require 
energy input, in the form of the kinetic energy of x, to be converted into nuclear 
binding energy. These are known as endothermic or endoergic reactions. 

In an endoergic reaction, we must supply at least enough kinetic energy to 
provide the additional rest energy of the reaction products. There is thus some 
minimum, or threshold, kinetic energy of x, below which the reaction will not take 
place. This threshold kinetic energy not only must supply the additional rest energy 
of the products, but also must supply some kinetic energy of the products; even at 
the minimum energy, the products cannot be at rest, for that would violate 
conservation of linear momentum: the momentum $p_x$ before the collision would not be 
equal to the momentum of the final products after the collision if they were formed 
at rest. 

This problem is most easily analyzed in the center-of-mass reference frame. 
In the lab frame before the reaction, the center of mass moves with velocity 
$v = m(x)v_x/[m(x) + m(X)]$. If we travel with that velocity and observe the 
reaction, we would see x moving with velocity $v_x - v$ and X moving with velocity 
$-v$, as shown in Figure 13.6. If x has exactly the threshold kinetic energy, in this 
reference frame the reaction products y and Y would be at rest. 

We must conserve total relativistic energy $K + mc^2$ in the reaction, and 
we restrict our discussion to small velocities $v \ll c$ so that the nonrelativistic 
expression for the kinetic energy can be used. Energy conservation in the 
center-of-mass frame gives: 

$$\tfrac{1}{2}m(x)(v_x - v)^2 + \tfrac{1}{2}m(X)(-v)^2 + m(x)c^2 + m(X)c^2 = m(y)c^2 + m(Y)c^2 \tag{13.13}$$

where $v_x$ represents the threshold velocity in the lab frame. Substituting the value 
of v and doing a bit of algebra, we can find the threshold kinetic energy (in the 
laboratory reference frame): 

$$K_{\text{th}} = -Q\left(1 + \frac{m(x)}{m(X)}\right) \tag{13.14}$$
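
The algebra leading from Eq. 13.13 to Eq. 13.14 can be sketched as follows. The shorthand $m_x = m(x)$ and $m_X = m(X)$ is introduced here only for brevity. Moving the rest energies in Eq. 13.13 to the right-hand side and using the definition of Q in Eq. 13.11 gives

$$\tfrac{1}{2}m_x(v_x - v)^2 + \tfrac{1}{2}m_X v^2 = [m(y) + m(Y) - m(x) - m(X)]c^2 = -Q$$

With $v = m_x v_x/(m_x + m_X)$ we have $v_x - v = m_X v_x/(m_x + m_X)$, so the left-hand side becomes

$$\tfrac{1}{2}m_x v_x^2\,\frac{m_X^2 + m_x m_X}{(m_x + m_X)^2} = \tfrac{1}{2}m_x v_x^2\,\frac{m_X}{m_x + m_X}$$

Because $\tfrac{1}{2}m_x v_x^2$ is the threshold kinetic energy $K_{\text{th}}$ in the lab frame, it follows that

$$K_{\text{th}} = -Q\,\frac{m_x + m_X}{m_X} = -Q\left(1 + \frac{m_x}{m_X}\right)$$

Only the fraction $m_X/(m_x + m_X)$ of the projectile's kinetic energy is available in the center-of-mass frame; the rest is tied up in the motion of the center of mass.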


FIGURE 13.6 Reaction at threshold in center-of-mass reference frame, before and 
after the reaction. 

Example 13.5 

Calculate the threshold kinetic energy for the reaction p + $^{3}$H → $^{2}$H + $^{2}$H (a) if 
protons are incident on $^{3}$H at rest; (b) if $^{3}$H (tritons) are incident on protons at rest. 

Solution 
The Q value is 

$$Q = [m(^{1}\text{H}) + m(^{3}\text{H}) - 2m(^{2}\text{H})]c^2 = (1.007825\ \text{u} + 3.016049\ \text{u} - 2 \times 2.014102\ \text{u})(931.5\ \text{MeV/u}) = -4.033\ \text{MeV}$$

(a) When protons are incident on $^{3}$H, the identification is x = $^{1}$H, X = $^{3}$H, so 

$$K_{\text{th}} = -Q\left(1 + \frac{m(^{1}\text{H})}{m(^{3}\text{H})}\right) = (4.033\ \text{MeV})\left(1 + \frac{1.007825\ \text{u}}{3.016049\ \text{u}}\right) = 5.381\ \text{MeV}$$

(b) When $^{3}$H is incident on protons, the identification of x and X is reversed, so 

$$K_{\text{th}} = -Q\left(1 + \frac{m(^{3}\text{H})}{m(^{1}\text{H})}\right) = (4.033\ \text{MeV})\left(1 + \frac{3.016049\ \text{u}}{1.007825\ \text{u}}\right) = 16.10\ \text{MeV}$$

This calculation illustrates a general result: Less energy is required for a nuclear 
reaction if a light particle is incident on a heavy target than if a heavy particle is 
incident on a light target. 
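
The comparison in Example 13.5 can be reproduced with a short function. This is a sketch with an illustrative function name; it uses the nonrelativistic result of Eq. 13.14 and returns zero for reactions with Q ≥ 0, which have no threshold in this treatment.

```python
U_TO_MEV = 931.5   # MeV per atomic mass unit (value used in the text)


def threshold_energy(m_projectile, m_target, product_masses):
    """Eq. 13.14: K_th = -Q (1 + m(x)/m(X)); masses in u, result in MeV."""
    q = (m_projectile + m_target - sum(product_masses)) * U_TO_MEV
    if q >= 0.0:
        return 0.0   # exoergic reaction: no threshold in this nonrelativistic treatment
    return -q * (1.0 + m_projectile / m_target)


M_H1, M_H2, M_H3 = 1.007825, 2.014102, 3.016049   # atomic masses in u

k_proton_beam = threshold_energy(M_H1, M_H3, [M_H2, M_H2])   # protons on tritium
k_triton_beam = threshold_energy(M_H3, M_H1, [M_H2, M_H2])   # tritons on hydrogen
print(f"p on 3H: {k_proton_beam:.3f} MeV, 3H on p: {k_triton_beam:.2f} MeV")
```

The factor of about three between the two thresholds is just the ratio of the bracketed terms in Eq. 13.14, (1 + 3)/(1 + 1/3), because the Q value is the same in both cases.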








Lise Meitner (1878-1968, Germany- 
Sweden). Known for her research 
into radioactivity, Meitner discovered 
the radioactive element protactinium 
(Z = 91) and was among the first to 
study the properties of beta decay. 
Her most important discovery was the 
explanation for the puzzling results 
that were observed when uranium was 
bombarded with neutrons. She sug- 
gested that the uranium could split into 
two pieces, and she proposed the name 
“fission” for this process. Element 109 
is named in her honor. 


The massive nucleus $^{254}$Cf (Z = 98) can be produced in accelerators by collisions 
between suitably chosen projectiles and targets. This nucleus is of special interest 
because it is also produced in supernova explosions, and knowledge of its 
properties provides a key to understanding the formation of the elements in 
stars, as we discuss later in this chapter. $^{254}$Cf is radioactive, decaying with 
a half-life of 60.5 d. The Q values for positive and negative beta decay of 
$^{254}$Cf are both negative, so that mode of decay is not available. Alpha decay is 
energetically possible but the Coulomb barrier is very high, making that decay 
mode improbable. Instead, $^{254}$Cf decays by splitting into two pieces of much 
smaller masses; for example, 

$$^{254}_{98}\text{Cf}_{156} \rightarrow {}^{140}_{54}\text{Xe}_{86} + {}^{110}_{44}\text{Ru}_{66} + 4\text{n}$$


This mode of decay is known as nuclear fission. Fission can occur as a spontaneous 
radioactive decay process for a relatively small number of massive nuclei, and it 
can also be induced in other nuclei by adding energy to make the nucleus less 
stable. In addition to the two fission fragments, some neutrons are usually emitted 
in the fission process. 

We can consider the nucleus to be a mixture of protons and neutrons moving 
about under the mutual attraction of their nuclear forces and (in the case of the 
protons) the repulsion of their Coulomb forces. For many nuclei, the result of 
these interactions is a spherical shape that has often been compared to a drop of 
liquid floating freely in a region where no external forces act. The equilibrium 
shape will be close to spherical, and if the nucleus is distorted (for example, 
by stretching it in one direction) it can vibrate about its equilibrium shape and 
eventually return to its spherical shape somewhat like a stretched spring or other 
elastic system returns to its original configuration. Figure 13.7 shows a schematic 
representation of the energy of the drop as a function of the distortion. 


For other nuclei, the equilibrium shape is not spherical but is already distorted; 
their surfaces are like an ellipse rotated about its long axis. For these nuclei, 
the major axis might be 30-50% longer than the minor axis. If these nuclei are 
stretched by a small amount and released, they will usually revert to their distorted 
equilibrium shape. But if the stretching is sufficiently large, they may not return 
to equilibrium but instead may split in two, as represented in Figure 13.8. 

This occurs because of the rather delicate balance between the nuclear force 
that keeps the nucleus together and the Coulomb repulsion force, which makes the 
nucleus less stable. When the stretching is sufficiently large, the total attractive 
nuclear force is reduced (because on the average the protons and neutrons have 
fewer “near neighbors” with which to interact), but the Coulomb force, because 
it has a long range, is not reduced significantly. The center of the distorted shape 
can be “pinched off” and the delicate balance between the nuclear and Coulomb 
forces is upset. The Coulomb force can then drive the two fragments apart. 

Figure 13.8 shows a kind of “barrier” between the distorted equilibrium shape 
and the fissioned nucleus. This fission barrier has a height of roughly 6 MeV, 
but as we know it is possible to “tunnel” through the barrier. Thus nuclei can 
undergo fission with smaller amounts of excitation energy. It is very unlikely for 
a nucleus to tunnel through the barrier at its thickest, but as the excitation energy 
is increased the barrier becomes less thick and fission becomes more probable. 
For the radioactive decay of $^{254}$Cf, the probability to penetrate the fission barrier 
(99.7%) is greater than the probability to penetrate the barrier to alpha decay 
(0.3%). 

The splitting of a nucleus such as $^{254}$Cf into two fragments does not always 
produce the same set of final nuclei. Many different processes are possible, with 
the actual outcome determined according to statistical probability. For $^{254}$Cf it 
is most likely that one fragment will have a mass near A = 110 and the other 
near A = 140, but other mass distributions can occur. Figure 13.9 shows the mass 
distribution of the fragments in the fission of $^{254}$Cf and $^{235}$U. 

The number of neutrons emitted in fission can also vary. For $^{254}$Cf, the average 
is about 3.9 neutrons per fission. 


Energy Released in Fission 


According to Figure 12.4, the binding energy per nucleon of $^{254}$Cf is about 7 MeV. 
If a nucleus of $^{254}$Cf splits into two nuclei with A = 127, the binding energy per 
nucleon of the final nuclei would be about 8 MeV. Thus the binding energy of 
each of those 254 nucleons increases from about 7 MeV to about 8 MeV, which 
gives an increase in the total binding energy of the nucleus of about 250 MeV. If 
the final nucleons are more tightly bound, that means that an equivalent amount 
of energy has been released to some other form. 

This energy release from a single nucleus is an enormous quantity. For 
comparison, chemical processes such as combustion usually release a few eV per 
atom. In fission we have an energy release at the atomic level that is $10^{8}$ times 
larger than the energy released in chemical processes! 

FIGURE 13.7 The energy of a nucleus with a spherical equilibrium shape 
increases as the distortion increases. 

FIGURE 13.8 The energy of a nucleus with a nonspherical equilibrium 
shape. If enough energy is added, the nucleus can tunnel through the fission 
barrier and split into two pieces. 

FIGURE 13.9 The mass distribution of fission fragments from $^{254}$Cf (solid 
line) and $^{235}$U (dashed line). 

Where does this energy go? Let’s imagine that $^{254}$Cf suddenly breaks in half to 
form two nuclei with Z = 49 and A = 127 that are just touching at their surfaces. 
The radius of each of these nuclei is $1.2(127)^{1/3}$ = 6.0 fm, and the Coulomb 
repulsion of these two nuclei would be $U = (49e)^2/4\pi\epsilon_0(2R)$ = 286 MeV, which 
is quite close to our estimate of the nuclear binding energy released. These two 
charged objects repel one another, so that the Coulomb energy quickly becomes 
kinetic energy. Most of the energy released in fission is thus produced as 

nuclear binding energy → Coulomb potential energy of nuclear fragments → kinetic energy of fragments 


About 80% of the fission energy is released in this form. The fragments do not 
travel very far before dissipating their kinetic energy through atomic collisions, 
which are usually observed as a temperature increase of the material. In a power 
reactor, this temperature increase can be used to produce steam, which can drive a 
turbine to produce electricity. The remaining 20% of the energy released appears 
as decay products (betas and gammas) of the highly radioactive fragments and 
kinetic energies of the neutrons that may also be emitted during fission. 
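
The Coulomb estimate for two touching fragments, quoted above as 286 MeV, can be reproduced numerically. The sketch below assumes the radius rule R = 1.2 A^(1/3) fm used in the estimate and the standard value e²/4πε₀ = 1.44 MeV·fm; the function names are illustrative.

```python
COULOMB_CONSTANT_MEV_FM = 1.44   # e^2 / (4 pi epsilon_0) in MeV fm (standard value)
R0_FM = 1.2                      # radius parameter in fm, as used in the estimate above


def nuclear_radius(mass_number):
    """Nuclear radius R = 1.2 A^(1/3), in fm."""
    return R0_FM * mass_number ** (1.0 / 3.0)


def touching_coulomb_energy(z1, a1, z2, a2):
    """Coulomb energy (MeV) of two spherical fragments whose surfaces just touch."""
    separation = nuclear_radius(a1) + nuclear_radius(a2)
    return COULOMB_CONSTANT_MEV_FM * z1 * z2 / separation


u_symmetric = touching_coulomb_energy(49, 127, 49, 127)
binding_gain = 254 * (8.0 - 7.0)   # MeV, from about 7 to about 8 MeV per nucleon
print(f"Coulomb energy: {u_symmetric:.0f} MeV, binding-energy gain: {binding_gain:.0f} MeV")
```

The same function can be applied to an asymmetric split such as the Xe + Ru pair given earlier for $^{254}$Cf; the touching-spheres picture is crude in either case, so agreement with the binding-energy estimate at the level of 10 to 15% is all that should be expected.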


Induced Fission 


The radioactive decay of $^{254}$Cf is an example of the spontaneous fission of a 
nucleus that is sufficiently unstable that it can tunnel through the fission barrier 
with no additional energy needed. However, $^{254}$Cf is an artificially produced 
nucleus that does not occur in nature and in which the energy released in fission is 
essentially stored in the nucleus by the nuclear reaction that was used to produce 
the $^{254}$Cf. There are other examples of fissionable nuclei that may occur naturally 
or are produced artificially but that do not fission spontaneously. These nuclei can 
be made to fission by the addition of some energy, which might be in the form 
of an absorbed photon but more often occurs with the absorption of a neutron. In 
these cases the energy input is very small compared with the energy released in 
the fission process. One such nucleus is $^{235}$U, which might absorb a neutron to 
make $^{236}$U and then fission according to 

$$^{235}_{92}\text{U}_{143} + \text{n} \rightarrow {}^{93}_{37}\text{Rb}_{56} + {}^{141}_{55}\text{Cs}_{86} + 2\text{n}$$

As in the case of spontaneous fission, many different outcomes are possible, 
with a statistical distribution of the masses of the fragments. In the fission of 
$^{235}$U, the most probable outcome has fragments of mass numbers near A = 90 
and A = 140 (as in Figure 13.9), and the average number of neutrons is about 
2.5. Examples of easily produced fissionable nuclei include $^{239}$Pu (obtained from 
the beta decay of $^{239}$U, which is made when $^{238}$U absorbs a neutron) and $^{233}$U 
(obtained in a similar manner from $^{232}$Th). 

In a bulk sample of uranium, each of the neutrons emitted in fission can be 
absorbed by another nucleus of $^{235}$U and thus induce another fission process, 
resulting in the emission of still more neutrons, followed by more fissions, and 
so forth. As long as the average number of neutrons available to produce new 
fissions is greater than 1 per reaction, the number of fissions grows with time. 
This avalanche or chain reaction of fission events, each with the release of about 
200 MeV of energy, can either occur under very rapid and uncontrolled conditions, 
as in a nuclear weapon, or else under slower and carefully controlled conditions, 
as in a nuclear reactor. 
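
The growth of a chain reaction can be illustrated with a simple generation-by-generation count. In the sketch below the symbol k, for the average number of neutrons per fission that go on to cause another fission, is introduced here for illustration and is not defined in the text; the model ignores timing and all spatial effects.

```python
def fissions_by_generation(k, generations, initial_fissions=1.0):
    """Number of fissions in generations 0..generations when each fission causes k more."""
    return [initial_fissions * k ** g for g in range(generations + 1)]


def energy_released_mev(k, generations, initial_fissions=1.0, mev_per_fission=200.0):
    """Total energy released over all generations, at about 200 MeV per fission."""
    return mev_per_fission * sum(fissions_by_generation(k, generations, initial_fissions))


for k in (0.9, 1.0, 1.1):
    last = fissions_by_generation(k, 100)[-1]
    print(f"k = {k}: fissions in generation 100 per initial fission = {last:.3g}")
```

Even a 10% excess over k = 1 multiplies the fission rate by more than 10^4 in 100 generations, while a 10% deficit makes the reaction die away; only k = 1 gives a steady rate.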


Electrical Power from Fission 


Electrical power can be generated using the thermal energy released in fission 
to boil water. Ordinary uranium by itself cannot serve as fuel for the reactor for 
several reasons, of which three in particular stand out: enrichment, moderation, 
and control. 


Enrichment To maintain a steady energy production from fission reactions, 
we would like for one neutron from each fission to be available to produce another 
fission. Generally the average number of neutrons produced is greater than one, but 
neutrons can be lost from the reaction in a variety of ways (for example, by 
non-fission absorption by $^{238}$U in uranium fuel). Natural uranium consists of only about 
0.7% of $^{235}$U and 99.3% $^{238}$U, which means that most of the available uranium 
nuclei generally do not participate in the fission process but instead remove 
neutrons from being able to produce other fissions. To overcome this problem it 
is necessary to use enriched uranium, in which the abundance of $^{235}$U is increased 
beyond its natural value of 0.7%. Most power reactors use uranium enriched to 
3–5% $^{235}$U. Enrichment is a difficult process because $^{235}$U and $^{238}$U are chemically 
identical. It is achieved only by taking into account the small mass difference 
between the two isotopes (for example, by forcing gaseous uranium through a 
porous barrier in which the more massive $^{238}$U atoms diffuse more slowly). 


Moderation The neutrons produced in fission typically have kinetic energies of 
a few MeV. Such energetic neutrons have a relatively low probability of inducing 
new fissions, because the fission cross section generally decreases rapidly with 
increasing neutron energy. We therefore must slow down, or moderate, these 
neutrons in order to increase their chances of initiating fission events. The 
fissionable material is surrounded by a moderator, and the neutrons lose energy 
in collisions with the atoms of the moderator. When a neutron is scattered from a 
heavy nucleus like uranium, the energy of the neutron is changed hardly at all, but 
in a collision with a very light nucleus, the neutron can lose substantial energy. 
The most effective moderator is one whose atoms have about the same mass as a 
neutron; hydrogen is therefore the first choice. Ordinary water is frequently used 
as a moderator, because collisions with the protons are very effective in slowing 
the neutrons; however, neutrons have a relatively high probability of being 
absorbed by the water according to the reaction p + n → $^{2}$H + $\gamma$. So-called 
“heavy water,” in which the hydrogen is replaced by deuterium, is more useful as 
a moderator, because it has virtually no neutron absorption cross section. A 
heavy-water reactor, which has more available neutrons, can use ordinary (nonenriched) 
uranium as fuel; a reactor using ordinary water as moderator has fewer neutrons 
available to produce fission, and must therefore have more $^{235}$U in its core. 

Carbon is a light material that is solid, stable, and abundant, and that has a 
relatively small neutron absorption cross section. Enrico Fermi and his co-workers 
built the first nuclear reactor in 1942 at the University of Chicago; this reactor 
used carbon, in the form of graphite blocks, as moderator. 
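
The statement that light nuclei make the best moderators can be made quantitative with a standard result of nonrelativistic elastic scattering, which is not derived in this section: in a head-on collision with a nucleus of mass number A initially at rest, a neutron loses the fraction 4A/(1 + A)² of its kinetic energy. The sketch below evaluates this for several nuclei, approximating each mass ratio by the mass number; the function name is illustrative.

```python
def max_energy_loss_fraction(mass_number):
    """Fraction of the neutron's kinetic energy lost in a head-on elastic collision
    with a nucleus of the given mass number at rest (nonrelativistic)."""
    return 4.0 * mass_number / (1.0 + mass_number) ** 2


for name, a in [("hydrogen", 1), ("deuterium", 2), ("carbon", 12), ("uranium", 238)]:
    print(f"{name:9s} A = {a:3d}: maximum energy loss = {max_energy_loss_fraction(a):.3f}")
```

A proton can take all of the neutron's energy in a single head-on collision, deuterium about 89%, carbon about 28%, and uranium less than 2%. A real collision is usually not head-on, so the average loss per collision is smaller, but the ordering of the moderators is the same.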


Control To produce a stable nuclear reactor, the average number of neutrons 
in each fission reaction that is available to produce the next set of fission 
reactions must be exactly equal to 1. If it is even slightly greater than 1, the 
reaction rate will grow exponentially out of control. Control of the reaction rate is 
usually accomplished by inserting into the core of the reactor control rods made 
of cadmium, which has a very large cross section for absorbing neutrons and 
thus removing them from the fission process. However, small fluctuations in the 
reaction rate occur much too rapidly for any mechanical system to move the control 
rods in and out to control the number of neutrons emitted in the fission reaction. 
Fortunately, nature has provided us with the solution to this problem. About 
1% of the neutrons emitted in fission are delayed neutrons, produced not at the 
instant of fission but somewhat later, following the radioactive decays of the 
fission fragments. For example $^{93}$Rb, which might be produced in the fission of 
$^{235}$U, beta decays with a half-life of 6 s to $^{93}$Sr, which is occasionally (in about 
1% of decays) produced in a very high excited state that is unstable to neutron 
emission. The neutron appears to emerge with the 6-s half-life of the beta decay. 
This short delay time is enough to allow the control rods to be adjusted to maintain 
a constant reaction rate. The reactor is designed so that the neutron replication 
rate is just less than 1 for the prompt neutrons and exactly equal to 1 for prompt 
+ delayed neutrons, which allows the control rods to work effectively. 

FIGURE 13.10 A typical sequence of processes in fission. A nucleus of $^{235}$U 
absorbs a neutron and fissions; two prompt neutrons and one delayed neutron are 
emitted. Following moderation, two neutrons cause new fissions and the third is 
captured by $^{238}$U, resulting finally in $^{239}$Pu. 

FIGURE 13.11 The components of a pressurized-water reactor. 

Figure 13.10 summarizes some of the processes that can occur in fission. A 
nucleus of $^{235}$U captures a neutron and fissions into two heavy fragments and two 
prompt neutrons; one of the fragments emits a delayed neutron. The three neutrons 
are slowed by passage through the moderator. Two of the neutrons cause new 
fissions, and the third is captured by $^{238}$U, eventually to form fissionable $^{239}$Pu, which 
can be recovered from the fuel by chemical means. Not shown in this diagram are 
other processes that can occur: escape of neutrons through the surface of the reac- 
tor, capture in the moderator, and fission of 7*°U by fast (unmoderated) neutrons. 


Fission Reactors 


In a fission reactor, the heat produced in the fuel must be extracted to generate 
electrical power. It must also be extracted for reasons of safety, because enough 
heat is produced to melt the core and cause a serious accident. For this reason, 
reactors contain an emergency core cooling system that is designed to prevent the 
core from overheating if the heat extraction system should fail. 

Extracting the fission energy from the reactor core can be accomplished through 
several different techniques. In one design, called the pressurized water reactor 
and illustrated in Figure 13.11, the heat is extracted in a two-step process. Water 
circulates through the core under great pressure, to prevent its turning to steam. 
This hot water then in turn heats a second water system, which actually delivers 
steam to the turbine. The steam never enters the reactor core, so it does not become 
radioactive, and thus there is no radioactive material in the vicinity of the turbine. 

The power reactors in the United States are mostly the pressurized-water type 
using enriched uranium as fuel and ordinary water as moderator. Canada also 
uses pressurized water reactors, but heavy water and natural uranium are used. In 
a variation on this design, the pressurized water is replaced with a liquid metal 
such as sodium, which has the advantages of remaining liquid at much higher 
temperatures than water and of having a larger thermal conductivity than water. 
Yet another design uses gas flow through the core to extract the heat; the hot 
gas is then used to produce steam. Reactors in Great Britain are gas-cooled and 
graphite-moderated. 

There are yet other technological problems associated with nuclear power 
that are the subjects of active debate and investigation. Some of the radioactive 
isotopes among the fission fragments have very long half-lives, of the order of 
many years. The radioactive waste from reactors must be stored in a manner that 
prevents leakage of radioactive material into the biological environment. Many 
people are concerned about the safety of nuclear reactors, not only regarding 
proper design and operation, but also about their resistance to natural disasters 
such as earthquakes or to acts of terrorism or sabotage. 

In 1986, a graphite-moderated power reactor at Chernobyl in the former 
U.S.S.R. suffered a serious accident due to the disabling of the core cooling 
system, which is designed to extract the intense heat generated in the reactor 
core. The resulting temperature rise ignited the graphite moderator and caused an 
explosion of the reactor containment vessel, releasing radioactive fission products 
and exposing the inhabitants of the region to life-threatening radiation doses. The 
water-moderated power reactors used in the United States cannot suffer this kind 
of accident. 

The vulnerability of reactors to natural disasters was dramatically revealed by 
the earthquake-triggered tsunami that struck Japan’s Fukushima reactor complex 
in 2011. The flooding of the reactor buildings caused the pumps supplying cooling 
water to fail. As a result, the reactor core overheated due to the radioactive decay of 
the fission products, and a partial meltdown of the fuel rods occurred. The ensuing 
release of radioactivity contaminated a wide region of the Japanese countryside. 

Finally, as in all heat engines, the disposal of the exhaust or waste heat 
(primarily from the steam recondensing to water) generates considerable thermal 
pollution. Nuclear power plants are generally less efficient at converting fuel 
to electrical power compared with plants that burn fossil fuels, because nuclear 
plants operate at lower temperatures; while fossil-fuel plants can have efficiencies 
as large as 40%, nuclear plants are generally in the range of 30 to 35%. A plant 
operating at 30% efficiency produces 50% more thermal pollution than one that 
generates the same amount of power at 40% efficiency. 
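
The comparison above follows from energy conservation: a plant of efficiency e rejects (1/e − 1) units of waste heat for each unit of electrical energy it delivers. The short sketch below evaluates that ratio for the two efficiencies quoted; the function name is only illustrative. The result, roughly 56%, is consistent with the rounded figure of 50% used in the text.

```python
def waste_heat_per_unit_output(efficiency: float) -> float:
    """Heat rejected to the environment per unit of electrical energy produced."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    # Heat input is 1/efficiency per unit of output; the remainder is waste heat.
    return 1.0 / efficiency - 1.0


nuclear_waste = waste_heat_per_unit_output(0.30)
fossil_waste = waste_heat_per_unit_output(0.40)
excess_percent = 100.0 * (nuclear_waste / fossil_waste - 1.0)

print(f"waste heat per unit output at 30% efficiency: {nuclear_waste:.2f}")
print(f"waste heat per unit output at 40% efficiency: {fossil_waste:.2f}")
print(f"extra thermal pollution at 30% efficiency: {excess_percent:.0f}%")
```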


A Naturally Occurring Fission Reactor 


We conclude this section with a fascinating example of nature at work—the first 
sustained nuclear fission reactor on Earth was not the one constructed by Fermi 
in Chicago in 1942, but a natural fission reactor in Africa, which is believed 
to have operated two billion years ago for a period of perhaps several hundred 
thousand years. This reactor of course used naturally occurring uranium as a fuel 
and naturally occurring water as a moderator. 

It would not be possible to build such a reactor today, because the capture of 
neutrons by the protons in water results in too few neutrons remaining to sustain 
a chain reaction in uranium with only 0.7% of ²³⁵U. However, two billion years 
ago, naturally occurring uranium contained a much larger fraction of ²³⁵U than 
does present-day uranium. Both ²³⁵U and ²³⁸U are radioactive, but the half-life 
of ²³⁵U is only about one-sixth as great as the half-life of ²³⁸U. If we go back in 
time about 2 × 10⁹ y, which is half of one half-life of ²³⁸U, there was about 40% 
more ²³⁸U than there is today, but there was 2³ = 8 times as much ²³⁵U. Naturally 
occurring uranium was then about 3% ²³⁵U, and, at such enrichments, ordinary 
water can serve as an effective moderator. 
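
The enrichment estimate can be checked by running the decay law backward in time for both isotopes. The sketch below is a rough check under stated assumptions: the half-lives (7.04 × 10⁸ y for ²³⁵U and 4.468 × 10⁹ y for ²³⁸U) and the present-day ²³⁵U fraction of 0.72% are standard tabulated values supplied here, and the small ²³⁴U component is ignored. With these inputs the fraction two billion years ago comes out at a few percent, the same order as the figure quoted above.

```python
# Assumed tabulated values (not taken from the surrounding text).
T_HALF_U235_Y = 7.04e8
T_HALF_U238_Y = 4.468e9
FRACTION_U235_TODAY = 0.0072


def u235_fraction(years_ago: float) -> float:
    """Atom fraction of U-235 in natural uranium at a given time in the past."""
    # Going back in time, each isotope was more abundant by 2**(t / half-life).
    n235 = FRACTION_U235_TODAY * 2.0 ** (years_ago / T_HALF_U235_Y)
    n238 = (1.0 - FRACTION_U235_TODAY) * 2.0 ** (years_ago / T_HALF_U238_Y)
    return n235 / (n235 + n238)


for t in (0.0, 1.0e9, 2.0e9):
    print(f"{t:.1e} y ago: {100.0 * u235_fraction(t):.2f}% U-235")
```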

A deposit of such uranium, in a large enough mass and with ground water 
present to act as moderator, could have “gone critical” and begun to react. The 
reaction could have been controlled by the boiling of the water—when enough 
heat had been generated to evaporate some of the water, the reaction would slow 
down and perhaps stop, because of the lack of a moderator. When the uranium 
had cooled sufficiently to allow more liquid water to collect, the reactor would 
have started up again. This cycle could in principle have continued indefinitely, 
until enough ²³⁵U was used up or until geological changes resulted in the removal 
of the water. 

The discovery of this reactor followed the observation that the uranium that was 
being mined from that region in Africa contained too little ²³⁵U. The discrepancy 
was a very small one (the samples contained 0.7171% ²³⁵U, compared with the 
usual 0.7202%), but it was enough to stimulate the curiosity of the researchers. 
They guessed that the only mechanism that could result in the consumption of 
²³⁵U was the nuclear fission process, and this guess was tested by searching in the 
ore for stable isotopes that result from the radioactive decay of fission products. 
When such isotopes were found, and in particular when they were found in 
abundances very different from what would be expected from “natural” mineral 
deposits, the existence of the natural reactor was confirmed. 


Energy may also be released in nuclear reactions in the process of fusion, in 
which two light nuclei combine to form a heavier nucleus. The energy released 
in this process is the excess binding energy of the heavy nucleus compared with 
the lighter nuclei; from Figure 12.4, we see that this process can release energy as 
long as the final nucleus is less massive than about A = 60. 

For example, consider the reaction 


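$$ {}^{2}_{1}\mathrm{H}_{1} + {}^{2}_{1}\mathrm{H}_{1} \rightarrow {}^{3}_{1}\mathrm{H}_{2} + {}^{1}_{1}\mathrm{H}_{0} $$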


The Q value is 4.0 MeV, and so this nuclear reaction liberates about 1 MeV per 
nucleon, roughly the same as the fission reaction. This reaction can occur when 
a beam of deuterons is accelerated on to a deuterium target. In order to observe 
the reaction, we must get the incident and target deuterons close enough that the 
nuclear force can produce the reaction; that is, we must overcome the mutual 
Coulomb repulsion of the two particles. We can estimate this Coulomb repulsion 
by calculating the electrostatic repulsion of two deuterons when they are just 
touching. The radius of a deuteron is about 1.5 fm, and the electrostatic potential 
energy of the two charges separated by about 3 fm is about 0.5 MeV. A deuteron 
with 0.5 MeV of kinetic energy can overcome the Coulomb repulsion and initiate 
a reaction in which 4.5 MeV of energy (0.5 MeV of incident kinetic energy plus 
the 4-MeV Q value) is released. 
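
The 0.5-MeV barrier estimate can be reproduced from the point-charge potential energy U = Z₁Z₂e²/(4πε₀r). The sketch below uses the convenient combination e²/(4πε₀) = 1.44 MeV·fm, a standard value that is supplied here as an assumption, and the 3-fm separation quoted above.

```python
COULOMB_CONSTANT_MEV_FM = 1.44  # e^2 / (4 pi eps0), assumed standard value


def coulomb_energy_mev(z1: int, z2: int, separation_fm: float) -> float:
    """Electrostatic potential energy of two point charges z1*e and z2*e."""
    if separation_fm <= 0.0:
        raise ValueError("separation must be positive")
    return COULOMB_CONSTANT_MEV_FM * z1 * z2 / separation_fm


# Two deuterons (Z = 1 each) that are just touching, centers 3 fm apart.
barrier_mev = coulomb_energy_mev(1, 1, 3.0)
print(f"Coulomb barrier for two deuterons: {barrier_mev:.2f} MeV")
```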

Doing this reaction in a typical accelerator, in which the beam currents are 
typically in the microampere range, would produce only a small amount of energy 
(of the order of a few watts). To obtain significant amounts of energy from fusion, 
it is necessary to work with much larger quantities of deuterium. For example, 
the fusion energy from the deuterium in a liter of ordinary water (which contains 
0.015% D₂O) would be equivalent to the chemical energy obtained from burning 
about 300 liters of gasoline. 

A more promising approach consists of heating deuterium gas to a high enough 
temperature so that each atom of deuterium has about 0.25 MeV of thermal kinetic 
energy (hence the name thermonuclear fusion). Then in a collision between two 
deuterium atoms, the total of 0.5 MeV of kinetic energy would be sufficient to 
overcome the Coulomb repulsion. 

The difficulty with this approach is in heating the deuterium gas to a sufficient 
temperature; from the expression (3/2)kT for the thermal kinetic energy of a 
gas molecule, we can calculate that an energy of 0.25 MeV corresponds to a 
temperature of the order of 10⁹ K. Even assuming that barrier penetration (Section 
5.6) would allow a reasonable probability to penetrate the Coulomb barrier at 
lower kinetic energies (perhaps corresponding to one-tenth of the calculated 
temperature), it is hard to imagine conditions under which these temperatures 
can be created. However, such conditions do exist in the interiors of stars, which 
produce their energy through fusion reactions. Fusion processes thus support all 
life on Earth. Scientists and engineers who are seeking to develop fusion processes 
for electrical power generation face the challenge of duplicating, for a brief instant 
of time and on a much smaller scale, the conditions in the interior of stars. 
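
The temperature estimate in this paragraph amounts to inverting the relation between mean kinetic energy and temperature, E = (3/2)kT. A minimal sketch, assuming the Boltzmann constant k = 8.617 × 10⁻⁵ eV/K (a standard value supplied here):

```python
BOLTZMANN_EV_PER_K = 8.617e-5  # assumed standard value


def temperature_for_mean_energy(energy_ev: float) -> float:
    """Temperature at which the mean thermal kinetic energy (3/2)kT equals energy_ev."""
    return energy_ev / (1.5 * BOLTZMANN_EV_PER_K)


t_full = temperature_for_mean_energy(0.25e6)  # 0.25 MeV per deuterium atom
t_tenth = t_full / 10.0                       # allowance for barrier penetration
print(f"T for 0.25 MeV per particle: {t_full:.2e} K")
print(f"one-tenth of that value:     {t_tenth:.2e} K")
```

The first value is about 2 × 10⁹ K, in line with the order-of-magnitude figure quoted in the text.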


Fusion Processes in Stars 


In the basic fusion process that occurs in stars (including our Sun), four protons 
combine to make one ⁴He. Stars are composed of ordinary hydrogen rather than 
deuterium, so it is first necessary to convert the hydrogen to deuterium. This is 
done according to the reaction 


$$ {}^{1}_{1}\mathrm{H}_{0} + {}^{1}_{1}\mathrm{H}_{0} \rightarrow {}^{2}_{1}\mathrm{H}_{1} + e^{+} + \nu $$


This process involves converting a proton to a neutron and is analogous to 
the beta-decay processes discussed in Chapter 12. Once we have obtained ²H 
(deuterium), the next reaction that can occur is 


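$$ {}^{2}_{1}\mathrm{H}_{1} + {}^{1}_{1}\mathrm{H}_{0} \rightarrow {}^{3}_{2}\mathrm{He}_{1} + \gamma $$

followed by 

$$ {}^{3}_{2}\mathrm{He}_{1} + {}^{3}_{2}\mathrm{He}_{1} \rightarrow {}^{4}_{2}\mathrm{He}_{2} + 2\,{}^{1}_{1}\mathrm{H}_{0} $$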


Note that the first two reactions must occur twice in order to produce the two ³He 
we need for the third reaction; see the schematic diagram of Figure 13.12. We can 
write the net process as 


$$ 4\,{}^{1}_{1}\mathrm{H}_{0} \rightarrow {}^{4}_{2}\mathrm{He}_{2} + 2e^{+} + 2\nu + 2\gamma $$


For the calculation of the Q value in terms of atomic masses, four electrons 
must be added to the left side to make four neutral hydrogen atoms. To balance 
the reaction we must also add four electrons to the right side; two of these are 
associated with the ⁴He atom, and the other two can be combined with the two 
positrons according to the reaction e⁺ + e⁻ → 2γ, so that the additional gamma 
rays are available as energy from the reaction. The two positrons disappear in this 
process; the only masses remaining are four hydrogen atoms and the one helium 
atom, and so 

$$ Q = (m_i - m_f)c^2 = [4m({}^{1}\mathrm{H}) - m({}^{4}\mathrm{He})]c^2 $$
$$ = (4 \times 1.007825\ \mathrm{u} - 4.002603\ \mathrm{u})(931.5\ \mathrm{MeV/u}) = 26.7\ \mathrm{MeV} $$

FIGURE 13.12 Schematic diagram of processes in the fusion of protons to 
form helium. 


Each fusion reaction liberates about 26.7 MeV of energy. 

At what rate do these reactions occur in the Sun? About 1.4 × 10³ W of 
solar power is incident on each square meter of the Earth’s surface. At our 
distance of about 1.5 × 10¹¹ m from the Sun, its energy is spread over a sphere 
of area 4πr² = 28 × 10²² m², and thus the power output from the Sun is about 
4 × 10²⁶ W, which corresponds to about 2 × 10³⁹ MeV/s. Each fusion reaction 
liberates about 26 MeV, and thus there must be about 10³⁸ fusion reactions 
per second, consuming about 4 × 10³⁸ protons per second. (Don’t worry about 
running out of protons: the Sun’s mass is about 2 × 10³⁰ kg, which corresponds 
to about 10⁵⁷ protons, enough to burn for the next few billion years.) 
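
The chain of estimates in this paragraph can be followed step by step in a few lines. The sketch below uses the solar constant, the Earth-Sun distance, and the energy per reaction quoted in the text; the conversion 1 MeV = 1.602 × 10⁻¹³ J is a standard value supplied here. The variable names are only illustrative.

```python
import math

SOLAR_CONSTANT_W_PER_M2 = 1.4e3   # power per unit area at the Earth (from the text)
EARTH_SUN_DISTANCE_M = 1.5e11     # from the text
ENERGY_PER_REACTION_MEV = 26.7    # Q value of 4 protons -> He-4 (from the text)
MEV_TO_JOULE = 1.602e-13          # assumed standard conversion

# Total solar output: the flux at the Earth spread over a sphere of that radius.
sphere_area_m2 = 4.0 * math.pi * EARTH_SUN_DISTANCE_M**2
solar_power_w = SOLAR_CONSTANT_W_PER_M2 * sphere_area_m2

# Reactions per second needed to supply that power, and protons consumed.
reactions_per_s = solar_power_w / (ENERGY_PER_REACTION_MEV * MEV_TO_JOULE)
protons_per_s = 4.0 * reactions_per_s

print(f"sphere area:     {sphere_area_m2:.2e} m^2")
print(f"solar power:     {solar_power_w:.2e} W")
print(f"fusion rate:     {reactions_per_s:.2e} reactions/s")
print(f"protons per sec: {protons_per_s:.2e}")
```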

The sequence of reactions described above is called the proton-proton cycle 
and probably represents the source of the Sun’s energy. However, it is probably 
not the primary source of fusion energy in many stars, because the first reaction 
(in which two protons combine to form a deuteron), which is similar to beta decay, 
takes place only on a very long time scale (as we discuss in the next chapter), 
and is therefore very unlikely to occur. A more likely sequence of reactions is the 
carbon cycle: 


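$$ {}^{12}\mathrm{C} + {}^{1}\mathrm{H} \rightarrow {}^{13}\mathrm{N} + \gamma $$
$$ {}^{13}\mathrm{N} \rightarrow {}^{13}\mathrm{C} + e^{+} + \nu $$
$$ {}^{13}\mathrm{C} + {}^{1}\mathrm{H} \rightarrow {}^{14}\mathrm{N} + \gamma $$
$$ {}^{14}\mathrm{N} + {}^{1}\mathrm{H} \rightarrow {}^{15}\mathrm{O} + \gamma $$
$$ {}^{15}\mathrm{O} \rightarrow {}^{15}\mathrm{N} + e^{+} + \nu $$
$$ {}^{15}\mathrm{N} + {}^{1}\mathrm{H} \rightarrow {}^{12}\mathrm{C} + {}^{4}\mathrm{He} $$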





A symbolic diagram of the process is shown in Figure 13.13. Notice that the 
¹²C plays the role of catalyst; we neither produce nor consume any ¹²C in these 
reactions, but the presence of the carbon permits this sequence of reactions to take 
place at a much greater rate than the previously discussed proton-proton cycle. 
The net process is still described by 4 ¹H → ⁴He, and of course the Q value is 
the same. The Coulomb repulsion between H and C is larger than the Coulomb 
repulsion between two H nuclei, so more thermal energy and a correspondingly 
higher temperature are needed for the carbon cycle. The carbon cycle probably 
becomes important at a temperature of about 20 × 10⁶ K, while the Sun’s interior 
temperature is “only” 15 × 10⁶ K. 





FIGURE 13.13 Sequence of events in the carbon cycle. 


Fusion Reactors 


For a controlled thermonuclear reactor, several reactions could be used, such as 


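$$ {}^{2}\mathrm{H} + {}^{2}\mathrm{H} \rightarrow {}^{3}\mathrm{H} + {}^{1}\mathrm{H} \qquad Q = 4.0\ \mathrm{MeV} $$
$$ {}^{2}\mathrm{H} + {}^{2}\mathrm{H} \rightarrow {}^{3}\mathrm{He} + n \qquad Q = 3.3\ \mathrm{MeV} $$
$$ {}^{2}\mathrm{H} + {}^{3}\mathrm{H} \rightarrow {}^{4}\mathrm{He} + n \qquad Q = 17.6\ \mathrm{MeV} $$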
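
The three Q values listed above can be recovered from the mass difference between the initial and final particles, Q = (m_i − m_f)c². The sketch below is a rough check; the atomic masses are standard tabulated values supplied here as assumptions, and atomic (rather than nuclear) masses can be used because the electrons balance on both sides of each reaction.

```python
U_TO_MEV = 931.5  # energy equivalent of one atomic mass unit, in MeV

# Assumed tabulated atomic masses in u (neutron mass for "n").
MASS_U = {
    "n": 1.008665,
    "H1": 1.007825,
    "H2": 2.014102,
    "H3": 3.016049,
    "He3": 3.016029,
    "He4": 4.002603,
}


def q_value_mev(initial: list[str], final: list[str]) -> float:
    """Q = (sum of initial masses - sum of final masses) * c^2, in MeV."""
    m_initial = sum(MASS_U[name] for name in initial)
    m_final = sum(MASS_U[name] for name in final)
    return (m_initial - m_final) * U_TO_MEV


q_dd_proton = q_value_mev(["H2", "H2"], ["H3", "H1"])
q_dd_neutron = q_value_mev(["H2", "H2"], ["He3", "n"])
q_dt = q_value_mev(["H2", "H3"], ["He4", "n"])

print(f"D + D -> T + p    : {q_dd_proton:.2f} MeV")
print(f"D + D -> He-3 + n : {q_dd_neutron:.2f} MeV")
print(f"D + T -> He-4 + n : {q_dt:.2f} MeV")
```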


The third reaction, known as the D-T (deuterium-tritium) reaction, has the largest 
energy release and is perhaps the best candidate for a fusion reactor. When 
deuterium gas (or a deuterium-tritium mixture) is heated to a high temperature, 
the atoms become ionized; the resulting gas of hot, ionized particles is called 
a plasma. To increase the probability of collisions between the ions that would 
result in fusion, there are three requirements for the plasma: (1) a high density n, 
so that the particles have a high probability of collision; (2) a high temperature T, 
in the range of 10⁸ K, which increases the probability for the particles to penetrate 
their mutual Coulomb barrier; and (3) a long confinement time τ, during which 
the high temperature and density must be maintained. The first and third of these 
parameters can be combined using some fairly general considerations based on 
the power needed to heat the plasma (which is proportional to the density n) and 
the power derived from fusions in the plasma (which is proportional to n²τ). For 
the fusion power to exceed the input power, the product nτ must exceed a certain 
minimum value; this condition is 


$$ n\tau > 10^{20}\ \mathrm{s \cdot m^{-3}} \qquad (13.15) $$


which is known as Lawson’s criterion. The capability of a plasma to produce 
energy through fusion can be characterized by the value of its Lawson’s parameter 
nτ and its temperature T. 

The electrical repulsion of the ionized particles in a plasma tends to force the 
ions away from one another and toward the walls of their container, where they 
would lose energy in collisions with the cooler atoms of the walls. To maintain 
the density and temperature, two techniques are under development. In magnetic 
confinement, intense magnetic fields are used to trap the motion of the particles, 
and in inertial confinement, the plasma is heated and compressed so quickly that 
fusion occurs before the fuel can expand and cool. 


Magnetic confinement A magnetic field can confine a plasma because the 
charged particles spiral around the magnetic field lines. Figure 13.14 shows 
a toroidal magnetic confinement geometry. There are two contributions to the 
magnetic field: One is along the toroid axis and another is around the axis. The 
combination of these two fields gives a helical field along the toroid axis, and 
the charged particles are confined as they spiral about the field lines. This type of 
device is called a tokamak (from the Russian acronym for “toroidal magnetic 
chamber”). A current passed through the plasma serves both to heat the plasma 
and to create one of the magnetic field components. Figure 13.15 shows the 
Tokamak Fusion Test Reactor at Princeton University, which operated from 
1982 to 1997 and achieved an ion temperature of 5.1 × 10⁸ K and a fusion power 
level of 10.7 MW. This device came very close to reaching Lawson’s criterion 
with a plasma density of n = 10²⁰ particles/m³ (five orders of magnitude smaller 
than an ordinary gas) and a confinement time of τ = 0.2 s. 
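
How close this device came to Lawson's criterion can be seen by forming the product of density and confinement time and comparing it with the minimum of 10²⁰ s·m⁻³. The sketch below uses the density and confinement time quoted in this paragraph; the function names are only illustrative.

```python
LAWSON_MINIMUM_S_PER_M3 = 1.0e20  # minimum value of n * tau from Lawson's criterion


def lawson_parameter(density_per_m3: float, confinement_time_s: float) -> float:
    """Product of particle density and confinement time, in s/m^3."""
    return density_per_m3 * confinement_time_s


def required_confinement_time(density_per_m3: float) -> float:
    """Confinement time needed to reach the Lawson minimum at a given density."""
    return LAWSON_MINIMUM_S_PER_M3 / density_per_m3


tokamak_parameter = lawson_parameter(1.0e20, 0.2)
shortfall = LAWSON_MINIMUM_S_PER_M3 / tokamak_parameter
time_needed_s = required_confinement_time(1.0e20)

print(f"n * tau         = {tokamak_parameter:.1e} s/m^3")
print(f"shortfall       = factor of {shortfall:.0f}")
print(f"tau needed at this density = {time_needed_s:.1f} s")
```

With these numbers the product falls short of the minimum by a factor of about 5, which is the sense in which the device came close to the criterion.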

FIGURE 13.14 The toroidal geometry of plasma confinement. The ionized atoms 
circulate around the ring, trapped by the magnetic field lines. The coils produce 
a magnetic field along the axis of the toroid (dashed line). Another field 
component is produced by a current along the axis in the plasma. The two 
components of the field produce the helical field lines shown. 

FIGURE 13.16 The approach to breakeven and ignition in fusion reactors, shown 
as a plot of Lawson’s parameter against temperature. 





FIGURE 13.15 The inside of the Tokamak Fusion Test Reactor. The technician 
at the left gives a measure of the size of the toroidal chamber. (Dietmar 
Krause/Princeton Plasma Physics Laboratory.) 


The development of magnetic confinement devices has produced a steady 
march toward achieving a self-sustaining fusion reactor by increasing the values 
of both Lawson’s parameter nt and the temperature, as illustrated in Figure 13.16. 
Devices have closely approached “breakeven,” where the power produced by 
fusion reactions equals the power necessary to heat the plasma. A true self- 
sustaining reactor requires the attainment of “ignition,” where the power produced 
by fusion reactions can maintain the reactor with no external source of energy. 
The next generation of fusion reactor development is the ITER (originally, the 
International Thermonuclear Experimental Reactor), currently under construction 
in France as a collaboration among many nations and expected to be operational in 
the year 2016. The ITER is planned to produce fusion power levels that are 5—10 
times what is necessary to heat the plasma (that is, 5—10 times the “breakeven” 
condition). 


Inertial confinement Inertial confinement takes the opposite approach by 
compressing the fuel to high densities for very short confinement times. In one 
method, which is illustrated in Figure 13.17, a small pellet of D-T fuel is struck 
simultaneously from many directions by intense laser beams that first vaporize 
the pellet and convert it to a plasma, and then heat and compress it to the point 
at which fusion can occur. The laser pulses are very short, typically lasting only 
about 1 ns, and thus according to Lawson’s criterion the density must exceed 10²⁹ 
particles/m³. However, because of inefficiencies of the lasers and other losses, a 
self-sustaining laser fusion reactor must exceed this minimum by perhaps 2 to 3 
orders of magnitude. Figure 13.18 shows the target chamber of the National 
Ignition Facility at the Lawrence Livermore National Laboratory. Operating for 
the first time in 2010, it is designed so that a 2-mm diameter pellet of D-T is 
struck simultaneously by 192 laser beams that deliver an energy of 1 MJ in a pulse 
lasting a few ns, which is expected to compress the pellet to a central density that 
is 100 times that of lead. 

FIGURE 13.17 Inertial confinement fusion initiated by a laser. 





FIGURE 13.18 Workers inside the 10-m-diameter target chamber where 
192 laser beams (which enter the chamber through the circular ports) strike 
the target that is held by the positioning arm at the right. (Courtesy Lawrence 
Livermore National Laboratory.) 


In the D-T fusion reaction, most of the energy is carried by the neutrons 
(recall that in the fission reaction only a small fraction of the energy went to the 
neutrons). This presents some difficult problems for the recovery of the energy and 
its conversion into electrical power. One possibility for a fusion reactor design is 
shown in Figure 13.19. The reaction area is surrounded by lithium, which captures 
neutrons by the reaction 


$$ {}^{6}_{3}\mathrm{Li}_{3} + n \rightarrow {}^{4}_{2}\mathrm{He}_{2} + {}^{3}_{1}\mathrm{H}_{2} $$


The kinetic energies of the reaction products are rapidly dissipated as heat, and 
the thermal energy of the liquid lithium can be used to convert water to steam in 
order to generate electricity. This reaction has the added advantage of producing 
tritium (³H), which is needed as a fuel for the fusion reactor. 
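
The statement that the neutron carries most of the D-T energy follows from momentum conservation. If the initial kinetic energies are neglected, the two products leave with equal and opposite momenta, so the kinetic energy divides in inverse proportion to the masses. The sketch below is a nonrelativistic estimate under that assumption, using mass numbers 1 and 4 in place of exact masses and the Q value of 17.6 MeV for the D-T reaction.

```python
def two_body_energy_split(q_mev: float, mass_a: float, mass_b: float) -> tuple[float, float]:
    """Kinetic energies of products a and b for a breakup from rest.

    Equal and opposite momenta give K_a / K_b = mass_b / mass_a (nonrelativistic).
    """
    total_mass = mass_a + mass_b
    k_a = q_mev * mass_b / total_mass
    k_b = q_mev * mass_a / total_mass
    return k_a, k_b


Q_DT_MEV = 17.6
k_neutron, k_alpha = two_body_energy_split(Q_DT_MEV, mass_a=1.0, mass_b=4.0)
print(f"neutron: {k_neutron:.1f} MeV ({100.0 * k_neutron / Q_DT_MEV:.0f}% of Q)")
print(f"He-4:    {k_alpha:.1f} MeV ({100.0 * k_alpha / Q_DT_MEV:.0f}% of Q)")
```

Under these assumptions the neutron takes about four-fifths of the energy released, which is why a neutron-absorbing blanket such as lithium is needed to recover it as heat.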

One difficulty with the D-T fusion process is the large number of neutrons 
released in the reactions. Although fusion reactors will not produce the radioactive 
wastes that fission reactors do, the neutrons are sure to make radioactive the 
immediate area surrounding the reactor, and the structural damage to materials 
resulting from exposure to large fluxes of neutrons may weaken critical parts of 
the reactor vessel. Here once again the lithium is helpful, because a 1-m thickness 
of lithium should be sufficient to stop essentially all of the neutrons. 

FIGURE 13.19 Design of a fusion reactor. 

Fusion energy is the subject of vigorous research in many laboratories in the 
United States and around the world; the technological problems are being attacked 
with a variety of methods, and researchers are hopeful that solutions can be found 
during the next 20 years so that fusion can help to supply our electrical power needs. 





13.6 NUCLEOSYNTHESIS 


After a star’s hydrogen has been converted to helium through fusion reactions, 
gravitational collapse can occur that raises the temperature of the core of the star 
from about 10⁷ K to about 10⁸ K. At this point there is enough thermal kinetic 
energy to overcome the Coulomb repulsion of the helium nuclei, and helium fusion 
can begin. In this process three ⁴He are converted into ¹²C by the two-step process 


$$ {}^{4}\mathrm{He} + {}^{4}\mathrm{He} \rightarrow {}^{8}\mathrm{Be} $$
$$ {}^{8}\mathrm{Be} + {}^{4}\mathrm{He} \rightarrow {}^{12}\mathrm{C} $$


The first reaction is endothermic, with a Q value of −92 keV. The nucleus ⁸Be 
is unstable and decays back into two alpha particles in a time of the order of 
10⁻¹⁶ s. Even so, the Boltzmann factor e^(−ΔE/kT) suggests that at 10⁸ K there will 
be a small concentration of ⁸Be. The second reaction has a particularly large 
cross section; in spite of the rapid breakup of ⁸Be, there is still a good chance 
to form ¹²C. The net Q value for the process is 7.3 MeV, or about 0.6 MeV per 
nucleon, much less than the 6.7 MeV per nucleon produced by hydrogen burning. 
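
Two of the numbers in this paragraph are easy to check. The sketch below evaluates the Boltzmann factor for an energy cost of 92 keV at the two core temperatures mentioned, and the net Q value of the three-alpha process from atomic masses. The Boltzmann constant and the ⁴He mass are standard values supplied here as assumptions; the mass of ¹²C is exactly 12 u by definition.

```python
import math

BOLTZMANN_KEV_PER_K = 8.617e-8  # assumed standard value
U_TO_MEV = 931.5
M_HE4_U = 4.002603              # assumed tabulated atomic mass
M_C12_U = 12.0                  # exact by definition of the atomic mass unit


def boltzmann_factor(delta_e_kev: float, temperature_k: float) -> float:
    """Relative population exp(-dE/kT) of a state lying dE above the ground state."""
    return math.exp(-delta_e_kev / (BOLTZMANN_KEV_PER_K * temperature_k))


factor_hot = boltzmann_factor(92.0, 1.0e8)
factor_cool = boltzmann_factor(92.0, 1.0e7)

q_triple_alpha_mev = (3.0 * M_HE4_U - M_C12_U) * U_TO_MEV
q_per_nucleon_mev = q_triple_alpha_mev / 12.0

print(f"Boltzmann factor at 1e8 K: {factor_hot:.1e}")
print(f"Boltzmann factor at 1e7 K: {factor_cool:.1e}")
print(f"net Q for 3 He-4 -> C-12:  {q_triple_alpha_mev:.2f} MeV "
      f"({q_per_nucleon_mev:.2f} MeV per nucleon)")
```

The factor is small but not negligible at 10⁸ K, and vanishingly small at 10⁷ K, which is consistent with helium burning beginning only after the core has contracted and heated.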

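Once enough ¹²C has formed in the core, other alpha particle reactions become 
possible, such as 


$$ {}^{12}\mathrm{C} + {}^{4}\mathrm{He} \rightarrow {}^{16}\mathrm{O} $$
$$ {}^{16}\mathrm{O} + {}^{4}\mathrm{He} \rightarrow {}^{20}\mathrm{Ne} $$
$$ {}^{20}\mathrm{Ne} + {}^{4}\mathrm{He} \rightarrow {}^{24}\mathrm{Mg} $$


Each of these reactions is exothermic, releasing a few MeV of energy and 
contributing to the star’s energy production. At still higher temperatures (10⁹ K) 
carbon burning and oxygen burning begin: 


$$ {}^{12}\mathrm{C} + {}^{12}\mathrm{C} \rightarrow {}^{20}\mathrm{Ne} + {}^{4}\mathrm{He} $$
$$ {}^{16}\mathrm{O} + {}^{16}\mathrm{O} \rightarrow {}^{28}\mathrm{Si} + {}^{4}\mathrm{He} $$


Eventually ⁵⁶Fe is reached, at which point no further energy is gained by fusion 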
(Figure 12.4). 

If this explanation of the formation of elements is correct, we expect the 
abundances of the elements to have the following properties: 


1. Large relative abundances of the light, even-Z elements; small relative 
abundances of odd-Z elements. 

2. Little or none of the elements between He and C (Li, Be, B), which are not 
produced in these reactions. 

3. Large relative abundance of Fe, the end product of the fusion cycle. 


Figure 13.20 shows the relative abundances of the light elements in the solar 
system, and they are in agreement with all of the three above expectations. Each 
even-Z element is 10 to 100 times more abundant than its odd-Z neighbors; there 
is a prominent peak at Fe; the heavy elements with Z > 30 combined are less 
abundant than every element but one in the range C to Zn; and the three elements 
Li, Be, B are far less abundant than the elements in the range C to Zn. 

The light odd-Z elements can be produced by alternative reactions among the 
fusion products, for example: 


$$ {}^{12}\mathrm{C} + {}^{12}\mathrm{C} \rightarrow {}^{23}\mathrm{Na} + {}^{1}\mathrm{H} $$
$$ {}^{16}\mathrm{O} + {}^{16}\mathrm{O} \rightarrow {}^{31}\mathrm{P} + {}^{1}\mathrm{H} $$


FIGURE 13.20 Relative abundances (by weight) of the elements beyond helium in the 
solar system. 

The abundance of nitrogen is nearly equal to that of its neighbors C and O, 
which are the most abundant of elements beyond H and He; nitrogen has a greater 
abundance than any other odd-Z element shown, and greater than all even-Z 
elements with Z > 8. The formation of nitrogen must therefore be a relatively 
common process in stars. The element B is rare, so alpha particle reactions are of 
no help in forming nitrogen. The most likely sources of N are 


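$$ {}^{12}\mathrm{C} + {}^{1}\mathrm{H} \rightarrow {}^{13}\mathrm{N} + \gamma \qquad\qquad {}^{16}\mathrm{O} + {}^{1}\mathrm{H} \rightarrow {}^{17}\mathrm{F} + \gamma $$
$$ {}^{13}\mathrm{N} \rightarrow {}^{13}\mathrm{C} + e^{+} + \nu \qquad \text{and} \qquad {}^{17}\mathrm{F} \rightarrow {}^{17}\mathrm{O} + e^{+} + \nu $$
$$ {}^{13}\mathrm{C} + {}^{1}\mathrm{H} \rightarrow {}^{14}\mathrm{N} + \gamma \qquad\qquad {}^{17}\mathrm{O} + {}^{1}\mathrm{H} \rightarrow {}^{14}\mathrm{N} + {}^{4}\mathrm{He} $$


The stable isotopes ¹³C and ¹⁷O are found in natural carbon and oxygen with 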
abundances of 1.1% and 0.04%, which suggests that these reactions do indeed 
take place. 

The production of the elements beyond iron requires the presence of neutrons, 
which are not produced in the reactions we have listed so far, because neutrons 
are likely to be emitted only in reactions with nuclei that have an excess of 
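neutrons. If enough of the heavier isotopes, such as ¹³C, ¹⁷O, or ²¹Ne, are formed, 
the following reactions can produce neutrons: 


$$ {}^{13}\mathrm{C} + {}^{4}\mathrm{He} \rightarrow {}^{16}\mathrm{O} + n $$
$$ {}^{17}\mathrm{O} + {}^{4}\mathrm{He} \rightarrow {}^{20}\mathrm{Ne} + n $$
$$ {}^{21}\mathrm{Ne} + {}^{4}\mathrm{He} \rightarrow {}^{24}\mathrm{Mg} + n $$

How are the heavy elements built up by neutron capture? Consider the effect 
of neutron capture on ⁵⁶Fe: 

$$ {}^{56}\mathrm{Fe} + n \rightarrow {}^{57}\mathrm{Fe} \quad (\text{stable}) $$
$$ {}^{57}\mathrm{Fe} + n \rightarrow {}^{58}\mathrm{Fe} \quad (\text{stable}) $$
$$ {}^{58}\mathrm{Fe} + n \rightarrow {}^{59}\mathrm{Fe} \quad (t_{1/2} = 45\ \mathrm{d}) $$

What happens next depends on the number of available neutrons. If that number 
is small, the chances of ⁵⁹Fe encountering a neutron before it decays to ⁵⁹Co are 
small, and the process might continue as follows: 

$$ {}^{59}\mathrm{Fe} \rightarrow {}^{59}\mathrm{Co} + e^{-} + \bar{\nu} $$
$$ {}^{59}\mathrm{Co} + n \rightarrow {}^{60}\mathrm{Co} \quad (t_{1/2} = 5\ \mathrm{y}) $$
$$ {}^{60}\mathrm{Co} \rightarrow {}^{60}\mathrm{Ni} + e^{-} + \bar{\nu} $$
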
On the other hand, if the number of neutrons is very large, a different sequence 
might result: 


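$$ {}^{59}\mathrm{Fe} + n \rightarrow {}^{60}\mathrm{Fe} \quad (t_{1/2} = 3 \times 10^{5}\ \mathrm{y}) $$
$$ {}^{60}\mathrm{Fe} + n \rightarrow {}^{61}\mathrm{Fe} \quad (t_{1/2} = 6\ \mathrm{min}) $$
$$ {}^{61}\mathrm{Fe} \rightarrow {}^{61}\mathrm{Co} + e^{-} + \bar{\nu} $$
$$ {}^{61}\mathrm{Co} \rightarrow {}^{61}\mathrm{Ni} + e^{-} + \bar{\nu} $$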


If the density of neutrons is so low that the chance of encountering a neutron is, 
on the average, less than once every 45 days, the first process ought to dominate, 
with the production of ⁶⁰Ni. If the chance of encountering a neutron is more 
like once every few minutes, the second process should dominate, and no ⁶⁰Ni is 
produced. 

The first type of process, which occurs slowly and allows the nuclei time to 
beta decay, is known as the s process (s for slow); the second process occurs very 
rapidly and is known as the r process (r for rapid). 
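
The competition between beta decay and neutron capture that separates the s and r processes can be made quantitative with a simple rate argument. If capture and decay are treated as independent random processes with constant rates, the probability that a nucleus captures a neutron before it decays is the capture rate divided by the sum of the two rates. The sketch below applies this to ⁵⁹Fe (half-life 45 d); the two mean capture intervals, 10 years and 5 minutes, are hypothetical values chosen only to illustrate the slow and rapid regimes.

```python
import math

SECONDS_PER_DAY = 86400.0


def capture_before_decay_probability(half_life_s: float,
                                     mean_capture_interval_s: float) -> float:
    """Probability that neutron capture occurs before beta decay.

    Assumes two independent processes with constant rates:
    decay rate = ln 2 / half-life, capture rate = 1 / mean interval.
    """
    decay_rate = math.log(2.0) / half_life_s
    capture_rate = 1.0 / mean_capture_interval_s
    return capture_rate / (capture_rate + decay_rate)


half_life_fe59_s = 45.0 * SECONDS_PER_DAY

# Hypothetical neutron environments (illustrative assumptions only).
p_slow = capture_before_decay_probability(half_life_fe59_s, 10.0 * 365.0 * SECONDS_PER_DAY)
p_rapid = capture_before_decay_probability(half_life_fe59_s, 5.0 * 60.0)

print(f"one capture per ~10 y:  P(capture first) = {p_slow:.3f}")
print(f"one capture per ~5 min: P(capture first) = {p_rapid:.5f}")
```

In the first case nearly every ⁵⁹Fe nucleus decays to ⁵⁹Co before it can capture another neutron; in the second nearly every one captures first.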

Figure 13.21 illustrates how the r and s processes can proceed from ⁵⁶Fe. The 
s process never strays very far from the region of the stable nuclei, while the r 
process can produce many nuclei that have a large excess of neutrons. The larger 
the excess of neutrons, the shorter is the half-life of these nuclei. Eventually the 
half-life becomes so short that no neutron is captured before the beta decay occurs 
to the next higher Z. All nuclei produced in the r process will eventually decay 
toward the stable nuclei, generally moving by beta decays along the diagonal line 
of constant mass number A. 

Some stable nuclei are produced only through the s process, others are produced 
only through the r process, and some may be produced through both processes. 
Often the natural abundance of the isotopes of an element can suggest the relative 
roles of these two processes. In Figure 13.21, you can see that the stable isotope 
⁷⁰Zn cannot be produced in the s process, because the half-life of ⁶⁹Zn is too short 
(56 min). Other isotopes for which the r process is important are ⁷⁶Ge, ⁸²Se, ⁸⁶Kr, 
⁹⁶Zr, and ¹²²Sn. The isotope ⁶⁴Ni can be produced either through the s process 
(as shown in Figure 13.21) or through the r process (such as through beta decays 
beginning with ⁶⁴Fe). On the other hand, ⁶⁴Zn (the most abundant isotope of zinc) 
is produced only through the s process, because r-process beta decays proceeding 
along the A = 64 line are stopped at stable ⁶⁴Ni and cannot reach ⁶⁴Zn. 

The heaviest element that can be built up out of s-process neutron captures 
is ²⁰⁹Bi; the half-lives of the isotopes beyond ²⁰⁹Bi are too short to allow the s 
process to continue. The presence in nature of heavier elements such as thorium 
or uranium suggests that the r process must operate in this region as well. 

The r process most likely occurs during supernova explosions, following the 
breakdown and implosion of a star that has used up its fusion reserves. In a 
very short time, lasting of the order of seconds, the star implodes, produces an 
enormous flux of neutrons (perhaps 10³² n/cm²/s), and builds up all elements to 
about A = 260. When the final explosion occurs, these elements are hurled out 
into space, to become part of new star systems. The heavy atoms of which the 
Earth is made may have been produced in such an explosion. 

FIGURE 13.21 A section of the chart of the nuclides (Figure 12.11), showing the s- and r-process paths from ⁵⁶Fe. 
Shaded squares represent stable nuclei, and unshaded squares represent radioactive nuclei. Many r-process paths are 
possible, as the short-lived nuclei beta decay; only one of those possible paths is shown. All the nuclei in the r-process 
path are unstable and may beta decay toward the stable nuclei. 


13.7 APPLICATIONS OF NUCLEAR PHYSICS 


In this chapter, we have discussed how fission and fusion reactions can be used 
to generate electrical power, and in the previous chapter we discussed how the 
radioactive decay of various isotopes can be used to date the historical origin of 
material containing those isotopes. These are but a few of the many ways that 
nuclear decays and reactions can be applied to the solution of practical problems. 
In this section we discuss briefly some other applications of the techniques of 
nuclear physics. 


Neutron Activation Analysis 


Nearly every radioactive isotope emits characteristic gamma rays, and many 
chemical elements can be identified by their gamma ray spectra. For example, 
when ⁵⁹Co (the only stable isotope of cobalt) is placed in a flux of neutrons (such 
as is found near the core of a reactor), neutron absorption results in the production 
of the radioactive isotope ⁶⁰Co, which beta decays with a half-life of 5.27 years. 
Following the beta decay, ⁶⁰Ni emits two gamma rays of energies 1.17 MeV and 
1.33 MeV and of equal intensity. If we place in a flux of neutrons a material of 
unknown composition, and if we observe, following the neutron bombardment, 
two gamma rays of equal intensity and energies 1.17 MeV and 1.33 MeV, it is 
a safe bet that the unknown sample contained cobalt. In fact, from the rate of 
gamma emission we could deduce exactly how much cobalt the material contains, 
assuming that we know the neutron flux and the neutron capture cross section of 
⁵⁹Co. This technique is known as neutron activation analysis, and has been used 
in many applications in which the elements are present in such small quantities 
that chemical identification is not practical. Typically, neutron activation analysis 
can be used to identify elements in quantities of the order of 10⁻⁹ g, and sensitivity 
down to 10⁻¹² g is often possible. 

Such a sensitive and precise technique finds application in a variety of areas, in 
which the chemical composition must be determined for samples that are available 
only in microscopic quantities or that must be analyzed in a nondestructive manner. 
For example, the chemical composition of various types of pottery can help us 
trace the geographical origin of the clay from which they were made; such 
analyses of pottery shards can trace the trading routes of prehistoric people. 
Art forgeries can be detected by a knowledge of the chemical composition of 
paints, because techniques for producing pigments have changed over the last 
four centuries with corresponding changes in the level of impurities in paints. The 
chemical analysis of tiny quantities of material such as paint, gunshot residues, 
soil, or hair can provide important evidence in criminal investigations. Neutron 
activation analysis of samples of the hair of such historical figures as Napoleon 
or Newton has revealed the chemicals to which they were exposed centuries ago. 
An example of a neutron activation analysis study of a sample of hair is shown in 
Figure 13.22. 

FIGURE 13.22 Gamma-ray spectrum following neutron activation of a sample of 
human hair. The sample shows traces of mercury, gold, zinc, copper, arsenic, 
antimony, and manganese. [From D. DeSoete et al., Neutron Activation Analysis 
(Wiley Interscience, 1972).] 
Medical Radiation Physics


One of the most important applications of nuclear physics has been in medicine, 
for both diagnostic and therapeutic purposes. The use of X rays for producing 
images for medical diagnosis is well known, but X rays are of limited value. They 
show distinct and detailed images of bones, but they are generally less useful 
in making images of soft tissue. Radioactive isotopes can be introduced into the 
body in chemical forms that have an affinity for certain organs, such as bone 
or the thyroid gland. A sensitive detector (called a “gamma-ray camera”) can 
observe the radiations from the isotopes that are concentrated in the organ and can 
produce an image that shows how the activity is distributed in the patient. These 
detectors are capable of determining where each gamma-ray photon originates 
in the patient. Figure 13.23 shows an image of the brain, taken after the patient 
was injected with the radioactive isotope ⁹⁹ᵐTc (t₁/₂ = 6 h). The images clearly
show an area of the brain where the activity has concentrated. Ordinarily the brain 
does not absorb impurities from the blood, so such concentrations often indicate 
a tumor or other abnormality. 

Another technique that reveals a wealth of information is positron emission 
tomography (PET), in which the patient is injected with a positron-emitting isotope 
that is readily absorbed by the body. Examples of isotopes used are ¹⁵O (t₁/₂ =
2 min), ¹³N (t₁/₂ = 10 min), ¹¹C (t₁/₂ = 20 min), and ¹⁸F (t₁/₂ = 110 min). These
isotopes are produced with a cyclotron, and because of the short half-lives the 
cyclotron must be present at the site of the diagnostic facility. When a positron 
emitter decays, the positron quickly annihilates with an electron and produces 
two 511-keV gamma rays that travel in opposite directions. By surrounding the 
patient with a ring of detectors, it is possible to determine exactly where the decay 
occurred, and from a large number of such events, the physician can produce an 
image that reconstructs the distribution of the radioisotope in the patient. One 
advantage of the PET scan over X-ray techniques such as the CAT (computerized 
axial tomography) scan is that it can produce a dynamic image—changes in the 
patient during the measuring time can be observed. Figure 13.24 shows a brain 
scan of a patient who was injected with glucose labeled with ¹⁸F. Active areas of
the brain metabolize glucose more rapidly, and so they accumulate more
¹⁸F, allowing medical workers to observe regions of the brain associated
with different mental activities. 
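The need for an on-site cyclotron follows directly from the half-lives quoted above. As a rough sketch, the snippet below computes the fraction of each positron emitter that survives a transport delay; the 30-minute delay is a hypothetical value chosen only for illustration.

```python
# Half-lives in minutes, as quoted in the text for the PET isotopes.
HALF_LIVES_MIN = {"O-15": 2.0, "N-13": 10.0, "C-11": 20.0, "F-18": 110.0}


def fraction_remaining(half_life_min, elapsed_min):
    """Fraction of a radioactive sample left after elapsed_min minutes."""
    return 0.5 ** (elapsed_min / half_life_min)


delay_min = 30.0  # hypothetical time between production and injection
for isotope, t_half in HALF_LIVES_MIN.items():
    left = fraction_remaining(t_half, delay_min)
    print(f"{isotope}: {left:.3g} of the original activity remains")
```

With these numbers the shortest-lived isotope is essentially gone after half an hour, while most of the longest-lived one survives, which suggests why only the latter can tolerate any significant delay.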

FIGURE 13.23 Scintillation camera image of the brain, following intravenous
injection of 20 mCi of ⁹⁹ᵐTc. (a) Side view, with the patient’s face to the left.
(b) Back view. The bright circular spot shows concentration of blood in a lesion,
possibly a tumor. Other bright areas show the scalp and the major veins.

FIGURE 13.24 PET scan showing different areas of the brain that are active
when either hearing words or seeing words.

Rosalyn Yalow (1921-2011, United States). After receiving her Ph.D. in nuclear
physics, she researched the medical applications of radioactive isotopes. Yalow
developed the technique of radioimmunoassay, which uses radioactive tracers to
measure small amounts of substances in the blood or other fluids. Her development
of this technique was recognized with the award of the Nobel Prize in medicine
in 1977.

Radiation therapy takes advantage of the effect of radiations in destroying
unwanted tissue in the body, such as a cancerous growth or an overactive thyroid
gland. The effect of the passage of radiation through matter is often to ionize the
atoms. The ionized atoms can then participate in chemical reactions that lead to
their incorporation into molecules and subsequent alteration of their biological
function, possibly the destruction of a cell or the modification of its genetic
material. For example, an overactive thyroid gland is often treated by giving the
patient radioactive ¹³¹I, which collects in the thyroid. The beta emissions from
this isotope damage the thyroid cells and ultimately lead to their destruction.
Certain cancers are treated by implanting needles or wires containing radium or
other radioactive substances. The decays of these radioisotopes cause localized
damage to the cancerous cells.

Other cancers can be treated using beams of particles that cause nuclear 
reactions within the body at the location of the tumor. Pions and neutrons are 
used for this purpose. The absorption of a pion or a neutron by a nucleus causes 
a nuclear reaction, and the subsequent emission of particles or decays by the 
reaction products again causes local damage that is concentrated at the site of the 
tumor, inflicting maximum damage to the tumor and minimum damage to the 
surrounding healthy tissue. 


Alpha-Scattering Applications 


Radioactive sources emitting alpha particles have been used in a variety of 
applications. Most of these take advantage of the persistence of radioactive 
decay—the decays can be depended upon to occur at a fixed rate in any location. 

Alpha particles from radioactive decay can be absorbed and their energy 
converted into another form, such as electrical power obtained through 
thermoelectric conversion. The power levels are not large (of the order of 1 W per 
gram of material; see Problem 31), but they are sufficient to power many devices, 
from cardiac pacemakers to the Voyager spacecraft, which photographed Jupiter, 
Saturn, and Uranus. 
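The quoted power level can be checked with a short estimate: the power per gram is the decay rate per gram times the energy released per decay. The sketch below assumes the source is ²³⁸Pu with a half-life of 88 y and a decay energy of about 5.59 MeV; the choice of isotope and the decay energy are assumptions supplied here for illustration, and the function name is hypothetical.

```python
import math

AVOGADRO = 6.02214076e23    # atoms per mole
MEV_TO_J = 1.602176634e-13  # joules per MeV
YEAR_S = 3.156e7            # seconds per year


def decay_power_per_gram(half_life_y, q_mev, molar_mass):
    """Thermal power in watts released by 1 g of a radioactive source."""
    decay_const = math.log(2) / (half_life_y * YEAR_S)  # decays/s per atom
    atoms_per_gram = AVOGADRO / molar_mass
    return decay_const * atoms_per_gram * q_mev * MEV_TO_J


# Assumed values: half-life 88 y, Q about 5.59 MeV, molar mass 238 g/mol
power = decay_power_per_gram(88.0, 5.59, 238.0)
print(f"{power:.2f} W per gram")
```

The estimate ignores conversion efficiency, so the electrical power actually delivered by a thermoelectric converter would be a fraction of this thermal figure.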

Scattering of alpha particles emitted by a radioactive source is the basis of 
operation of ionization-type smoke detectors; alpha particles from the decay of 
²⁴¹Am are scattered by the ionized atoms that result from combustion. When the
smoke detector senses a decrease in the rate at which alphas are counted (due to 
some of them being scattered away from the detector), the alarm is triggered. 

Other applications of alpha-particle scattering are used for materials analysis. 
In Rutherford backscattering, the analysis uses the reduction in energy of an alpha 
particle that is scattered through an angle of 180°. Although our discussion of 
Rutherford scattering in Chapter 6 assumed that the target nucleus was infinitely 
heavy and thus acquired no energy in the scattering, in practice a small amount of 
energy is given even to a heavy nucleus. By allowing the target nucleus to recoil, 
we can find the loss in energy ΔK of an alpha particle of kinetic energy K that
scatters through 180° (see Problem 29):

$$\Delta K = K\left[\frac{4m/M}{(1 + m/M)^2}\right] \qquad (13.16)$$


where m is the mass of the alpha particle and M is the mass of the target nucleus. 
For a heavy nucleus (m/M = 0.02), the loss in energy is of order 0.5 MeV, 
which is easily measurable. Figure 13.25 shows a sample of the spectrum of 
alpha particles backscattered from a thin foil containing copper, silver, and gold. 
Note the Z² dependence of the scattering probability that characterizes Rutherford
scattering (see Eq. 6.14), and also note the sensitivity of the technique even to the 
two naturally occurring isotopes of copper. (However, the two isotopes of silver 
cannot be resolved.) The Surveyor spacecraft that landed on the Moon and the 
FIGURE 13.25 Backscattering spectrum of 2.5-MeV α particles from a thin
film of copper, silver, and gold. The dashed line shows the Z² behavior of the
cross section expected from the Rutherford formula. Note the appearance of the
two isotopes of copper. [From M.-A. Nicolet, J. W. Mayer, and I. V. Mitchell,
Science 177, 841 (1972). Copyright ©1972 by the AAAS.]


Viking landers on Mars carried Rutherford backscattering experiments to analyze 
the chemical composition of the surface of those bodies. 
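To see how the backscattering energy loss separates elements and isotopes, Eq. 13.16 can be evaluated for the targets of Figure 13.25. This is a sketch only: it approximates each mass by its mass number (an assumption made here for simplicity), and the function name is hypothetical.

```python
def backscatter_energy_loss(k_mev, m, big_m):
    """Energy loss (MeV) for 180-degree scattering, following Eq. 13.16.

    k_mev is the incident kinetic energy; m and big_m are the projectile
    and target masses in the same (arbitrary) units.
    """
    ratio = m / big_m
    return k_mev * 4.0 * ratio / (1.0 + ratio) ** 2


ALPHA_MASS = 4  # mass number used as an approximate mass
TARGETS = {"Cu-63": 63, "Cu-65": 65, "Ag-107": 107, "Ag-109": 109, "Au-197": 197}

for name, mass_number in TARGETS.items():
    loss = backscatter_energy_loss(2.5, ALPHA_MASS, mass_number)
    print(f"{name}: loss = {loss:.3f} MeV, scattered energy = {2.5 - loss:.3f} MeV")
```

The difference in loss between neighboring isotopes shrinks as the target mass grows, which is consistent with the remark that the copper isotopes are resolved while the silver isotopes are not.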


Superheavy Elements 


The known atoms beyond uranium (Z = 92) are all radioactive, with half-lives 
short compared with the age of the Earth. They are therefore not present in 
terrestrial matter, but they can be produced in the laboratory. The production 
process for the series of elements beginning with neptunium (Z = 93), called 
transuranic elements, follows the same process outlined in Section 13.6: neutron 
capture followed by beta decay. Using similar techniques researchers have 
produced elements up to Z = 100 (fermium). Beyond this point, too few atoms 
are produced for neutron capture to reveal the presence of the next element. 
Instead, reactions with accelerated charged particles are used. 

Many of the elements in this series have half-lives of only minutes or seconds, 
and thus the production and identification of these elements requires painstaking 
experimental efforts—the isotopes are often produced in quantities of a few 
atoms! Although most of these elements have not been produced in sufficient 
quantity to study their chemical properties, it is expected that their place in the 
periodic table will be as shown in Figure 13.26, up to the inert gas with Z = 118. 
All elements up to 118 have been observed (although names for elements beyond 
Z = 112 have not yet been chosen—the symbols shown in Figure 13.26 represent 
placeholder names). 

The extreme instability of these transuranic elements results from the increased 
Coulomb repulsion of the nuclear protons as Z increases; these elements decay 
by alpha decay or by spontaneous fission. However, strong theoretical evidence 
FIGURE 13.26 New massive elements in the periodic table.


suggests that elements around Z = 114, N = 184 should be stable against alpha 
decay, beta decay, and spontaneous fission. This region around element 114 is 
often called the “island of stability.” 

Although the production and observation of nuclei of such superheavy elements
are not likely to have any immediate applications, their study would be of great
interest: comparing their chemical and physical properties with those of the 5d
and 6p elements would test our understanding of the ordering of the periodic
table. Many of the artificially
produced transuranic elements have already found applications in research and 
technology. The alpha decays of 7°*Pu and 73?Pu have been used as power sources 
for spacecraft, and ²⁴¹Am serves as an alpha source for smoke detectors. The
radioisotope ²⁵²Cf decays by spontaneous fission; the neutrons released in the
decay have many applications, including medical treatment and materials analysis. 
Questions

1. The cross sections for reactions induced by protons generally increase as the
kinetic energy of the proton increases, while cross sections for neutron-induced
reactions generally decrease with increasing neutron energy. Explain this behavior.

2. Cross sections for reactions induced by thermal neutrons (K ~ kT, where T is
room temperature) are often several orders of magnitude larger than cross sections
for the same reaction induced by fast neutrons (K ~ MeV). Justify this difference
by comparing the time spent in the vicinity of a target nucleus by a thermal
neutron and a fast neutron.

3. When two nuclei approach one another in a nuclear reaction, there is a Coulomb
repulsion between them. Does this potential energy affect the kinematics of the
reaction? Does it affect the cross section?

4. The most abundant component of our atmosphere is ¹⁴N. Assuming that cosmic
rays supply sufficient high-energy protons and neutrons, explain how the
radioactive isotopes ¹⁴C and ³H can be formed.

5. Consider the photodisintegration reaction A + γ → B + C. In terms of the
binding energies of A, B, and C, what are the requirements for this reaction?
Would you expect to observe photodisintegration more readily for light nuclei
(A < 56) or heavy nuclei (A > 56)?

6. Would you expect to observe the radiative capture of an alpha particle
X + α → X′ + γ for heavy nuclei?

7. In what sense is photodisintegration (see Question 5) the inverse of radiative
capture (see Question 6)? How are the photon energies related?

8. Comment on the following statement: The fission reaction is useful for energy
production because of the large kinetic energies given to the neutrons emitted
following fission.

9. ²³⁸U is fissionable, but only with neutrons in the MeV range. Explain why ²³⁸U
is not a suitable reactor fuel.

10. What is the difference between a slow neutron and a delayed neutron? Between
a fast neutron and a prompt neutron?

11. In a typical fission reaction, which fragment (heavier or lighter) has the
larger kinetic energy? The larger momentum? The larger speed?

12. When charged particles travel in a medium faster than light travels in that
medium, Cerenkov radiation is emitted. This is the origin of the blue glow of the
water that surrounds a reactor core. What might be the identity of these charged
particles from a reactor? What velocities and kinetic energies do they need to
produce Cerenkov radiation? (The index of refraction of water is 1.33.)

13. In general, would you expect fission fragments to decay by positive or
negative beta decay? Why?

14. Among the fission products that build up in reactor fuel elements is xenon,
which has an extremely large neutron capture cross section (see Section 13.1).
What effect does this buildup have on the operation of the reactor?

15. Helium has virtually no neutron absorption cross section. Would helium be a
better reactor moderator than carbon, which has a small but nonzero cross section?

16. Estimate the number of fissions per second that must occur in a 1000-MW power
plant, assuming a 30% efficiency of energy conversion.

17. The fission cross section for ²³⁵U for slow neutrons is about 10⁶ times the
fission cross section of ²³⁸U for slow neutrons, yet for fast neutrons the fission
cross sections of ²³⁵U and ²³⁸U are roughly the same. Explain this effect.

18. Consider two fragments of uranium fission with atomic numbers Z and 92 − Z.
Estimating their mass numbers as 2.5 times their atomic numbers, find an
expression for the Coulomb potential energy of the two fragments when they are
just touching, and show that this expression is maximized when the two fragments
are identical. Why then is the fission fragment mass distribution, Figure 13.9,
not a maximum at A = 118?

19. Assume that ²³⁵U splits into two fragments with mass numbers of 90 and 145,
with each fragment having roughly the same ratio of Z/A as ²³⁵U. On this basis,
explain why neutrons are emitted in fission.

20. Why is it necessary to convert a proton to a neutron in the first step of the
proton-proton fusion cycle? Why can’t two protons fuse directly?

21. Explain why a fusion reactor requires a high particle density, a high
temperature, and a long confinement time.

22. In the argument leading to Lawson’s criterion, Eq. 13.15, it was mentioned
that the power necessary to heat the plasma is proportional to the particle
density n, while the power obtained from fusion is proportional to n². Explain
these two proportionalities.

23. Why do radioactive decay power sources use alpha emitters rather than beta
emitters?


Problems

13.1 Types of Nuclear Reactions

1. Fill in the missing particle in these reactions:
(a) ⁴He + ¹⁴N → ¹⁷O + ___
(b) ⁹Be + ⁴He → ¹²C + ___
(c) ²⁷Al + ⁴He → n + ___
(d) ¹²C + ___ → ¹³N + n

2. In a certain nuclear reaction, outgoing protons are observed with energies
16.2 MeV, 14.8 MeV, 11.6 MeV, 8.9 MeV, and 6.7 MeV. No energies higher than
16.2 MeV are observed. Construct a level scheme of the product nucleus.
3. In order to determine the cross section for neutron capture, you are
irradiating a thin gold foil, in the form of a circular disk of diameter 3.0 mm
and thickness 1.81 μm, with neutrons to produce the reaction
n + ¹⁹⁷Au → ¹⁹⁸Au + γ. By observing the outgoing gamma-ray photons in a detector,
you determine that the gold decays at a rate of 5.37 × 10⁶ per second. From an
independent measurement, you have determined the neutron flux to be
7.25 × 10¹⁰ neutrons/cm²/s. What value do you deduce for the cross section for
this reaction?

4. The element cobalt is commonly used for measuring the intensity of neutron
beams through the reaction n + ⁵⁹Co → ⁶⁰Co + γ. By observing the radioactive
decay of ⁶⁰Co, it is possible to deduce the rate at which it is produced in the
reaction. The cross section for this reaction is 37.0 b. A thin disk of Co-Al
alloy has a diameter of 1.00 cm and a mass of 46 mg; the alloy contains 0.44% Co
by weight. With neutrons spread uniformly over the surface of the foil, it is
concluded that ⁶⁰Co is produced at the rate of 1.07 x 10! per second. What is the
rate at which neutrons strike the target?

5. A beam of 20.0 μA of protons is incident on 2.0 cm² of a target of ¹⁰⁷Ag of
thickness 4.5 μm, producing the reaction p + ¹⁰⁷Ag → ¹⁰⁵Cd + 3n. Neutrons are
observed at a rate of 8.5 x 10° per second. What is the cross section for this
reaction at this proton energy?

6. A beam of alpha particles is incident on a target of Cu, resulting in the
reaction α + Cu → Ga + n. Assume the cross section for the particular alpha
energy to be 1.25 b. The target is in the form of a foil, 2.5 μm thick. The beam
has a circular cross section of diameter 0.50 cm and a current of 7.5 A. Find the
rate of neutron emission.


13.2 Radioisotope Production in Nuclear Reactions

7. A radioactive isotope of half-life t₁/₂ is produced in a nuclear reaction.
What fraction of the maximum possible activity is produced in an irradiation time
of (a) t₁/₂; (b) 2t₁/₂; (c) 4t₁/₂?

8. List five nuclear reactions, consisting of a light stable projectile nucleus
(mass 4 or less) incident on a heavy stable target nucleus, that can produce the
radioactive nucleus *°Co.

9. Show that Eq. 13.5 is a solution to Eq. 13.4.

10. The radioisotope ¹⁵O (t₁/₂ = 122 s) is used to measure respiratory function.
Patients inhale the gas, which is made by irradiating nitrogen gas with deuterons
(²H). Consider a cubical cell measuring 1.24 cm on each edge, which holds nitrogen
gas at a pressure of 2.25 atm and a temperature of 293 K. One face of the cube is
uniformly irradiated with a deuteron beam having a current of 2.05 A. At the
chosen deuteron energy, the reaction cross section is 0.21 b. (a) At what rate is
¹⁵O produced in the cell? (b) After an irradiation lasting for 60.0 s, what is
the activity of ¹⁵O in the cell?

11. Neutron capture in sodium occurs with a cross section of 0.53 b and leads to
radioactive ²⁴Na (t₁/₂ = 15 h). What is the activity that results when 1.0 μg of
Na is placed in a neutron flux of 2.5 × 10¹³ neutrons/cm²/s for 4.0 h?


13.3 Low-Energy Reaction Kinematics

12. Derive Eq. 13.14 from Eq. 13.13.

13. Find the Q value of the reactions:
(a) p + ⁵⁵Mn → ⁵⁴Fe + 2n
(b) ³He + ⁴⁰Ar → ⁴¹K + ²H

14. Find the Q value of the reactions:
(a) ⁶Li + n → ³H + ⁴He
(b) p + ²H → 2p + n
(c) ⁷Li + ²H → ⁸Be + n

15. In the reaction ²H + ³He → p + ⁴He, deuterons of energy 5.000 MeV are
incident on ³He at rest. Both the proton and the alpha particle are observed to
travel along the same direction as the incident deuteron. Find the kinetic
energies of the proton and the alpha particle.

16. (a) What is the Q value of the reaction p + ⁴He → ²H + ³He? (b) What is the
threshold energy for protons incident on ⁴He at rest? (c) What is the threshold
energy if ⁴He are incident on protons at rest?


13.4 Fission

17. (a) Find the Q value of the fission decay ²⁵⁴Cf → ¹²⁷In + ¹²⁷In, in which
²⁵⁴Cf splits in half. (b) Find the Q value for the more probable fission process
²⁵⁴Cf → ¹⁴⁰Xe + ¹¹⁰Ru + 4n. Masses are: m(¹²⁷In) = 126.917353 u,
m(¹⁴⁰Xe) = 139.921641 u, m(¹¹⁰Ru) = 109.914136 u.

18. Find the energy released in the fission of 1.00 kg of uranium that has been
enriched to 3.0% in the isotope ²³⁵U.

19. We can understand why ²³⁵U is readily fissionable, and ²³⁸U is not, with the
following calculation. (a) Find the energy difference between ²³⁵U + n and ²³⁶U.
We can regard this as the “excitation energy” of ²³⁶U. (b) Repeat for ²³⁸U + n
and ²³⁹U. (c) Comparing your results for (a) and (b), explain why ²³⁵U will
fission with very low energy neutrons, while ²³⁸U requires fast neutrons of 1 to
2 MeV of energy to fission. (d) From a similar calculation, predict whether ²³⁹Pu
requires low-energy or higher-energy neutrons to fission.

20. Find the Q value (and therefore the energy released) in the fission reaction
²³⁵U + n → ⁹³Rb + ¹⁴¹Cs + 2n. Use m(⁹³Rb) = 92.922042 u and
m(¹⁴¹Cs) = 140.920046 u.


13.5 Fusion

21. (a) Calculate the Q value for the six reactions or decays of the carbon cycle
of fusion. (b) By accounting for the electron masses, show that the total Q value
for the carbon cycle is identical with that of the proton-proton cycle.

22. Show that the D-T fusion reaction releases 17.6 MeV of energy.

23. In the D-T fusion reaction, the kinetic energies of ²H and ³H are small,
compared with typical nuclear binding energies. (Why?) Find the kinetic energy of
the emitted neutron.

24. (a) If a tokamak fusion reactor were able to achieve a confinement time of
0.60 s, what minimum particle density is required? (b) If the reactor were able
to achieve 10 times the density found in part (a), what is the minimum plasma
temperature required for ignition of a self-sustaining fusion reaction?


13.6 Nucleosynthesis

25. Find the energy released when three alpha particles combine to form ¹²C.

26. To what temperature must helium gas be heated before the Coulomb barrier is
overcome and fusion reactions begin?

27. Trace the path of the s process from the stable isotope Cu to the stable
isotope ⁷⁵As, showing the neutron capture and beta decay processes.

28. Show how the s process proceeds from stable ⁸¹Br to stable Mo.

Applications of Nuclear Physics 


29. An alpha particle of mass m makes an elastic head-on collision with an atom
of mass M at rest. Show that the loss in kinetic energy of the alpha particle is
given by Eq. 13.16.

30. (a) Calculate the energy loss of a 2.50-MeV alpha particle after
backscattering from an atom of copper, silver, and gold. Compare your calculated
values with the peak energies in Figure 13.25. (b) Calculate the expected energy
difference between the peaks for the two isotopes of copper and also for the two
isotopes of silver. Explain why the silver peaks are closer together than the
copper peaks. Can you estimate the relative abundances of the two isotopes of
copper from the figure?

31. A radioactive source is to be used to produce electrical power from the alpha
decay of ²³⁸Pu (t₁/₂ = 88 y). (a) What is the Q value for the decay?
(b) Assuming 100% conversion efficiency, how much power could be obtained from
the decay of 1.0 g of ²³⁸Pu?


General Problems

32. A small sample of paint is placed in a neutron flux of
3.0 × 10¹² neutrons/cm²/s for a period of 2.5 min. At the end of that period the
activity of the sample is found to include 105 decays/s of ⁵¹Ti (t₁/₂ = 5.8 min)
and 12 decays/s of ⁶⁰Co (t₁/₂ = 5.27 y). Find the amount, in grams, of titanium
and cobalt in the original sample. Use the following information: Cobalt is pure
⁵⁹Co, which has a cross section of 19 b; titanium is 5.25 percent ⁵⁰Ti, which has
a cross section of 0.14 b.

33. A 2.0-mg sample of copper (69% ⁶³Cu, 31% ⁶⁵Cu) is placed in a reactor where
it is exposed to a neutron flux of 5.0 × 10¹² neutrons/cm²/s. After 10.0 min the
resulting activities are 72 μCi of ⁶⁴Cu (t₁/₂ = 12.7 h) and 1.30 mCi of ⁶⁶Cu
(t₁/₂ = 5.1 min). Find the cross sections of ⁶³Cu and ⁶⁵Cu.

34. A beam of neutrons of intensity I is incident on a thin slab of material of
area A, thickness dx, density ρ, and atomic weight M. The neutron absorption
cross section is σ. (a) What is the loss in intensity dI of this beam in passing
through the material? (b) A beam of original intensity I₀ passes through a
thickness x of the material. Show that the intensity of the emerging beam is
$I = I_0 e^{-n\sigma x}$, where n is the number of absorber nuclei per unit
volume. (c) Assume that the total cross section for neutrons incident on copper
is 5.0 b. What fraction of the intensity of a neutron beam is lost after
traveling through copper of thickness 1.0 mm? 1.0 cm? 1.0 m?

35. A reaction in which two particles join to form a single excited nucleus,
which then decays to its ground state by photon emission, is known as radiative
capture. Find the energy of the gamma ray emitted in the radiative capture of an
alpha particle by ⁷Li. Assume alpha particles of very small kinetic energy are
incident on ⁷Li at rest.

36. How much energy is required (in the form of gamma-ray photons) to break up
⁷Li into ³H + ⁴He? This reaction is known as photodisintegration.

37. The nucleus ¹¹³Cd captures a thermal neutron (K = 0.025 eV), producing ¹¹⁴Cd
in an excited state; the excited state of ¹¹⁴Cd decays to the ground state by
emitting a photon. Find the energy of the photon.

38. When a neutron collides head-on with an atom at rest, the loss in its kinetic
energy is given by Eq. 13.16. (a) What fraction of its energy will a neutron lose
in a head-on collision with an atom of hydrogen, deuterium, or carbon?
(b) Consider a neutron with an initial energy of 2.0 MeV. How many head-on
collisions must it make with carbon atoms for its energy to be reduced to the
thermal range (0.025 eV)? (c) Is the result of part (b) an underestimate or an
overestimate of the actual number of collisions necessary to “thermalize” the
neutrons? Explain.

39. Suppose we have 100.0 cm³ of water, which is 0.015% D₂O. (a) Compute the
energy that could be obtained if all the deuterium were consumed in the
²H + ²H → ³H + p reaction. (b) As an alternative, compute the energy released if
two-thirds of the deuterium were fused to form ³H, which is then combined with
the remaining one-third in the D-T reaction.

40. (a) Find the Q value of the reaction ⁴He + ⁴He → ⁸Be. (b) In a gas of ⁴He at
a temperature of 10⁸ K, estimate the relative amount of ⁸Be present.
The search for the basic building blocks of nature has occupied the thoughts of
scientific investigators since the Greeks introduced the idea of atomism 2500 years
ago. As we look carefully at complex structures, we find underlying symmetries
and regularities that help us to understand the laws that determine how they
are put together. The regularities of crystal structure, for example, suggest to us 
that the atoms of which the crystal is composed must follow certain rules for 
arranging themselves and joining together. As we look more deeply, we find that 
although nature has constructed all material objects out of roughly 100 different 
kinds of atoms, we can understand these atoms in terms of only three particles: 
the electron, proton, and neutron. Our attempts to look further within the electron 
have been unsuccessful—the electron seems to be a fundamental particle, with 
no internal structure. However, when nucleons collide at high energy, the result 
is more complexity rather than simplicity; hundreds of new particles can emerge 
as products of these reactions. If there are hundreds of basic building blocks, it 
seems unlikely that we could ever uncover any fundamental dynamic laws of their 
behavior. However, experiments show a new, underlying regularity that can be 
explained in terms of a small number of truly fundamental particles called quarks. 

In this chapter, we examine the properties of many of the particles of physics, the 
laws that govern their behavior, and the classifications of these particles. We also 
show how the quark model helps us to understand some properties of the particles. 


14.1 THE FOUR BASIC FORCES 


All of the known forces in the universe can be grouped into four basic types. 
In order of increasing strength, these are: gravitation, the weak interaction, 
electromagnetism, and the strong interaction. 


1. The Gravitational Interaction — Gravity is of course exceedingly important 
in our daily lives, but on the scale of fundamental interactions between particles 
in the subatomic realm, it is of no importance at all. To give a relative figure, 
the gravitational force between two protons just touching at their surfaces is 
about 10⁻³⁸ of the strong force between them. The principal difference between
gravitation and the other interactions is that, on the practical scale, gravity is 
cumulative and infinite in range. Tiny gravitational interactions, such as the force 
exerted by one atom of the Earth on one atom of your body, combine to produce 
observable effects. The other forces, while much stronger than gravity at the 
microscopic level, do not affect objects on the large scale, either because they 
have a short range (the strong and weak forces) or their effect is negated by 
shielding (electromagnetism). 
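The weakness of gravity at the subatomic scale can be illustrated with a short calculation. Because the gravitational and Coulomb forces between two protons both fall off as the inverse square of the separation, their ratio does not depend on distance. The sketch below compares gravity with the electromagnetic force rather than with the strong force, since the latter has no simple closed form; the constants are rounded standard values and the function name is hypothetical.

```python
G_NEWTON = 6.674e-11     # gravitational constant, N m^2 / kg^2
K_COULOMB = 8.988e9      # Coulomb constant, N m^2 / C^2
PROTON_MASS = 1.6726e-27 # kg
E_CHARGE = 1.6022e-19    # C


def gravity_to_coulomb_ratio(mass_kg, charge_c):
    """Ratio of gravitational to electrostatic force for two identical
    particles; the 1/r^2 factors cancel, so no separation is needed."""
    return (G_NEWTON * mass_kg ** 2) / (K_COULOMB * charge_c ** 2)


ratio = gravity_to_coulomb_ratio(PROTON_MASS, E_CHARGE)
print(f"F_gravity / F_Coulomb for two protons: {ratio:.1e}")
```

If the electromagnetic force between neighboring protons is taken to be roughly a hundredth of the strong force, this ratio is consistent in order of magnitude with the gravitational figure quoted above.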


2. The Weak Interaction The weak interaction is responsible for nuclear beta 
decay (see Section 12.8) and other similar decay processes involving fundamental 
particles. It does not play a major role in the binding of nuclei. The weak force 
between two neighboring protons is about 10⁻⁷ of the strong force between them,
and the range of the weak force is on the scale of 0.001 fm. Nevertheless, the 
weak force is important in understanding the behavior of fundamental particles, 
and it is critical in understanding the evolution of the universe. 


3. The Electromagnetic Interaction Electromagnetism is important in the 
structure and the interactions of the fundamental particles. For example, some 
particles interact or decay primarily through this mechanism. Electromagnetic 
forces are of infinite range, but the shielding effect generally diminishes their
effect for ordinary objects. Many common macroscopic forces (such as friction, 
air resistance, drag, and tension) are ultimately due to electromagnetic forces at the 
atomic level. Within the atom, electromagnetic forces dominate. The electromagnetic
force between neighboring protons in a nucleus is about 10⁻² of the strong
force, but within the nucleus the electromagnetic forces can act cumulatively 
because there is no shielding. As a result, the electromagnetic force can compete 
with the strong force in determining the stability and the structure of nuclei. 


4. The Strong Force The strong force, which is responsible for the binding of 
nuclei, is the dominant one in the reactions and decays of most of the fundamental 
particles. However, as we shall see, some particles (such as the electron) do not 
feel this force at all. It has a relatively short range, on the order of 1 fm. 


The relative strength of a force determines the time scale over which it acts. If 
we bring two particles close enough together for any of these forces to act, then 
a longer time is required for the weak force to cause a decay or reaction than for 
the strong force. As we shall see, the mean lifetime of a decay process is often 
a signal of the type of interaction responsible for the process, with strong forces 
being at the shortest end of the time scale (often down to 10⁻²³ s). Table 14.1
summarizes the four forces and some of their properties. 

Particles can interact with one another in decays and reactions through any 
of the basic forces. Table 14.1 indicates which particles can interact through 
each of the four forces. All particles can interact through the gravitational and 
weak forces. A subset of those can interact through the electromagnetic force (for 
example, the neutrinos are excluded from this category), and a still smaller subset 
can interact through the strong force. When two strongly interacting particles are 
within the range of each other’s strong force, we can often neglect the effects 
of the weak and electromagnetic forces in decay and reaction processes; because 
their relative strengths are so much smaller than that of the strong force, their 
effects are much smaller than those of the strong force. (However, these forces 
are not always negligible—the weak interaction between protons is responsible 
for a critical step in one of the fusion processes that occurs in stars.) 

Even though the proton is a strongly interacting particle, a proton and an 
electron will never interact through the strong force. The electron is able to ignore 
the strong force of the proton and respond only to its weak or electromagnetic 
force. 

Each of the four forces can be represented in terms of the emission or 
absorption of particles that carry the interaction, just as we represent the 
force between nucleons in the nucleus in terms of the exchange of pions (see 
Section 12.4). Associated with each type of force is a field that is carried by its 
characteristic particle, as shown in Table 14.2. 


TABLE 14.1 The Four Basic Forces 

Type              Range          Relative Strength   Characteristic Time            Typical Particles 
Strong            1 fm           1                   < $10^{-22}$ s                 $\pi$, K, n, p 
Electromagnetic   $\infty$       $10^{-2}$           $10^{-14}$ to $10^{-20}$ s     e, $\mu$, $\pi$, K, n, p 
Weak              $10^{-3}$ fm   $10^{-7}$           $10^{-8}$ to $10^{-13}$ s      All 
Gravitational     $\infty$       $10^{-38}$          Years                          All 

TABLE 14.2 The Field Particles 

Force             Field Particle   Symbol           Charge (e)   Spin ($\hbar$)   Rest Energy (GeV) 
Strong            Gluon            g                0            1                0 
Electromagnetic   Photon           $\gamma$         0            1                0 
Weak              Weak boson       $W^+$, $W^-$     $\pm 1$      1                80.4 
Weak              Weak boson       $Z^0$            0            1                91.2 
Gravitational     Graviton                          0            2                0 

• The strong force between quarks is carried by particles called gluons, 
which have been observed through indirect techniques. 

• The electromagnetic force between particles can be represented in terms 
of the emission and absorption of photons. 

• The weak force is carried by the weak bosons $W^{\pm}$ and $Z^0$, which are 
responsible for processes such as nuclear beta decay. For example, the 
beta decay of the neutron (a weak interaction) can be represented as 

$$n \to p + W^- \quad \text{followed by} \quad W^- \to e^- + \bar{\nu}_e$$

Because the decay $n \to p + W^-$ would violate energy conservation, the 
existence of the $W^-$ is restricted by the uncertainty principle, and its range 
can be determined in a manner similar to that of the pion (see Eq. 12.8). 

• The gravitational force is carried by the graviton, which is expected to 
exist based on theories of gravitation but has not yet been observed. 
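
As a rough numerical check of the range argument in the weak-force item above, the sketch below assumes that the range estimate takes the usual uncertainty-principle form x ≈ ħc/(mc²). Eq. 12.8 is not reproduced in this section, so that form, the constant name HBAR_C_MEV_FM, and the helper exchange_range_fm are assumptions made for illustration only.

```python
# Sketch: range of a force carried by a massive exchanged particle,
# assuming the uncertainty-principle estimate x ~ hbar*c / (m c^2).

HBAR_C_MEV_FM = 197.327  # hbar*c in MeV*fm


def exchange_range_fm(rest_energy_mev):
    """Approximate range (fm) for a field particle with the given rest energy (MeV)."""
    if rest_energy_mev <= 0:
        raise ValueError("a massless field particle has unlimited range")
    return HBAR_C_MEV_FM / rest_energy_mev


pion_range_fm = exchange_range_fm(140.0)   # pion, rest energy 140 MeV
w_range_fm = exchange_range_fm(80.4e3)     # W boson, rest energy 80.4 GeV

print(f"pion exchange range: {pion_range_fm:.2f} fm")
print(f"W boson exchange range: {w_range_fm:.1e} fm")
```

With these inputs the estimate is roughly 1.4 fm for pion exchange and about 2.5 × 10^-3 fm for the W boson, the same orders of magnitude as the strong and weak ranges listed in Table 14.1.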


14.2 CLASSIFYING PARTICLES 


One way of studying the elementary particles is to classify them into different 
categories based on certain behaviors or properties and then to look for similarities 
or common characteristics among the classifications. We have already classified 
some particles in Table 14.1 according to the types of forces through which they 
interact. Another way of classifying them might be according to their masses. 
In the early days of particle physics, it was observed that the lightest particles 
(including electrons, muons, and neutrinos) showed one type of behavior, the 
heaviest group (including protons and neutrons) showed a different behavior, and 
a middle group (such as pions and kaons) showed a still different behavior. The 
names originally given to these groups are based on the Greek words for light, 
middle, and heavy: leptons for the light particles, mesons for the middle group, 
and baryons for the heavier particles. Even though the classification by mass is 
now obsolete (leptons and mesons have been discovered that are more massive 
than protons or neutrons), we keep the original names, which now describe instead 
a group or family of particles with similar properties. When we compare our first 
two ways of classifying particles, we find an interesting result: The leptons do not 
interact through the strong force, but the mesons and baryons do. 

We can also classify particles by their intrinsic spins. Every particle has an 
intrinsic spin; you will recall that the electron has a spin of 1/2, as do the proton and 
neutron. We find that the leptons all have spins of 1/2, the mesons all have integral 
spins (0, 1, 2, ...), and the baryons all have half-integral spins (1/2, 3/2, 5/2, ...). 
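
The two classification criteria just described (coupling to the strong force and intrinsic spin) can be combined into a small decision rule. The function below is only an illustrative sketch; its name, its arguments, and the "unclassified" fallback are assumptions and do not come from the text.

```python
from fractions import Fraction


def classify_family(spin, feels_strong_force):
    """Return 'lepton', 'meson', or 'baryon' from spin and strong-force coupling."""
    spin = Fraction(spin)
    if not feels_strong_force:
        # Leptons do not interact strongly and all have spin 1/2.
        return "lepton" if spin == Fraction(1, 2) else "unclassified"
    # Strongly interacting particles: integral spin -> meson, half-integral -> baryon.
    return "meson" if spin.denominator == 1 else "baryon"


examples = {
    "electron": classify_family("1/2", feels_strong_force=False),
    "pion": classify_family(0, feels_strong_force=True),
    "proton": classify_family("1/2", feels_strong_force=True),
    "Omega-": classify_family("3/2", feels_strong_force=True),
}
print(examples)
```

Spins are passed as exact fractions so that the test for integral versus half-integral values does not depend on floating-point rounding.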


Antiparticles 


One additional property that is used to classify a particle is the nature of its 
antiparticle.* Every particle has an antiparticle, which is identical to the particle 
in such properties as mass and lifetime, but differs from the particle in the sign of 
its electric charge (and in the sign of certain other properties, as we discuss later). 
The antiparticle of the electron is the positron $e^+$, which was discovered in the 
1930s through reactions initiated by cosmic rays. The positron has a charge of $+e$ 
(opposite to that of the electron) and a rest energy of 0.511 MeV (identical to that 
of the electron). The antiproton $\bar{p}$ was discovered in 1956 (see Example 2.18); it 
has a charge of $-e$ and a rest energy of 938 MeV. A stable atom of antihydrogen 
could be constructed from a positron and an antiproton; the properties of this atom 
would be identical to those of ordinary hydrogen. 

Antiparticles of stable particles (such as the positron and the antiproton) 
are themselves stable. However, when a particle and its antiparticle meet, the 
annihilation reaction can occur: the particle and antiparticle both vanish, and 
instead two or more photons can be produced. Conservation of energy and 
momentum requires that, neglecting the kinetic energies of the particles, when 
two photons are emitted each must have an energy equal to the rest energy of the 
particle. Examples of annihilation reactions are: 


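$$e^- + e^+ \to \gamma_1 + \gamma_2 \qquad (E_{\gamma 1} = E_{\gamma 2} = 0.511\ \text{MeV})$$
$$p + \bar{p} \to \gamma_1 + \gamma_2 \qquad (E_{\gamma 1} = E_{\gamma 2} = 938\ \text{MeV})$$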


We call the kind of stuff of which we are made matter and the other kind of 
stuff antimatter. There may indeed be galaxies composed of antimatter, but we 
cannot tell by the ordinary techniques of astronomy, because light and antilight 
are identical! To put it another way, the photon and antiphoton are the same 
particle, so matter and antimatter emit the same photons. The only way to tell the 
difference is by sending a chunk of our matter to the distant galaxy and seeing 
whether or not it is annihilated with the corresponding emission of a burst of 
photons. (It is indeed possible, but highly unlikely, that the first astronaut to travel 
to another galaxy may suffer such a fate! The first intergalactic handshake would 
indeed be quite an event!) 

In our classification scheme it is usually easy to distinguish particles from 
antiparticles. We begin by defining particles to be the stuff of which ordinary mat- 
ter is made—electrons, protons, and neutrons. Ordinary matter is not composed of 
neutrinos, so we have no basis for distinguishing a neutrino from an antineutrino, 
but the conservation laws in the beta decay process can be understood most easily 
if we define the antineutrino to be the particle that accompanies negative beta 
decay and the neutrino to be the particle that accompanies positron decay and 
electron capture. For a heavy baryon, such as the $\Lambda$ (lambda), we take advantage 
of its radioactive decay, which leads eventually to ordinary protons and neutrons; 
that is, the $\Lambda$ is the particle that decays to n, and the $\bar{\Lambda}$ ("anti-lambda") therefore 
decays to $\bar{n}$. Similarly, in the case of the leptons, the $\mu^-$ and the $\mu^+$ are antiparticles 
of one another; because $\mu^-$ decays to ordinary $e^-$ (and has many properties 
in common with the electron) it is the particle, while $\mu^+$ is the antiparticle. 


*We use two systems to indicate antiparticles. Sometimes the symbol for the particle will be written 
along with the electric charge to indicate particle or antiparticle, as, for example, $e^+$ and $e^-$, or $\mu^+$ 
and $\mu^-$. Other times the antiparticle will be written with a bar over the symbol, for example, $\nu$ and 
$\bar{\nu}$ or p and $\bar{p}$. 
Three Families of Particles 


Table 14.3 summarizes the three families of material particles. 


Leptons The leptons interact only through the weak or electromagnetic inter- 
actions. No experiment has yet been able to reveal any internal structure for the 
leptons; they appear to be truly fundamental particles that cannot be split into still 
smaller particles. All known leptons have spin 1/2. 

Table 14.4 shows the six known leptons, grouped as three pairs of particles. 
Each pair includes a charged particle ($e^-$, $\mu^-$, $\tau^-$) and an uncharged neutrino 
($\nu_e$, $\nu_\mu$, $\nu_\tau$). Each lepton has a corresponding antiparticle. We have already 
discussed the electron neutrino and antineutrino in connection with beta decay 
(Section 12.8), and the decay of cosmic-ray muons was discussed as confirming 
the time dilation effect in special relativity (Section 2.4). The neutrino masses 
are very small but nonzero. The rest-energy limits shown in Table 14.4 come 
from attempts at direct measurement, but indirect evidence from astrophysics and 
cosmology suggests that the rest energies of all three neutrinos are less than 1 eV. 


Mesons Mesons are strongly interacting particles having integral spin. A partial 
list of some mesons is given in Table 14.5. Mesons can be produced in reactions 
through the strong interaction; they decay to other mesons or leptons through the 
strong, electromagnetic, or weak interactions. For example, pions can be produced 
in reactions of nucleons, such as 

$$p + n \to p + p + \pi^- \qquad \text{or} \qquad p + n \to p + n + \pi^0$$

TABLE 14.3 Families of Particles 

Family    Structure     Interactions                    Spin            Examples 
Leptons   Fundamental   Weak, electromagnetic           Half integral   e, $\nu$ 
Mesons    Composite     Weak, electromagnetic, strong   Integral        $\pi$, K 
Baryons   Composite     Weak, electromagnetic, strong   Half integral   p, n 

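TABLE 14.4 The Lepton Family 

Particle     Antiparticle        Charge (e)   Spin ($\hbar$)   Rest Energy (MeV)   Mean Life (s)            Typical Decay Products 
$e^-$        $e^+$               -1           1/2              0.511               $\infty$ 
$\nu_e$      $\bar{\nu}_e$       0            1/2              < 2 eV              $\infty$ 
$\mu^-$      $\mu^+$             -1           1/2              105.7               $2.2 \times 10^{-6}$     $e^- + \bar{\nu}_e + \nu_\mu$ 
$\nu_\mu$    $\bar{\nu}_\mu$     0            1/2              < 0.19              $\infty$ 
$\tau^-$     $\tau^+$            -1           1/2              1777                $2.9 \times 10^{-13}$    $\mu^- + \bar{\nu}_\mu + \nu_\tau$ 
$\nu_\tau$   $\bar{\nu}_\tau$    0            1/2              < 18                $\infty$ 
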
TABLE 14.5 Some Selected Mesons 

Particle     Antiparticle    Charge* (e)   Spin ($\hbar$)   Strangeness*   Rest Energy (MeV)   Mean Life (s)            Typical Decay Products 
$\pi^+$      $\pi^-$         +1            0                0              140                 $2.6 \times 10^{-8}$     $\mu^+ + \nu_\mu$ 
$\pi^0$      $\pi^0$         0             0                0              135                 $8.4 \times 10^{-17}$    $\gamma + \gamma$ 
$K^+$        $K^-$           +1            0                +1             494                 $1.2 \times 10^{-8}$     $\mu^+ + \nu_\mu$ 
$K^0$        $\bar{K}^0$     0             0                +1             498                 $0.9 \times 10^{-10}$    $\pi^+ + \pi^-$ 
$\eta$       $\eta$          0             0                0              548                 $5.1 \times 10^{-19}$    $\gamma + \gamma$ 
$\rho^+$     $\rho^-$        +1            1                0              775                 $4.4 \times 10^{-24}$    $\pi^+ + \pi^0$ 
$\eta'$      $\eta'$         0             0                0              958                 $3.2 \times 10^{-21}$    $\eta + \pi^+ + \pi^-$ 
$D^+$        $D^-$           +1            0                0              1869                $1.0 \times 10^{-12}$    $K^- + \pi^+ + \pi^+$ 
$J/\psi$     $J/\psi$        0             1                0              3097                $7.1 \times 10^{-21}$    $e^+ + e^-$ 
$B^+$        $B^-$           +1            0                0              5279                $1.6 \times 10^{-12}$    $D^- + \pi^+ + \pi^+$ 
$\Upsilon$   $\Upsilon$      0             1                0              9460                $1.2 \times 10^{-20}$    $e^+ + e^-$ 

*The charge and strangeness are those of the particle. Values for the antiparticle have the opposite sign. The spin, rest energy, and 
mean life are the same for a particle and its antiparticle. 


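and the pions can decay according to 

$$\pi^- \to \mu^- + \bar{\nu}_\mu \qquad (\text{mean life} = 2.6 \times 10^{-8}\ \text{s})$$
$$\pi^0 \to \gamma + \gamma \qquad (\text{mean life} = 8.4 \times 10^{-17}\ \text{s})$$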


The first decay is caused by the weak interaction (indicated by the lifetime and by 
the presence of a neutrino among the decay products) and the second is caused by 
the electromagnetic interaction (indicated by the lifetime and the photons). 
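
The reasoning used above, in which the mean life points to the interaction responsible for a decay, can be sketched as a rule of thumb. The two cut values below ($10^{-21}$ s and $10^{-13}$ s) are illustrative assumptions loosely based on the characteristic times of Table 14.1, not sharp physical boundaries, and the function name is hypothetical.

```python
def likely_interaction(mean_life_s):
    """Guess the interaction responsible for a decay from its mean life in seconds."""
    if mean_life_s < 1e-21:   # assumed upper edge of strong-decay lifetimes
        return "strong"
    if mean_life_s < 1e-13:   # assumed boundary between electromagnetic and weak decays
        return "electromagnetic"
    return "weak"


# Mean lives taken from Table 14.5.
guesses = {
    "charged pion": likely_interaction(2.6e-8),
    "neutral pion": likely_interaction(8.4e-17),
    "rho meson": likely_interaction(4.4e-24),
}
print(guesses)
```

A lifetime alone is only a hint. The decay products (neutrinos for the weak interaction, photons for the electromagnetic interaction) provide the complementary evidence described in the text.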

Because mesons are not observed in ordinary matter, the classification into 
particles and antiparticles is somewhat arbitrary. For the charged mesons such as 
$\pi^+$ and $\pi^-$ or $K^+$ and $K^-$, which are not part of ordinary matter, the positive and 
negative particles are antiparticles of one another but there is no way to choose 
which is matter and which is antimatter. For some uncharged mesons (such as $\pi^0$ 
and $\eta$) the particle and antiparticle are identical, while for others (such as $K^0$ and 
$\bar{K}^0$) they may be distinct. 


Baryons The baryons are strongly interacting particles with half-integral spins 
(1/2, 3/2,...). A partial listing of some baryons is given in Table 14.6. Like the 
leptons, the baryons have distinct antiparticles. Like the mesons, the baryons 
can be produced in reactions with nucleons through the strong interaction; for 
example, the $\Lambda^0$ baryon can be produced in the following reaction: 

$$p + p \to p + \Lambda^0 + K^+$$

The $\Lambda^0$ then decays through the weak interaction according to 

$$\Lambda^0 \to p + \pi^- \qquad (\text{mean life} = 2.6 \times 10^{-10}\ \text{s})$$
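TABLE 14.6 Some Selected Baryons 

Particle      Antiparticle         Charge* (e)     Spin ($\hbar$)   Strangeness*   Rest Energy (MeV)   Mean Life (s)            Typical Decay Products 
p             $\bar{p}$            +1              1/2              0              938                 $\infty$ 
n             $\bar{n}$            0               1/2              0              940                 886                      $p + e^- + \bar{\nu}_e$ 
$\Lambda^0$   $\bar{\Lambda}^0$    0               1/2              -1             1116                $2.6 \times 10^{-10}$    $p + \pi^-$ 
$\Sigma^+$    $\bar{\Sigma}^-$     +1              1/2              -1             1189                $8.0 \times 10^{-11}$    $p + \pi^0$ 
$\Sigma^0$    $\bar{\Sigma}^0$     0               1/2              -1             1193                $7.4 \times 10^{-20}$    $\Lambda^0 + \gamma$ 
$\Sigma^-$    $\bar{\Sigma}^+$     -1              1/2              -1             1197                $1.5 \times 10^{-10}$    $n + \pi^-$ 
$\Xi^0$       $\bar{\Xi}^0$        0               1/2              -2             1315                $2.9 \times 10^{-10}$    $\Lambda^0 + \pi^0$ 
$\Xi^-$       $\bar{\Xi}^+$        -1              1/2              -2             1322                $1.6 \times 10^{-10}$    $\Lambda^0 + \pi^-$ 
$\Delta^*$    $\bar{\Delta}^*$     +2, +1, 0, -1   3/2              0              1232                $5.6 \times 10^{-24}$    $p + \pi$ 
$\Sigma^*$    $\bar{\Sigma}^*$     +1, 0, -1       3/2              -1             1385                $1.8 \times 10^{-23}$    $\Lambda^0 + \pi$ 
$\Xi^*$       $\bar{\Xi}^*$        -1, 0           3/2              -2             1533                $7.2 \times 10^{-23}$    $\Xi + \pi$ 
$\Omega^-$    $\bar{\Omega}^+$     -1              3/2              -3             1672                $8.2 \times 10^{-11}$    $\Lambda^0 + K^-$ 

*The charge and strangeness are those of the particle. Values for the antiparticle have the opposite sign. The spin, rest energy, and 
mean life are the same for a particle and its antiparticle. 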


Even though neutrinos are not produced in this decay process, the lifetime 
indicates that the decay proceeds through the weak interaction. Other baryons 
can be identified in Table 14.6 that decay through the strong, electromagnetic, or 
weak interactions. 


14.3 CONSERVATION LAWS 


In the decays and reactions of elementary particles, conservation laws provide a 
way to understand why some processes occur and others are not observed, even 
though they are expected on the basis of other considerations. We frequently 
use the conservation of energy, linear momentum, and angular momentum in our 
analysis of physical phenomena. These conservation laws are closely connected 
with the fundamental properties of space and time; we believe those laws to be 
absolute and inviolable. 

We also use other kinds of conservation laws in analyzing various processes. 
For example, when we combine two elements in a chemical reaction, such as 
hydrogen + oxygen $\to$ water, we must balance the reaction in the following way: 


$$2\mathrm{H}_2 + \mathrm{O}_2 \to 2\mathrm{H}_2\mathrm{O}$$


The process of balancing a reaction can also be regarded as a way of accounting for 
the electrons that participate in the process: A molecule of water contains 10 elec- 
trons, and so the atoms that combine to make up the molecule must likewise include 
10 electrons. 


In nuclear processes, we are concerned not with electrons but with protons and 
neutrons. In the alpha decay of a nucleus, such as 


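$${}^{235}_{92}\mathrm{U}_{143} \to {}^{231}_{90}\mathrm{Th}_{141} + {}^{4}_{2}\mathrm{He}_{2}$$

or in a reaction such as 

$$p + {}^{63}_{29}\mathrm{Cu}_{34} \to {}^{63}_{30}\mathrm{Zn}_{33} + n$$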


we balance the number of protons and also the number of neutrons. We might 
be tempted to conclude that nuclear processes conserve both proton number 
and neutron number, but the separate conservation laws are not satisfied in beta 
decays, for example 


$$n \to p + e^- + \bar{\nu}_e$$


which does not conserve either neutron number or proton number. However, it 
does conserve the total neutron number plus proton number, which is equal to 1 
both before and after the decay. (This conservation law of total nucleon number 
includes the separate laws of conservation of proton number and neutron number 
as a special case.) 
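
The bookkeeping in this passage can be sketched in a few lines. Each nuclide or nucleon is written here as a pair (Z, N); the helper names are illustrative and are not part of the text.

```python
# Each particle is (Z, N) = (proton number, neutron number).
PROTON = (1, 0)
NEUTRON = (0, 1)
LEPTON = (0, 0)  # electrons and neutrinos carry no nucleon number


def totals(particles):
    """Return total proton number, neutron number, and nucleon number."""
    z = sum(p[0] for p in particles)
    n = sum(p[1] for p in particles)
    return z, n, z + n


def conservation_report(initial, final):
    """Report which of the three counts are unchanged by the process."""
    zi, ni, ai = totals(initial)
    zf, nf, af = totals(final)
    return {"proton": zi == zf, "neutron": ni == nf, "nucleon": ai == af}


alpha_decay = conservation_report([(92, 143)], [(90, 141), (2, 2)])
cu_reaction = conservation_report([PROTON, (29, 34)], [(30, 33), NEUTRON])
beta_decay = conservation_report([NEUTRON], [PROTON, LEPTON, LEPTON])

print(alpha_decay)
print(cu_reaction)
print(beta_decay)
```

The alpha decay and the copper reaction leave all three counts unchanged, while neutron beta decay changes the proton and neutron numbers separately and preserves only their sum.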


Lepton Number Conservation 


In negative beta decay we always find an antineutrino emitted, never a neutrino. 
Conversely, in positron beta decay, it is the neutrino that is always emitted. We 
account for these processes by assigning each particle a lepton number L. The 
electron and neutrino are assigned lepton numbers of +1, and the positron and 
antineutrino are assigned lepton numbers of $-1$; all mesons and baryons are 
assigned lepton numbers of zero. Lepton number conservation in positive and 
negative beta decay then works as follows: 


$$n \to p + e^- + \bar{\nu}_e \qquad L:\ 0 \to 0 + 1 + (-1)$$
$$p \to n + e^+ + \nu_e \qquad L:\ 0 \to 0 + (-1) + 1$$


You can see that the total lepton number is 0 both before and after these decays, 
which accounts for the appearance of the antineutrino in negative beta decay and 
the neutrino in positron decay. 

According to the lepton conservation law, these processes are forbidden: 


$$e^- + p \to n + \bar{\nu}_e \qquad L:\ 1 + 0 \to 0 + (-1)$$
$$p \to e^+ + \gamma \qquad L:\ 0 \to -1 + 0$$


In keeping track of leptons, we must count each type of lepton (e, $\mu$, $\tau$) 
separately. Evidence for this comes from a variety of experiments. For example, 
the distinction between electron-type and muon-type leptons is clear from an 
experiment in which a beam of muon-type antineutrinos is incident on a target of 
protons: 


$$\bar{\nu}_\mu + p \to n + \mu^+$$
Emmy Noether (1882–1935, Germany-United States). Known both 
as a mathematician and a theoreti- 
cal physicist, she explored the role 
of conservation laws in physics. In 
an important result now known as 
Noether’s theorem, she discovered 
that each symmetry of the mathe- 
matical equations describing a phe- 
nomenon gives a conserved quan- 
tity. For example, the symmetry 
of equations to translations in time 
leads to conservation of energy, 
and the invariance to translations in 
space leads to conservation of linear 
momentum. 
If there were no difference between electron-type and muon-type leptons, the 
following reaction would be possible: $\bar{\nu}_\mu + p \to n + e^+$. However, this outcome 
is never observed, which indicates the distinction between the two types of leptons 
and the need to account separately for each type. 

Another example of the difference between the types of leptons comes from 
the failure to observe the decay $\mu^- \to e^- + \gamma$. If there were only one common 
type of lepton number, this decay would be possible. The failure to observe this 
decay (in comparison with the commonly observed decay $\mu^- \to e^- + \bar{\nu}_e + \nu_\mu$, 
which conserves both muon-type and electron-type lepton number) suggests the 
need for the different kinds of lepton numbers. We call these lepton numbers 
$L_e$, $L_\mu$, and $L_\tau$, and we have the following conservation law for leptons: 


In any process, the lepton numbers for electron-type leptons, muon-type 
leptons, and tau-type leptons must each remain constant. 


The following examples illustrate the conservation of these lepton numbers. 


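$$\bar{\nu}_e + p \to e^+ + n \qquad L_e:\ -1 + 0 \to -1 + 0$$
$$\nu_\mu + n \to \mu^- + p \qquad L_\mu:\ 1 + 0 \to 1 + 0$$
$$\mu^- \to e^- + \bar{\nu}_e + \nu_\mu \qquad L_e:\ 0 \to 1 + (-1) + 0 \qquad L_\mu:\ 1 \to 0 + 0 + 1$$
$$\pi^- \to \mu^- + \bar{\nu}_\mu \qquad L_\mu:\ 0 \to 1 + (-1)$$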


Studying these examples, we can understand why sometimes neutrinos appear 
and sometimes antineutrinos appear. 
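
The three separate lepton numbers lend themselves to a simple tally. The sketch below assigns each lepton a triple ($L_e$, $L_\mu$, $L_\tau$) following the rules stated above; the particle labels (such as "anti-nu_mu") and function names are ad hoc choices for this illustration, and any name missing from the table is treated as carrying no lepton number.

```python
# (L_e, L_mu, L_tau) for each lepton; hadrons and photons default to (0, 0, 0).
LEPTON_NUMBERS = {
    "e-": (1, 0, 0), "e+": (-1, 0, 0),
    "nu_e": (1, 0, 0), "anti-nu_e": (-1, 0, 0),
    "mu-": (0, 1, 0), "mu+": (0, -1, 0),
    "nu_mu": (0, 1, 0), "anti-nu_mu": (0, -1, 0),
    "tau-": (0, 0, 1), "tau+": (0, 0, -1),
    "nu_tau": (0, 0, 1), "anti-nu_tau": (0, 0, -1),
}


def lepton_totals(particles):
    """Sum L_e, L_mu, and L_tau over a list of particle names."""
    sums = [0, 0, 0]
    for name in particles:
        for i, value in enumerate(LEPTON_NUMBERS.get(name, (0, 0, 0))):
            sums[i] += value
    return tuple(sums)


def conserves_lepton_numbers(initial, final):
    """True only if each of the three lepton numbers is separately unchanged."""
    return lepton_totals(initial) == lepton_totals(final)


muon_decay = conserves_lepton_numbers(["mu-"], ["e-", "anti-nu_e", "nu_mu"])
radiative_decay = conserves_lepton_numbers(["mu-"], ["e-", "gamma"])
muon_capture = conserves_lepton_numbers(["anti-nu_mu", "p"], ["n", "mu+"])
wrong_flavor = conserves_lepton_numbers(["anti-nu_mu", "p"], ["n", "e+"])

print(muon_decay, radiative_decay, muon_capture, wrong_flavor)
```

The two processes that are observed pass the check, and the two that are never observed fail it, even though a single combined lepton number would have allowed all four.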


Baryon Number Conservation 


Baryons are subject to a similar conservation law. All baryons are assigned a 
baryon number B = +1, and all antibaryons are assigned B = —1. All nonbaryons 
(mesons and leptons) have B = 0. We then have the law of conservation of baryon 
number: 


In any process, the total baryon number must remain constant. 


(The conservation of nucleon number A is a special case of conservation of 
baryon number, in which all the baryons are nucleons. In particle physics, it is 
customary to use B instead of A to represent all baryons, including the nucleons.) 
No violation of the law of baryon conservation has ever been observed, although 
the Grand Unified Theories (see Section 14.8) suggest that the proton can decay 
in a way that would violate conservation of baryon number. 

As an example of conservation of baryon number, consider the reaction that 
was responsible for the discovery of the antiproton: 


$$p + p \to p + p + p + \bar{p}$$


On the left side, the total baryon number is B = +2. On the right side, we 
have three baryons with B= +1 and one antibaryon with B= —1, so the 
total baryon number is B = +2 on the right side also. On the other hand, the 
reaction $p + p \to p + p + \bar{n}$ violates baryon number conservation and is therefore 
forbidden. 
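
The same kind of tally works for baryon number. This is a minimal sketch: only the nucleons and their antiparticles are listed, every other name is assumed to be a nonbaryon with B = 0, and the labels are illustrative.

```python
# Baryon number of the particles used below; anything else counts as B = 0.
BARYON_NUMBER = {"p": 1, "n": 1, "anti-p": -1, "anti-n": -1}


def baryon_total(particles):
    """Sum the baryon number over a list of particle names."""
    return sum(BARYON_NUMBER.get(name, 0) for name in particles)


def conserves_baryon_number(initial, final):
    return baryon_total(initial) == baryon_total(final)


antiproton_production = conserves_baryon_number(
    ["p", "p"], ["p", "p", "p", "anti-p"])
with_antineutron = conserves_baryon_number(
    ["p", "p"], ["p", "p", "anti-n"])
pion_production = conserves_baryon_number(
    ["p", "p"], ["p", "n", "pi+"])

print(antiproton_production, with_antineutron, pion_production)
```

The antiproton-production reaction balances at B = +2 on both sides, whereas a final state of two protons and an antineutron has B = +1 and fails the check.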


Strangeness Conservation 


The number of mesons that can be created or destroyed in decays or reactions 
is not subject to a conservation law like the number of leptons or baryons. For 
example, the following reactions can be used to produce pions: 


$$p + p \to p + n + \pi^+ \qquad p + p \to p + n + \pi^+ + \pi^0$$
$$p + p \to p + p + \pi^0 \qquad p + p \to p + p + \pi^+ + \pi^-$$


As long as enough energy is available, any number of pions can be produced in 
these reactions. 

If we try the same type of reaction to produce K mesons, a different type of 
behavior is observed. The reactions $p + p \to p + n + K^+$ and $p + p \to p + p + K^0$ 
never occur, even though the incident proton is given enough energy to produce this 
particle. We do, however, observe reactions such as $p + p \to p + n + K^+ + \bar{K}^0$ and 
$p + p \to p + p + K^+ + K^-$, which are very similar to the reactions that produce 
two pions. Why do reactions producing $\pi$ mesons give any number (odd or even) 
but reactions producing K mesons give them only in pairs? 

Here’s another example of this unusual behavior. The reaction 
$\pi^- + p \to \pi^+ + \Sigma^-$ conserves electric charge and baryon number and so would be expected 
to occur, but it does not. Instead, the following reaction is easily observed: 
$\pi^- + p \to K^+ + \Sigma^-$. Usually when we fail to observe a reaction or decay 
process that is expected to occur, we look for the violation of some conservation 
law such as electric charge or baryon number. Is there a new conserved quantity 
whose violation prohibits the reaction from occurring? 

There are also decay processes that suggest that our labeling of the particles is 
incomplete. The uncharged $\eta$ and $\pi^0$ mesons decay very rapidly ($10^{-16}$ to $10^{-18}$ s) 
into two photons; on the basis of the systematic behavior of mesons, we would 
expect the $K^0$ to decay similarly to two photons in a comparable time. The 
observed decay of the $K^0$ takes place much more slowly ($10^{-10}$ s); moreover, the 
decay products are not photons, but $\pi$ mesons and leptons. Is a new conservation 
law responsible for restricting the decay of the $K^0$? 

As a final example of the need for a new conservation law, the heavy charged 
mesons are all strongly interacting particles, and we expect them to decay into 
the lighter mesons through the strong interaction with very short lifetimes. For 
example, the decay $\rho^+ \to \pi^+ + \pi^0$ occurs in a lifetime of about $10^{-23}$ s. But 
the decay $K^+ \to \pi^+ + \pi^0$ occurs very slowly, in a time of the order of $10^{-8}$ s, 
and in fact the different decay mode $K^+ \to \mu^+ + \nu_\mu$ is more probable. What is 
responsible for slowing the decay of the K meson by 15 orders of magnitude? 

These unusual behaviors are explained by the introduction of a new conserved 
quantity. This quantity is called the strangeness S, and we can use it to explain 
the properties of the K-meson decays. The $K^0$ and $K^+$ are assigned strangeness of 
S = +1; the $\pi$ mesons and leptons are nonstrange particles (S = 0). The decay 
$K^0 \to \gamma + \gamma$, which is an electromagnetic decay (as indicated by the photons), is 
forbidden because the electromagnetic interaction conserves strangeness (S = +1 
on the left, S = 0 on the right). The decay $K^+ \to \pi^+ + \pi^0$ does not occur in the 
typical strong interaction time of $10^{-23}$ s because the strong interaction cannot 
change strangeness. It occurs in the typical weak interaction time of $10^{-8}$ s (and the 
corresponding weak interaction decay $K^+ \to \mu^+ + \nu_\mu$ occurs as often) because 
the weak interaction does not conserve strangeness; decays that are caused by the 
weak interaction can change the strangeness by one unit. 

We can summarize these results in the law of conservation of strangeness: 


In processes governed by the strong or electromagnetic interactions, the 
total strangeness must remain constant. In processes governed by the weak 
interaction, the strangeness either remains constant or changes by one unit. 


The strangeness quantum numbers of the mesons and baryons are given in 
Tables 14.5 and 14.6. The strangeness of an antiparticle has the opposite sign to 
that of the corresponding particle. 

Conservation of strangeness in the strong interaction explains why the K 
mesons are always produced in pairs in proton-proton collisions. The protons 
and neutrons are nonstrange particles (S = 0), so the only way to conserve 
strangeness in the collisions that produce K mesons is to produce them in pairs, 
always one with S = +1 and the other with $S = -1$. 

The baryons also come in strange and nonstrange varieties. Looking at the 
lifetimes in Table 14.6, we see that the $\Lambda^0$ decays into $p + \pi^-$ with a lifetime 
of about $10^{-10}$ s, while we would expect a strongly interacting particle to decay 
to other strongly interacting particles with a lifetime of about $10^{-23}$ s. If the 
strangeness of the $\Lambda^0$ is assigned as $-1$, these decays change S and are forbidden 
to go by the strong interaction, and so must be due to the weak interaction, 
with the characteristic $10^{-10}$ s lifetime. The strangeness violation also tells us 
why the electromagnetic decay $\Lambda^0 \to n + \gamma$ does not occur (while the decay 
$\Sigma^0 \to \Lambda^0 + \gamma$ does occur, with a typical electromagnetic lifetime of $10^{-19}$ s). It 
also explains why the reaction $\pi^- + p \to \pi^+ + \Sigma^-$, which is permitted by all 
other conservation laws, is never observed: the initial state has S = 0 and the 
final state has $S = -1$, so it violates strangeness conservation. 

The weak interaction can change the strangeness by at most one unit. As 
a result, processes such as $\Xi^0 \to n + \pi^0$ ($S = -2 \to S = 0$) are absolutely 
forbidden, even by the weak interaction. 


Example 14.1 

The $\Omega^-$ baryon has $S = -3$. (a) It is desired to produce the 
$\Omega^-$ using a beam of $K^-$ incident on protons. What other 
particles are produced in this reaction? (b) How might the 
$\Omega^-$ decay? 


Solution 
(a) Reactions usually proceed only through the strong 
interaction, which conserves strangeness. We consider the 
reaction 

$$K^- + p \to \Omega^- + \,?$$

On the left side, we have $S = -1$, $B = +1$, and electric 
charge $Q = 0$. On the right side, we have $S = -3$, $B = +1$, 
and $Q = -1$. We must therefore add to the right side 
particles with $S = +2$, $B = 0$, and $Q = +1$. Scanning 
through the tables of mesons and baryons, we find that we 
can satisfy these criteria with $K^+$ and $K^0$, so one possible 
reaction is 

$$K^- + p \to \Omega^- + K^+ + K^0$$

(b) The $\Omega^-$ cannot decay by the strong interaction, because 
no $S = -3$ final states are available. It must therefore decay 
to particles having $S = -2$ through the weak interaction, 
which can change S by one unit. One of the product particles 
must be a baryon in order to conserve baryon number. 
Two possibilities are 

$$\Omega^- \to \Lambda^0 + K^- \qquad \text{and} \qquad \Omega^- \to \Xi^0 + \pi^-$$
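
The accounting in Example 14.1 can be reproduced with a short script. This is a sketch under simplifying assumptions: each particle is described only by its charge, baryon number, and strangeness (values from Tables 14.5 and 14.6), energy and lepton-number requirements are ignored, and the particle labels and function names are illustrative.

```python
# (charge Q in units of e, baryon number B, strangeness S)
QUANTUM_NUMBERS = {
    "p": (1, 1, 0), "n": (0, 1, 0),
    "pi+": (1, 0, 0), "pi-": (-1, 0, 0), "pi0": (0, 0, 0),
    "K+": (1, 0, 1), "K0": (0, 0, 1), "K-": (-1, 0, -1),
    "Lambda0": (0, 1, -1), "Sigma-": (-1, 1, -1),
    "Xi0": (0, 1, -2), "Omega-": (-1, 1, -3),
}


def totals(particles):
    """Sum (Q, B, S) over a list of particle names."""
    return tuple(sum(QUANTUM_NUMBERS[name][i] for name in particles)
                 for i in range(3))


def missing_numbers(initial, known_final):
    """(Q, B, S) that the unlisted products must carry in a strong reaction."""
    return tuple(i - f for i, f in zip(totals(initial), totals(known_final)))


def allowed_interactions(initial, final):
    """Classify a process by the strangeness rule stated in the text."""
    qi, bi, si = totals(initial)
    qf, bf, sf = totals(final)
    if (qi, bi) != (qf, bf):
        return "forbidden (charge or baryon number not conserved)"
    change = abs(sf - si)
    if change == 0:
        return "strong, electromagnetic, or weak"
    if change == 1:
        return "weak only"
    return "forbidden (strangeness changes by more than one unit)"


needed = missing_numbers(["K-", "p"], ["Omega-"])   # part (a)
kaon_pair = totals(["K+", "K0"])
decay_1 = allowed_interactions(["Omega-"], ["Lambda0", "K-"])   # part (b)
decay_2 = allowed_interactions(["Omega-"], ["Xi0", "pi-"])
xi_decay = allowed_interactions(["Xi0"], ["n", "pi0"])

print(needed, kaon_pair)
print(decay_1, "|", decay_2, "|", xi_decay)
```

The missing quantum numbers come out as Q = +1, B = 0, S = +2, which is exactly what a $K^+$ plus a $K^0$ supply, and both proposed decays change S by one unit, so they are classified as weak.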
14.4 PARTICLE INTERACTIONS AND DECAYS 


In this section we briefly summarize the properties of the elementary particles and 
how they are measured. 

Atoms and molecules can be taken apart relatively easily and nonviolently, 
enabling us to study their structure. However, the elementary particles, most 
of which are unstable and do not exist in nature, must be created in violent 
collisions. (The particle theorist Richard Feynman once compared this process 
with studying fine Swiss watches by smashing them together and looking at the 
pieces that emerge from the collision.) For this purpose we need a high-energy 
beam of particles and a suitable target of elementary particles. The only strongly 
interacting, stable elementary particle is the proton, and thus a hydrogen target is 
a logical choice. To get a reasonable density of target atoms, researchers often 
use liquid, rather than gaseous, hydrogen. 

For a suitable beam, we must be able to accelerate a particle to very high 
energies (so that the energy of the particle may be hundreds of times its rest 
energy $mc^2$). A stable charged particle is the logical choice for the beam; stability 
is required because of the relatively long time necessary to accelerate the particle 
to high energies, and a charged particle is required so that electromagnetic fields 
may be used to accelerate the particle. Once again the proton is a convenient 
choice, and thus many particle physics reactions are produced using beams of 
high-energy protons. For example, at the Fermi National Accelerator Laboratory 
(Fermilab) near Chicago protons are accelerated to 1000 GeV (v/c = 0.9999996) 
around a track of radius 1000 m (Figure 14.1). 
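
The quoted speed can be checked from the relativistic relation between total energy and rest energy. The sketch below assumes that the 1000 GeV figure is the total energy of the proton (treating it as kinetic energy instead does not change the digits shown) and uses a proton rest energy of 0.938 GeV; the function name is illustrative.

```python
import math

PROTON_REST_ENERGY_GEV = 0.938


def speed_fraction(total_energy_gev, rest_energy_gev=PROTON_REST_ENERGY_GEV):
    """Return v/c for a particle with the given total and rest energies."""
    gamma = total_energy_gev / rest_energy_gev
    return math.sqrt(1.0 - 1.0 / gamma**2)


gamma = 1000.0 / PROTON_REST_ENERGY_GEV
beta = speed_fraction(1000.0)
print(f"gamma = {gamma:.0f}")
print(f"v/c = {beta:.7f}")
```

The Lorentz factor is a little over 1000, which is why the speed differs from c only in the seventh decimal place.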

One type of particle physics reaction can thus be represented as 


$$p + p \to \text{product particles}$$





FIGURE 14.1 An aerial view of the Tevatron at the Fermi National Accelerator 
Laboratory. Beams of protons and antiprotons circulate in opposite directions 
around the 1-km ring and collide at two locations, at the upper center and lower 
left. (Courtesy Fermi National Accelerator Laboratory.) 







Richard P. Feynman (1918-1988, 
United States). Seldom is one person 
known for both exceptional insights 
into theoretical physics and excep- 
tional methods of teaching first-year 
physics. He received the Nobel Prize 
for his work on the theory that couples 
quantum mechanics to electromag- 
netism, and his text and film Lectures 
on Physics give unusual perspectives 
to many areas of basic physics for 
undergraduates. 
FIGURE 14.2 The production of secondary 
particle beams. The magnet 
helps to select the mass and momen- 
tum of the desired particle. 


Among the product particles may be a variety of mesons or even heavier particles 
of the baryon family, of which the nucleons are the lightest members. The study 
of the nature and properties of these particles is the goal of particle physics. 

In many cases, conservation laws restrict the nature of the product particles, 
and it would be desirable to have other types of beams available. One possibility 
is indicated in Figure 14.2. A proton beam is incident on a target—the nature of 
the target is not important. Like Feynman’s Swiss watch parts, many different 
particles emerge. By suitable focusing and selection of the momentum, we can 
extract a beam of the secondary particles created in the reactions. The particle 
must live long enough to be delivered to a second target, which might be tens of 
meters away; even if the particle were traveling at the speed of light, it would 
need about $10^{-7}$ s to make its journey. Although this is a very short time interval 
by ordinary standards, on the time scale of elementary particles, it is a very long 
time—in fact, none of the unstable mesons or baryons (except the neutron) lives 
that long. 

Although our efforts to make a secondary beam would seem to be in vain, we 
have forgotten one very important detail. The lifetime of the particle is measured 
in its rest frame, while we are observing its flight in the laboratory frame, in 
which the particle is moving at speeds extremely close to the speed of light. 
The time dilation factor results in a lifetime, observed in our frame of reference, 
which might be hundreds of times longer than the proper lifetime. This factor 
extends the range of available secondary beams to those particles with lifetimes 
as short as $10^{-10}$ s, and makes it possible to obtain secondary beams to study such 
reactions as 

$$\pi + p \to \text{particles}$$

and 

$$K + p \to \text{particles}$$

even though the proper lifetimes of the $\pi$ and K are in the range of $10^{-10}$ to 
$10^{-8}$ s. 
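
To see how much time dilation helps, the sketch below estimates the mean distance a particle travels in the laboratory before decaying, using the relativistic result L = γβcτ with γβ = pc/(mc²). The 20 GeV/c beam momentum and the 30 m flight path are hypothetical values chosen for illustration; the rest energy and proper lifetime of the charged pion are taken from Table 14.5.

```python
import math

C_M_PER_S = 2.998e8  # speed of light in m/s


def mean_decay_length_m(momentum_mev, rest_energy_mev, proper_lifetime_s):
    """Mean lab-frame path before decay: gamma*beta*c*tau, with gamma*beta = pc/(mc^2)."""
    return (momentum_mev / rest_energy_mev) * C_M_PER_S * proper_lifetime_s


def surviving_fraction(distance_m, decay_length_m):
    """Fraction of a beam that has not yet decayed after the given distance."""
    return math.exp(-distance_m / decay_length_m)


PION_REST_ENERGY_MEV = 140.0
PION_LIFETIME_S = 2.6e-8
BEAM_MOMENTUM_MEV = 20_000.0   # hypothetical 20 GeV/c secondary beam
FLIGHT_PATH_M = 30.0           # hypothetical distance to the second target

dilated_length = mean_decay_length_m(
    BEAM_MOMENTUM_MEV, PION_REST_ENERGY_MEV, PION_LIFETIME_S)
undilated_length = C_M_PER_S * PION_LIFETIME_S  # speed c, no time dilation

with_dilation = surviving_fraction(FLIGHT_PATH_M, dilated_length)
without_dilation = surviving_fraction(FLIGHT_PATH_M, undilated_length)

print(f"decay length with time dilation: {dilated_length:.0f} m")
print(f"surviving fraction with dilation: {with_dilation:.3f}")
print(f"surviving fraction without dilation: {without_dilation:.3f}")
```

Under these assumed beam conditions nearly all of the pions reach the second target, whereas without time dilation only a few percent would survive the same flight path.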


Detecting Particles 


Observing the products of these reactions, which may involve dozens of high- 
energy charged and uncharged particles, poses a great technological problem 
for the experimenter. The detector must completely surround the reaction area, 
so that particles are recorded no matter what direction they travel after the 
reaction. The particles must produce visible tracks in the detector, so that their 
identity and direction of travel can be determined. The detector must provide 
sufficient mass to stop the particles and measure their energy. A magnetic field 
must be present, so that the resulting curved trajectory of a charged particle 
can be used to determine its momentum and the sign of its charge. Figure 14.3 
shows tracks left in a bubble chamber, a large tank filled with liquid hydrogen 
in which the passage of a charged particle causes microscopic bubbles resulting 
from the ionization of the hydrogen atoms. The bubbles can be illuminated and 
photographed to reveal the tracks. Figure 14.4 shows a large detector system 
that is used both to display the tracks of particles and to measure their energies; 
Figure 14.5 shows a sample of the results that are obtained with this type 
of detector. 

FIGURE 14.3 A bubble chamber photograph of a reaction between particles.
At right is shown a diagram indicating the particles that participate in the
reaction. An incident pion collides with a proton in the liquid hydrogen,
producing a $K^0$ and a $\Lambda^0$, both of which subsequently decay. (Photo courtesy
Lawrence Berkeley National Laboratory.)





FIGURE 14.4 A large detector at the Tevatron at Fermilab.
The proton and antiproton beams travel along
the central axis of the detector and collide in its interior.
The arches that have been removed on either side are
the calorimeter detectors that in operation are pushed
together so they can record the energies of all particles
that leave the reaction. The inner detectors record
the tracks of the particles. (Courtesy Fermi National
Accelerator Laboratory.)







FIGURE 14.5 A sample of the tracks left by a multitude 
of particles from a single proton-antiproton collision 
recorded with the detector of Figure 14.4. A magnetic 
field causes the tracks to curve; the radius of curvature 
determines the momentum of the particle, and the 
direction of curvature determines the sign of the charge 
of the particle. The jets are showers of particles resulting 
from a quark or antiquark that is produced in the 
reaction. In this case the jets come from the top and 
antitop quarks. (Courtesy Fermi National Accelerator 
Laboratory.) 

FIGURE 14.6 Three possible decays
of an unknown particle into two $\pi$
mesons. The direction and momentum
of each $\pi$ meson are indicated.


From a careful analysis of the paths of particles, such as those of Figures 14.3 
or 14.5, we can deduce the desired quantities of mass, linear momentum, and 
energy. The other important property we would like to know is the lifetime of the 
decay of the product particles, because many of the products are often unstable. If 
we know the speed of a particle, we can find its lifetime by simply observing the 
length of its track in a bubble chamber photograph. (Even for uncharged particles, 
which leave no tracks, we can use this method to deduce the lifetime, because the 
subsequent decay of the uncharged particle into two charged particles defines the 
length of its path rather clearly, as shown in Figure 14.3.) 

This method works well if the lifetime is of the order of $10^{-10}$ s or so, such
that the particle leaves a track long enough to be measured (millimeters to
centimeters). With careful experimental technique and clever data analysis, this
can be extended to track lengths of the order of $10^{-6}$ m, and so lifetimes down
to about $10^{-16}$ s can be measured in this way (with a little help from the time
dilation factor). But many of our particles have lifetimes of only $10^{-23}$ s, and a
particle moving at even the speed of light travels only the diameter of a nucleus
in that time! How can we measure such a lifetime? Furthermore, how do we even
know such a particle exists at all? Consider the reaction


$\pi + p \to \pi + p + x$


where $x$ is an unknown particle with a lifetime of about $10^{-23}$ s, which decays
into two $\pi$ mesons according to $x \to \pi + \pi$. How do we distinguish the above
reaction from the reaction


$\pi + p \to \pi + p + \pi + \pi$


which leads to the same particles as actually observed in the laboratory? 

Experimental evidence suggests that the two $\pi$ mesons in this type of reaction
may combine for an instant ($10^{-23}$ s) to form an entity with all of the usual
properties of a particle—a definite mass, charge, spin, lifetime, etc. These states 
are known as resonance particles, and we now look at the indirect evidence from 
which we infer their existence. 

Suppose you receive a package in the mail from a friend. When you open 
it, you find it contains many small, irregular pieces of broken glass. How do 
you learn whether your friend sent you a beautiful glass vase that was broken in 
shipment or a package of broken glass as a practical joke? You try to put the pieces 
together! If the pieces fit together, it is a good assumption that the vase was once 
whole, although the mere fact that they fit together doesn’t prove that it was once 
whole. It’s just the simplest possible assumption consistent with our experience. 
(An alternative assumption that the pieces were manufactured separately and just 
happen by chance to fit together is highly improbable.) 

How then do we detect a “particle” that lives for only $10^{-23}$ s? We look at its
decay products (which live long enough to be seen in the laboratory), and putting 
the pieces back together, we infer that they once may have been a whole particle. 

For example, suppose in the laboratory we observe two $\pi$ mesons emitted
as shown in Figure 14.6. We measure the direction of travel and the linear
momentum of the $\pi$ mesons as shown. A second and a third event each produces
two $\pi$ mesons as also shown in the figure. Are these three events consistent with
the existence of the same resonance particle? 

Let us assume that in each case, a particle moving at an unknown speed decayed
into the two particles as shown, one with energy $E_1$ and momentum $\vec{p}_1$ and the
other with energy $E_2$ and momentum $\vec{p}_2$. Each decay must conserve energy and
momentum, so we can use the decay information to find the energy $E = E_1 + E_2$
and momentum $\vec{p} = \vec{p}_1 + \vec{p}_2$ of the decaying particle, and then we can find
its rest energy according to $mc^2 = \sqrt{E^2 - c^2 p^2} = \sqrt{(E_1 + E_2)^2 - c^2 |\vec{p}_1 + \vec{p}_2|^2}$.
Carrying out the calculation, we find that, for the decay shown in part (a) of
Figure 14.6, $mc^2 = 764$ MeV, while for part (b), $mc^2 = 775$ MeV. It is therefore
possible that these two events result from the decays of identical particles. Part
(c) of the figure gives $mc^2 = 498$ MeV, which differs considerably from parts
(a) and (b).
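The "putting the pieces back together" step can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions: the momenta in Figure 14.6 are not reproduced here, so the sample event (two pions emitted back to back with 361 MeV/c each), the rounded pion rest energy of 139.6 MeV, and the helper names `invariant_mass` and `boost_x` are all hypothetical. Momenta are in MeV/c and energies in MeV, so factors of $c$ drop out.

```python
import math

M_PI = 139.6  # charged pion rest energy in MeV (rounded)


def invariant_mass(p1, p2, m1=M_PI, m2=M_PI):
    """Rest energy (MeV) of a parent that decayed into two particles.

    p1, p2 are momentum vectors (px, py, pz) in MeV/c.
    Implements mc^2 = sqrt((E1 + E2)^2 - c^2 |p1 + p2|^2).
    """
    e1 = math.sqrt(sum(c * c for c in p1) + m1**2)
    e2 = math.sqrt(sum(c * c for c in p2) + m2**2)
    p_tot_sq = sum((a + b) ** 2 for a, b in zip(p1, p2))
    return math.sqrt((e1 + e2) ** 2 - p_tot_sq)


def boost_x(p, m, beta):
    """Momentum of the same particle seen from a frame moving along x."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    e = math.sqrt(sum(c * c for c in p) + m**2)
    return (gamma * (p[0] + beta * e), p[1], p[2])


# Hypothetical event: parent at rest, pions back to back along y.
pa, pb = (0.0, 361.0, 0.0), (0.0, -361.0, 0.0)
at_rest = invariant_mass(pa, pb)

# The same decay with the parent moving at 0.6c along x.
moving = invariant_mass(boost_x(pa, M_PI, 0.6), boost_x(pb, M_PI, 0.6))
print(at_rest, moving)  # both close to 774 MeV
```

The two results agree, which is the point of the method: the deduced rest energy does not depend on the unknown speed of the decaying particle.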

Of course, these two events are not sufficient to identify conclusively the 
existence of a resonance particle with a rest energy in the range of 770 MeV. It 
could be a mere accident, just like the chance fitting together of two pieces of 
broken glass. What is needed is a large (statistically significant) number of events, 
in which we can combine the energy and momenta of the two emitted $\pi$ mesons
in such a way that the deduced mass of the resonance particle is always the same.
Figure 14.7 is an example of such a result. There is a background of events with
a continuous distribution of energies, like beta decay electrons; these come from
events like part (c) of Figure 14.6. There is also present a very prominent peak
at 775 MeV. We identify this energy as the rest energy of the resonance particle,
which is known as the $\rho$ (rho) meson. (How do we know it is a meson? It must
be a strongly interacting particle, because it decays so rapidly. Therefore the only 
possibilities are mesons, with integral spin, or baryons, with half-integral spin. 
Pi mesons have integral spin, and two integral spins can combine to give only 
another integral spin, so it must be a meson.) 









































[Figure 14.7 plot: vertical axis from 0 to 100; horizontal axis $[(E_1 + E_2)^2 - c^2 |\vec{p}_1 + \vec{p}_2|^2]^{1/2}$ (MeV) from 300 to 1500. The $\rho$-meson resonance peak from $\pi + p \to \pi + p + \rho$, $\rho \to \pi + \pi$ is marked with width $\Delta E = 150$ MeV.]


FIGURE 14.7 The resonance identified as the $\rho$ meson. The horizontal axis shows the
energy and momentum of the two decay $\pi$ mesons, combined to be equivalent to the mass
of the resonance particle.

We can also infer the lifetime of the particle from Figure 14.7. The particle
lives only for about $10^{-23}$ s, and so if we are to measure its rest energy we
have only $10^{-23}$ s in which to do it. But the uncertainty principle requires that
an energy measurement made in a time interval $\Delta t$ be uncertain by an amount
roughly $\Delta E = \hbar/\Delta t$. This energy uncertainty $\Delta E$ is observed as the width of the
peak in Figure 14.7. We don’t always deduce the same value 775 MeV for the rest
energy of the $\rho$ meson; sometimes our value is a bit larger and sometimes a bit
smaller. The width of the resonance peak tells us the lifetime of the particle. (The
width is not really precisely defined, but physicists usually take as the width the
interval between the two points where the height of the resonance is one-half its
maximum value above the background, as shown in Figure 14.7.) The width of
$\Delta E = 150$ MeV leads to a value of $\Delta t = \hbar/\Delta E = 4.4 \times 10^{-24}$ s for the lifetime
of the $\rho$ meson.
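The width-to-lifetime conversion is a one-line calculation. The sketch below simply evaluates $\Delta t = \hbar/\Delta E$ with $\hbar = 6.582 \times 10^{-22}$ MeV s; the function name `lifetime_from_width` is an assumption made for illustration.

```python
HBAR_MEV_S = 6.582e-22  # reduced Planck constant in MeV*s


def lifetime_from_width(width_mev):
    """Estimate a resonance lifetime (s) from its width (MeV): dt = hbar / dE."""
    return HBAR_MEV_S / width_mev


print(lifetime_from_width(150.0))  # about 4.4e-24 s for the rho meson width
```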


14.5 ENERGY AND MOMENTUM IN PARTICLE DECAYS





In analyzing the decays and reactions of elementary particles, we apply many 
of the same laws that we used for nuclear decays and reactions: energy, linear 
momentum, and total angular momentum must be conserved, and the total value 
of the quantum numbers associated with electric charge, lepton number, and 
baryon number (which we previously called nucleon number) must be the same 
before and after the decay or reaction. In reactions of elementary particles, we 
are often concerned with the production of new varieties of particles. The energy 
necessary to manufacture these particles comes from the kinetic energy of the 
reaction constituents (usually the incident particle); this energy is usually quite 
large (hence the name high-energy physics for this type of research), so relativistic 
equations must be used for energy and momentum. 

The decays of elementary particles can be analyzed in a way similar to the 
decays of nuclei, following the same two basic rules: 


1. The energy available for the decay is the difference in rest energy between 
the initial decaying particle and the particles that are produced in the decay. 
By analogy with our study of nuclear decays, we call this the Q value:


$Q = (m_i - m_f)c^2$ (14.1)


where $m_i c^2$ is the rest energy of the initial particle and $m_f c^2$ is the total rest
energy of all the final product particles. (Of course, the decay will occur only 
if Q is positive.) 

2. The available energy Q is shared as kinetic energy of the decay products in 
such a way as to conserve linear momentum. As in the case of nuclear decays, 
for a decay of a particle at rest into two final particles, the particles have equal 
and opposite momenta, and we can find unique values for the energies of the 
two final particles. For decays into three or more particles, each particle has 
a spectrum or distribution of energies from zero up to some maximum value 
(as was the case with nuclear beta decay). 


Example 14.2

Compute the energies of the proton and $\pi$ meson that result
from the decay of a $\Lambda^0$ at rest.


Solution 
The decay process is $\Lambda^0 \to p + \pi^-$. Using the rest energies
from Tables 14.5 and 14.6, we have:

$Q = (m_{\Lambda^0} - m_p - m_{\pi^-})c^2$
$= 1116\ \text{MeV} - 938\ \text{MeV} - 140\ \text{MeV}$
$= 38\ \text{MeV}$

and so the total kinetic energy of the decay products
must be:

$K_p + K_\pi = 38\ \text{MeV}$

Using the relativistic formula for kinetic energy, we can
write this as

$K_p + K_\pi = \left(\sqrt{c^2 p_p^2 + m_p^2 c^4} - m_p c^2\right) + \left(\sqrt{c^2 p_\pi^2 + m_\pi^2 c^4} - m_\pi c^2\right) = 38\ \text{MeV}$

Conservation of momentum requires $p_p = p_\pi$. Substituting
for one of the unknown momenta in the above equation
and solving algebraically for the other, we obtain

$p_\pi = p_p = 101\ \text{MeV}/c$

The kinetic energies can be found by substituting these
momenta into the relativistic formula:

$K_\pi = 33\ \text{MeV}$ and $K_p = 5\ \text{MeV}$
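The algebra in this example can be checked numerically. The sketch below is an illustration, not the text's own method: it uses the standard closed-form solution for the momentum in a two-body decay at rest, $pc = \sqrt{[M^2 - (m_1 + m_2)^2][M^2 - (m_1 - m_2)^2]}/2M$ (all masses as rest energies), which is what "solving algebraically" leads to. The function name `two_body_decay` is an assumption.

```python
import math


def two_body_decay(M, m1, m2):
    """Decay of a particle at rest into two particles.

    All arguments are rest energies in MeV. Returns (pc, K1, K2) in MeV,
    where pc is the common momentum (times c) of the two products.
    """
    pc = math.sqrt((M**2 - (m1 + m2) ** 2) * (M**2 - (m1 - m2) ** 2)) / (2 * M)
    k1 = math.sqrt(pc**2 + m1**2) - m1
    k2 = math.sqrt(pc**2 + m2**2) - m2
    return pc, k1, k2


# Lambda0 -> p + pi-, with the rounded rest energies used above.
pc, k_p, k_pi = two_body_decay(1116.0, 938.0, 140.0)
print(round(pc), round(k_p), round(k_pi))  # 101 5 33
```

The two kinetic energies sum to the Q value of 38 MeV, and the light pion carries most of it because both particles share the same momentum.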





Example 14.3

What is the maximum kinetic energy of the electron emitted
in the decay $\mu^- \to e^- + \bar{\nu}_e + \nu_\mu$?


Solution 

The Q value for this decay is $Q = m_\mu c^2 - m_e c^2 = 105.2$ MeV, because the
neutrinos have negligible rest energy. If the $\mu^-$ is at rest, this energy is
shared by the electron and the neutrinos: $Q = K_e + E_{\bar{\nu}_e} + E_{\nu_\mu}$. When the
electron has its maximum kinetic energy, the two neutrinos
carry away the minimum energy. This minimum cannot be
zero, because that would violate momentum conservation:
the electron would be carrying momentum that would not
be balanced by the neutrino momenta to give a net of zero
(the $\mu^-$ is at rest, so $\sum \vec{p}_i = \sum \vec{p}_f = 0$). We assume that
the electron has its maximum energy when the neutrinos
are emitted in exactly the opposite direction to the electron;
otherwise some of the decay energy is “wasted” by
providing transverse momentum components for the neutrinos,
and not as much energy will be available for the
electron. It does not matter which of the neutrinos carries
the energy and momentum (they may even share it in any
proportion), so we let $E_\nu$ and $p_\nu$ be the total neutrino energy
and momentum; these are of course related by $E_\nu = c p_\nu$
because neutrinos are presumed to be of negligible mass
and thus to travel at nearly the speed of light. If we let $E_e$
and $p_e$ represent the energy and momentum of the electron,
then linear momentum conservation gives

$p_e - p_\nu = 0$

For the electron, $E_e = \sqrt{c^2 p_e^2 + m_e^2 c^4}$. Together, these
equations give:

$Q = K_e + E_\nu = E_e - m_e c^2 + c p_\nu = E_e - m_e c^2 + c p_e$
$= E_e - m_e c^2 + \sqrt{E_e^2 - m_e^2 c^4}$

Solving, we find $E_e = Q^2/2m_\mu c^2 + m_e c^2$ and so

$K_e = E_e - m_e c^2 = Q^2/2m_\mu c^2 = 52.3\ \text{MeV}$

The original rest energy of the $\mu^-$ is shared essentially
equally by the electron and the two neutrinos in this case:
$(K_e)_{\max} \approx (E_\nu)_{\max} \approx Q/2$. Note how different this is from
the case of the beta decay of the neutron, where the heavy
proton resulting from the decay could absorb considerable
recoil momentum at a cost of very little energy, so nearly
all of the available energy could be given to the electron,
and in that case $(K_e)_{\max} \approx Q$.
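A quick numerical check of this result is sketched below. It evaluates $K_e = Q^2/2m_\mu c^2$ and then substitutes the answer back into the energy equation $Q = E_e - m_e c^2 + \sqrt{E_e^2 - m_e^2 c^4}$. The function name `max_electron_ke` and the rounded rest energies (105.7 MeV and 0.5 MeV) are assumptions made for illustration.

```python
import math


def max_electron_ke(m_parent, m_e):
    """Maximum electron kinetic energy (MeV) when the other decay products
    are massless and recoil opposite to the electron: K = Q^2 / (2 M)."""
    q = m_parent - m_e
    return q**2 / (2.0 * m_parent)


M_MU, M_E = 105.7, 0.5  # rounded rest energies in MeV
k_max = max_electron_ke(M_MU, M_E)

# Substitute back: Q should equal K_e + c*p_e with c*p_e = sqrt(E_e^2 - m_e^2).
e_e = k_max + M_E
q_check = k_max + math.sqrt(e_e**2 - M_E**2)
print(k_max, q_check)  # about 52.35 and 105.2
```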

Example 14.4

Find the maximum energy of the positrons and of the $\pi$
mesons produced in the decay $K^+ \to \pi^0 + e^+ + \nu_e$.


Solution 
The Q value for this decay is 


$Q = (m_K - m_\pi - m_e)c^2 = 494\ \text{MeV} - 135\ \text{MeV} - 0.5\ \text{MeV} = 358.5\ \text{MeV}$

This energy must be shared among the three products:

$Q = K_\pi + K_e + E_\nu$

The positron and $\pi$ meson have their maximum energies
when the neutrino has negligible energy: $Q = K_\pi + K_e$,
and conservation of momentum in this case (if the neutrino
has negligible momentum) requires $p_\pi = p_e$. Using
relativistic kinetic energy, we have

$Q = K_\pi + K_e = \sqrt{(pc)^2 + (m_\pi c^2)^2} - m_\pi c^2 + \sqrt{(pc)^2 + (m_e c^2)^2} - m_e c^2$

where $p = p_\pi = p_e$. Inserting the numbers, we obtain

$494\ \text{MeV} = \sqrt{(pc)^2 + (135\ \text{MeV})^2} + \sqrt{(pc)^2 + (0.5\ \text{MeV})^2}$

Clearing the two radicals involves quite a bit of algebra,
but we can simplify the problem if we inspect this
expression and notice that the solution must have a large
value of $pc$, certainly greater than 100 MeV. (Otherwise
the two terms could not sum to nearly 500 MeV.) Thus
$(pc)^2 \gg (0.5\ \text{MeV})^2$, and we can neglect the electron rest
energy term in the second radical, which simplifies the
equation somewhat:

$494\ \text{MeV} = \sqrt{(pc)^2 + (135\ \text{MeV})^2} + pc$

Solving, we find $pc = 229$ MeV, which gives $(E_e)_{\max} = 229$ MeV
and $(E_\pi)_{\max} = 266$ MeV. Figure 14.8 shows the
observed energy spectra of $e^+$ and $\pi^0$ from the $K^+$ decay,
and the energy maxima are in agreement with the calculated
values. (The shapes of the energy distributions are
determined by statistical factors, as in the case of nuclear
beta decay. The statistical factors are different for $e^+$ and
$\pi^0$, because the $\pi^0$ also has its maximum energy when the
$e^+$ appears at rest and the $\nu$ carries the recoil momentum.)

You should repeat this calculation and convince yourself
that (1) the $\pi^0$ has its maximum energy also when
$K_e = 0$ ($E_e = m_e c^2$) and (2) the $e^+$ does not have its
maximum energy when $K_\pi = 0$.
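The two-radical equation can also be solved numerically without neglecting the electron rest energy, which confirms that the approximation is harmless. The sketch below uses simple bisection; the function name `solve_pc` and the tolerance are assumptions made for illustration.

```python
import math


def solve_pc(total, m_a, m_b, tol=1e-9):
    """Find pc (MeV) such that sqrt(pc^2 + m_a^2) + sqrt(pc^2 + m_b^2) = total.

    Bisection works because the left side increases steadily with pc.
    Requires total > m_a + m_b.
    """
    def excess(pc):
        return math.hypot(pc, m_a) + math.hypot(pc, m_b) - total

    lo, hi = 0.0, total
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if excess(mid) > 0:
            hi = mid
        else:
            lo = mid
    return 0.5 * (lo + hi)


pc = solve_pc(494.0, 135.0, 0.5)
e_pi = math.hypot(pc, 135.0)
e_e = math.hypot(pc, 0.5)
print(pc, e_e, e_pi)  # roughly 228.6, 228.6, and 265.4 MeV
```

The exact solution differs from the rounded values 229 MeV and 266 MeV quoted above by less than 1 MeV.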

FIGURE 14.8 The spectrum of positrons and $\pi$ mesons from
the decay of the $K^+$ meson.





14.6 ENERGY AND MOMENTUM IN PARTICLE REACTIONS





The basic experimental technique of particle physics consists of studying the product
particles that result from a collision between an incident particle (accelerated
to high energies) and a target particle (often at rest). The kinematics of the reaction
process must be analyzed using relativistic formulas, because the kinetic energies
of the particles are usually comparable to or greater than their rest energies. In 
this section we derive some of the relationships that are needed to analyze these 
reactions, using the formulas for relativistic kinematics we obtained in Chapter 2. 
An important purpose of these reactions is the production of new varieties of 
particles, so we concentrate on calculating the threshold energy needed to produce 
these particles. (You might find it helpful to review the discussion in Chapter 13 
on nonrelativistic reaction thresholds.) 
Consider the following reaction: 


$m_1 + m_2 \to m_3 + m_4 + m_5 + \cdots$


where the $m$’s represent both the particles and their masses. Any number of
particles can be produced in the final state. Here $m_1$ is the incident particle,
which has total energy $E_1$, kinetic energy $K_1 = E_1 - m_1 c^2$, and momentum
$c p_1 = \sqrt{E_1^2 - m_1^2 c^4}$ in the laboratory frame of reference. The target particle $m_2$
is at rest in the laboratory. Figure 14.9 illustrates this reaction in the laboratory
frame of reference. 

Just as we did for nuclear reactions, we define the Q value to be the difference
between the initial and final rest energies:


$Q = (m_i - m_f)c^2 = [m_1 + m_2 - (m_3 + m_4 + m_5 + \cdots)]c^2$ (14.2)


If Q is positive, rest energy is turned into kinetic energy, so that the product
particles $m_3, m_4, m_5, \ldots$ have more combined kinetic energy than the initial
particles $m_1$ and $m_2$. If Q is negative, some of the initial kinetic energy of $m_1$ is
turned into rest energy.


FIGURE 14.9 A reaction between particles in the laboratory reference
frame. [Figure labels: before the reaction, $m_1$ has energy $E_1$ and $m_2$ has $E_2 = m_2 c^2$.]


Example 14.5

Compute the Q values for the reactions (a) $\pi^- + p \to K^0 + \Lambda^0$;
(b) $K^- + p \to \Lambda^0 + \pi^0$.


Solution

(a) Using rest energies from Tables 14.5 and 14.6,

$Q = [m_{\pi^-} + m_p - (m_{K^0} + m_{\Lambda^0})]c^2$
$= 140\ \text{MeV} + 938\ \text{MeV} - 498\ \text{MeV} - 1116\ \text{MeV}$
$= -536\ \text{MeV}$

This reaction has a negative Q value, and energy must be
supplied in the form of initial kinetic energy to produce the
additional rest energy of the products.

(b)

$Q = [m_{K^-} + m_p - (m_{\Lambda^0} + m_{\pi^0})]c^2$
$= 494\ \text{MeV} + 938\ \text{MeV} - 1116\ \text{MeV} - 135\ \text{MeV}$
$= 181\ \text{MeV}$

A positive Q value indicates that there is enough rest energy
in the initial particles to produce the final particles; in fact
there is 181 MeV of energy (plus the kinetic energy of the
incident particle) left over for kinetic energy of the $\Lambda^0$ and
$\pi^0$.




Threshold Energy 


When the Q value is negative, there is a minimum kinetic energy that $m_1$ must
have in order to initiate the reaction. As in the non-relativistic nuclear physics
case, this threshold kinetic energy $K_{\mathrm{th}}$ is larger than the magnitude of Q. The Q
value is the energy necessary to create the additional mass of the product particles,
but to conserve momentum the product particles cannot be formed at rest, so
the threshold energy must not only create the additional particles but must also
give them sufficient kinetic energy so that linear momentum is conserved in the
reaction.

FIGURE 14.10 The reaction of
Figure 14.9 when $m_1$ has the threshold
kinetic energy. The product particles
move together as a unit in the direction
of the original momentum. [Figure panels: Before, with $E_1$ and $E_2 = m_2 c^2$; After.]

If Figure 14.9 represents a reaction with a negative Q value, clearly the reaction
is not being done at the threshold kinetic energy. In the reaction as it is drawn,
not only have the new particles been created, they have been given both forward
momentum (to the right in the figure), which is necessary to conserve the initial
momentum of $m_1$, as well as transverse momentum. This transverse momentum,
which must sum to zero in order to conserve momentum, is not necessary either 
to create the particles or to satisfy conservation of momentum. At the minimum 
or threshold condition, this transverse momentum is zero. 

Also at threshold, the most efficient way to provide momentum to the final 
particles is to have them all moving together with the same speed, as in 
Figure 14.10. (This is equivalent to creating the particles at rest if we examine 
the collision from a reference frame in which the total initial momentum is zero, 
such as in a head-on collision of two particles.) Let’s represent the bundle of 
final particles, all moving as a unit, as a total mass $M$. Then conservation of
momentum ($p_{\text{initial}} = p_{\text{final}}$) gives $p_1 = p_M$ and conservation of total relativistic
energy ($E_{\text{initial}} = E_{\text{final}}$) gives $E_1 + E_2 = E_M$, where $p_M$ and $E_M$ represent the
momentum and total relativistic energy of the final bundle of particles. Then


$\sqrt{(p_1 c)^2 + (m_1 c^2)^2} + m_2 c^2 = \sqrt{(p_M c)^2 + (Mc^2)^2} = \sqrt{(p_1 c)^2 + (Mc^2)^2}$ (14.3)





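Squaring both sides and solving, we obtain


$\sqrt{(p_1 c)^2 + (m_1 c^2)^2} = \dfrac{(Mc^2)^2 - (m_1 c^2)^2 - (m_2 c^2)^2}{2 m_2 c^2}$ (14.4)


The threshold kinetic energy of $m_1$ is then


$K_{\mathrm{th}} = E_1 - m_1 c^2 = \sqrt{(p_1 c)^2 + (m_1 c^2)^2} - m_1 c^2$

$= \dfrac{(Mc^2)^2 - (m_1 c^2)^2 - (m_2 c^2)^2}{2 m_2 c^2} - m_1 c^2$ (14.5)

$= \dfrac{(Mc^2 - m_1 c^2 - m_2 c^2)(Mc^2 + m_1 c^2 + m_2 c^2)}{2 m_2 c^2}$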





With $Q = m_1 c^2 + m_2 c^2 - Mc^2$ and $M = m_3 + m_4 + m_5 + \cdots$, this becomes


$K_{\mathrm{th}} = (-Q)\,\dfrac{m_1 + m_2 + m_3 + m_4 + m_5 + \cdots}{2 m_2}$ (14.6)


This can also be written as


$K_{\mathrm{th}} = (-Q)\,\dfrac{\text{total mass of all particles involved in reaction}}{2 \times \text{mass of target particle}}$ (14.7)

In the limit of low speeds, the relativistic threshold formula reduces to the non- 
relativistic formula for nuclear reactions derived in Chapter 13 (see Problem 20). 
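Equation 14.6 translates directly into a small function. The sketch below is an illustration under stated assumptions: the name `threshold_ke` is hypothetical, all inputs are rest energies in MeV, and returning zero for a reaction with non-negative Q is a convention chosen here, not something the text specifies. The sample reaction is $\pi^- + p \to K^0 + \Lambda^0$, whose Q value of $-536$ MeV was computed in Example 14.5.

```python
def threshold_ke(m1, m2, products):
    """Threshold kinetic energy (MeV) of the incident particle m1 striking
    a target m2 at rest, following Eq. 14.6.

    m1, m2, and each entry of products are rest energies in MeV.
    """
    q = m1 + m2 - sum(products)
    if q >= 0:
        return 0.0  # assumed convention: no threshold when Q is not negative
    total = m1 + m2 + sum(products)
    return -q * total / (2.0 * m2)


# pi- + p -> K0 + Lambda0, with rounded rest energies.
k_th = threshold_ke(140.0, 938.0, [498.0, 1116.0])
print(k_th)  # about 769 MeV, noticeably larger than |Q| = 536 MeV
```

The difference between the threshold of about 769 MeV and the 536 MeV needed to create the new rest energy is the kinetic energy the products must carry to conserve momentum.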

Example 14.6

Calculate the threshold kinetic energy to produce $\pi$ mesons
from the reaction $p + p \to p + p + \pi^0$.

Solution
The Q value is

$Q = m_p c^2 + m_p c^2 - (m_p c^2 + m_p c^2 + m_\pi c^2)$
$= -m_\pi c^2 = -135\ \text{MeV}$

Using Eq. 14.7 we can find the threshold kinetic energy:

$K_{\mathrm{th}} = (-Q)\,\dfrac{4m_p + m_\pi}{2m_p}$
$= (135\ \text{MeV})\,\dfrac{4(938\ \text{MeV}) + 135\ \text{MeV}}{2(938\ \text{MeV})} = 280\ \text{MeV}$

Such energetic protons are produced at many accelerators
throughout the world, and as a result the properties of the
$\pi$ mesons can be carefully investigated.





Example 14.7

In 1956 an experiment was performed at Berkeley to search
for the antiproton, which could be produced in the reaction
$p + p \to p + p + p + \bar{p}$. What is the threshold energy for
this reaction?

Solution
The rest energy of the antiproton is identical to the rest
energy of the proton (938 MeV), so the Q value is

$Q = m_p c^2 + m_p c^2 - 4(m_p c^2) = -2m_p c^2$

Thus

$K_{\mathrm{th}} = (2m_p c^2)\,\dfrac{6m_p}{2m_p} = 6m_p c^2 = 5628\ \text{MeV} = 5.628\ \text{GeV}$

For the discovery of the antiproton produced in this reaction,
Owen Chamberlain and Emilio Segrè were awarded
the Nobel Prize in physics in 1959.





It is interesting to compute the “efficiency” of these reactions—that is, how 
much of the initial kinetic energy we supply actually goes into producing the 
final particles, and how much is “wasted” in the laboratory kinetic energies of the 
reaction products. In the first example, we supply 280 MeV of kinetic energy to 
produce 135 MeV of rest energy, for an efficiency of about 50%. In the second 
example, $6m_p c^2$ of kinetic energy produces only $2m_p c^2$ of rest energy, for an
efficiency of only 33%. As the rest energies of the product particles become 
larger, the efficiency decreases, and relatively more energy must be supplied. For 
example, to produce a particle with a rest energy of 50 GeV in a proton-proton 
collision, we need to supply about 1250 GeV of initial kinetic energy. Only 4% of 
the energy supplied actually goes into producing the new particles; the remaining 
96% must go to kinetic energy of the products in order to balance the large initial 
momentum of the incident particle. To produce a 100-GeV particle requires not 
twice as much energy, but four times as much. This is obviously not a pleasant 
situation for particle physicists, who must build increasingly more powerful 
accelerators to accomplish their goals of producing more massive particles. 

One way out of this difficulty would be to do an experiment in which two
particles with equal and opposite momentum collide head-on. In effect, we would
be doing this experiment in the center-of-mass (CM) frame, where at threshold the
production of new particles is 100% efficient: none of the initial kinetic energy
goes into kinetic energy of the products, which are produced at rest in the CM
frame. Thus a 50-GeV particle could be produced by a head-on collision between
two protons each with as little as 25 GeV of kinetic energy. Of course, this great gain in
efficiency is at a cost of the technological difficulty of making such collisions occur.

FIGURE 14.11 The relationship between electric charge and strangeness
for the spin-0 mesons.

FIGURE 14.12 The relationship between electric charge and strangeness
for the spin-$\frac{1}{2}$ baryons.

FIGURE 14.13 The relationship between electric charge and strangeness
for the spin-$\frac{3}{2}$ baryons.

There are now colliding beam accelerators in operation, in which beams of 
particles (such as electrons or protons) can occasionally be made to collide. 
For example, in the Fermilab accelerator (Figure 14.1), beams of protons and 
antiprotons (each of energy 1 TeV = 1000 GeV) circulate around the ring in 
opposite directions and collide twice during each revolution. The Large Hadron 
Collider (hadron meaning strongly interacting particles), which is on the border 
between Switzerland and France, became operational in 2009; it collides two 
beams of protons each at a design energy of 7 TeV in order to search for new particles
in an even higher range of rest energies. Other colliding beam accelerators bring 
together electrons and positrons at energies of 50 to 100 GeV. In each case, all of 
the available energy can go into the production of new particles. 
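The advantage of colliding beams can be made quantitative with the same invariant combination $E^2 - (pc)^2$ used in Eq. 14.3. The sketch below is an illustration under stated assumptions: the function names are hypothetical, energies are in GeV, and the proton rest energy is rounded to 0.938 GeV. It asks what kinetic energy a proton striking a stationary proton would need in order to match the total CM energy of two 1-TeV beams colliding head-on.

```python
import math

M_P = 0.938  # proton rest energy in GeV (rounded)


def cm_energy_fixed_target(kinetic, m_beam=M_P, m_target=M_P):
    """Total CM energy (GeV) for a beam of given kinetic energy on a target at rest."""
    e_beam = kinetic + m_beam
    return math.sqrt(m_beam**2 + m_target**2 + 2.0 * m_target * e_beam)


def fixed_target_ke_for(cm_energy, m_beam=M_P, m_target=M_P):
    """Beam kinetic energy (GeV) on a fixed target that gives the requested CM energy."""
    e_beam = (cm_energy**2 - m_beam**2 - m_target**2) / (2.0 * m_target)
    return e_beam - m_beam


# Two beams of total energy 1000 GeV each, colliding head-on: CM energy 2000 GeV.
needed = fixed_target_ke_for(2000.0)
print(needed)  # about 2.1e6 GeV for a fixed-target experiment
```

A fixed-target machine would need roughly a thousand times the beam energy to reach the same CM energy, which is why colliding beams are used at the highest energies.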


14.7 THE QUARK STRUCTURE OF MESONS 
AND BARYONS 





Although the classifications and properties of the elementary particles seem like a 
complicated and disordered collection, there is an underlying order that suggests 
that a scheme of remarkable simplicity is at work. We can illustrate this order 
if we plot a diagram that has strangeness along the y axis and electric charge 
along the x axis. If the families of particles are placed in their proper locations on 
the graphs, regular geometrical patterns begin to emerge. Figures 14.11 to 14.13 
show such plots for the lower mass spin-0 mesons, the spin-$\frac{1}{2}$ baryons, and the
spin-$\frac{3}{2}$ baryons. In 1964, Murray Gell-Mann and George Zweig independently
and simultaneously recognized that such regular patterns are evidence of an 
underlying structure in the particles. They showed that they could duplicate these 
patterns if the mesons and baryons were composed of three fundamental particles, 
which soon became known as quarks. These three quarks, known as up (u), down 
(d), and strange (s), have the properties listed in Table 14.7. We will shortly see 
that we now believe that six quarks are necessary to account for all known mesons 
and baryons. 

Let us see how the quark model works in the case of the spin-0 mesons. The 
quarks have spin $\frac{1}{2}$, so the simplest scheme to form a spin-0 meson would be to
combine two quarks, with their spins directed oppositely. However, the mesons
have baryon number $B = 0$, while a combination of two quarks would have
$B = \frac{1}{3} + \frac{1}{3} = \frac{2}{3}$. A combination of a quark and an antiquark, on the other hand,
would have $B = 0$, because the antiquark has $B = -\frac{1}{3}$.

For example, suppose we combine a u quark with a $\bar{d}$ (“antidown”) quark,
obtaining the combination $u\bar{d}$. This combination has spin zero and electric charge
$\frac{2}{3}e + \frac{1}{3}e = +e$. (A d quark has charge $-\frac{1}{3}e$, so $\bar{d}$ has charge $+\frac{1}{3}e$.) The properties
of this combination are identical with the $\pi^+$ meson, and so we identify the $\pi^+$ with
the combination $u\bar{d}$. Continuing in this way, we find nine possible combinations
of one of the three original quarks from Table 14.7 with an antiquark, as listed in 
Table 14.8, and plotting those nine combinations on a graph of strangeness against 
electric charge, we obtain Figure 14.14, which looks identical to Figure 14.11. 


TABLE 14.7 Properties of the Three Original Quarks

Name | Symbol | Charge (e) | Spin ($\hbar$) | Baryon Number | Strangeness | Antiquark
Up | u | +2/3 | 1/2 | +1/3 | 0 | $\bar{u}$
Down | d | -1/3 | 1/2 | +1/3 | 0 | $\bar{d}$
Strange | s | -1/3 | 1/2 | +1/3 | -1 | $\bar{s}$

Values shown for the charge, baryon number, and strangeness are those for the quark;
values for the antiquark have the opposite sign.


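TABLE 14.8 Possible Quark-Antiquark Combinations

Combination | Charge (e) | Spin ($\hbar$) | Baryon Number | Strangeness
$u\bar{u}$ | 0 | 0, 1 | 0 | 0
$u\bar{d}$ | +1 | 0, 1 | 0 | 0
$u\bar{s}$ | +1 | 0, 1 | 0 | +1
$d\bar{u}$ | -1 | 0, 1 | 0 | 0
$d\bar{d}$ | 0 | 0, 1 | 0 | 0
$d\bar{s}$ | 0 | 0, 1 | 0 | +1
$s\bar{u}$ | -1 | 0, 1 | 0 | -1
$s\bar{d}$ | 0 | 0, 1 | 0 | -1
$s\bar{s}$ | 0 | 0, 1 | 0 | 0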
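The entries of Table 14.8 follow from simple bookkeeping: add the quark's quantum numbers and subtract those of the quark whose antiparticle is used. The sketch below illustrates this with exact fractions; the dictionary layout, the function name `meson`, and the `-bar` naming of the keys are assumptions made for this example.

```python
from fractions import Fraction
from itertools import product

# (charge in units of e, baryon number, strangeness) for each quark, as in Table 14.7
QUARKS = {
    "u": (Fraction(2, 3), Fraction(1, 3), 0),
    "d": (Fraction(-1, 3), Fraction(1, 3), 0),
    "s": (Fraction(-1, 3), Fraction(1, 3), -1),
}


def meson(quark, antiquark_of):
    """Quantum numbers of a quark combined with the antiquark of `antiquark_of`.

    Antiquark values have the opposite sign, so they are subtracted.
    """
    qc, qb, qs = QUARKS[quark]
    ac, ab, a_s = QUARKS[antiquark_of]
    return (qc - ac, qb - ab, qs - a_s)


table = {q + a + "-bar": meson(q, a) for q, a in product("uds", repeat=2)}
for name, (charge, baryon, strangeness) in table.items():
    print(name, charge, baryon, strangeness)
```

Every one of the nine combinations comes out with integer charge and baryon number zero, as a meson requires.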





The baryons have $B = +1$ and spin $\frac{1}{2}$ or $\frac{3}{2}$, which suggests immediately that
three quarks make a baryon. The 10 possible combinations of the three original
quarks are listed in Table 14.9, and we can arrange them into two patterns as
shown in Figures 14.15 and 14.16, which are identical to those for the spin-$\frac{1}{2}$ and
spin-$\frac{3}{2}$ baryons.

Using the quark model, we can analyze the decays and reactions of the 
elementary particles, based on two rules: 


1. Quark-antiquark pairs can be created from energy quanta, and conversely can 
annihilate into energy quanta. For example, 


energy $\to u + \bar{u}$ or $d + \bar{d} \to$ energy
This energy can be in the form of gamma rays (as in electron-positron 
annihilation), or else it can be transferred to or from other particles in the 
decay or reaction. 

2. The weak interaction can change one type of quark into another through 
emission or absorption of a $W^+$ or $W^-$, for example $s \to u + W^-$. The $W$
then decays by the weak interaction, such as $W^- \to \mu^- + \bar{\nu}_\mu$ or $W^- \to d + \bar{u}$.
The strong and electromagnetic interactions cannot change one type of quark 
into another. 

TABLE 14.9 Possible Three-Quark Combinations

Combination | Charge (e) | Spin ($\hbar$) | Baryon Number | Strangeness
uuu | +2 | 3/2 | +1 | 0
uud | +1 | 1/2, 3/2 | +1 | 0
udd | 0 | 1/2, 3/2 | +1 | 0
uus | +1 | 1/2, 3/2 | +1 | -1
uss | 0 | 1/2, 3/2 | +1 | -2
uds | 0 | 1/2, 3/2 | +1 | -1
ddd | -1 | 3/2 | +1 | 0
dds | -1 | 1/2, 3/2 | +1 | -1
dss | -1 | 1/2, 3/2 | +1 | -2
sss | -1 | 3/2 | +1 | -3
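That there are exactly 10 three-quark combinations, with the charges and strangeness values listed in Table 14.9, can be confirmed by enumeration. The sketch below is an illustration only; the variable names are assumptions, and the order of the quarks within a combination is treated as irrelevant.

```python
from fractions import Fraction
from itertools import combinations_with_replacement

CHARGE = {"u": Fraction(2, 3), "d": Fraction(-1, 3), "s": Fraction(-1, 3)}
STRANGENESS = {"u": 0, "d": 0, "s": -1}

# Unordered selections of three quarks from u, d, s (repeats allowed).
baryons = {}
for combo in combinations_with_replacement("uds", 3):
    name = "".join(combo)
    charge = sum(CHARGE[q] for q in combo)
    strangeness = sum(STRANGENESS[q] for q in combo)
    baryons[name] = (charge, strangeness)

for name, (charge, strangeness) in baryons.items():
    print(name, charge, strangeness)
print(len(baryons))  # 10
```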











Example 14.8

Analyze (a) the reaction $\pi^- + p \to \Lambda^0 + K^0$ and (b) the
decay $\pi^+ \to \mu^+ + \nu_\mu$, in terms of the constituent quarks.

Solution
(a) The reaction $\pi^- + p \to \Lambda^0 + K^0$ can be rewritten as
follows:

$d\bar{u} + uud \to uds + d\bar{s}$

Each side contains one u quark and two d quarks, which
don’t change in the reaction. Removing these “spectator”
quarks from each side of the reaction, we are left with the
remaining transformation:

$u + \bar{u} \to s + \bar{s}$

The u and $\bar{u}$ annihilate, and from the resulting energy s and
$\bar{s}$ are created.

(b) The $\pi^+$ has the quark composition $u\bar{d}$. There are no
quarks in the final state ($\mu^+ + \nu_\mu$), so we must find a way
to get rid of the quarks. One possible way is to change the
u quark into a d quark: $u \to d + W^+$. The net process can
thus be written as

$u\bar{d} \to d + \bar{d} + W^+$

with the products then undergoing the following processes
to produce the final observed particles:

$d + \bar{d} \to$ energy and $W^+ \to \mu^+ + \nu_\mu$
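
The bookkeeping in part (a) can be sketched in a few lines of Python. This is only an illustration under assumed notation that does not appear in the text: a lowercase letter stands for a quark and the matching uppercase letter for its antiquark, so `"dU"` means $d\bar{u}$. The dictionary `QUARK_CONTENT` and the function names are hypothetical.

```python
from collections import Counter

# Assumed notation (not from the text): lowercase = quark,
# uppercase = the corresponding antiquark, so "dU" is d + anti-u.
QUARK_CONTENT = {
    "pi-": "dU",
    "p": "uud",
    "Lambda0": "uds",
    "K0": "dS",
}


def quark_count(particles):
    """Count every quark and antiquark on one side of a reaction."""
    total = Counter()
    for name in particles:
        total.update(QUARK_CONTENT[name])
    return total


def remove_spectators(initial, final):
    """Strip quarks common to both sides; return what is left of each side."""
    left, right = quark_count(initial), quark_count(final)
    spectators = left & right  # multiset intersection
    return left - spectators, right - spectators, spectators


def net_flavor(counts):
    """Quarks minus antiquarks for each flavor, omitting zero entries."""
    net = Counter()
    for symbol, n in counts.items():
        net[symbol.lower()] += n if symbol.islower() else -n
    return {flavor: n for flavor, n in net.items() if n != 0}


left, right, spectators = remove_spectators(["pi-", "p"], ["Lambda0", "K0"])
print(dict(left), "->", dict(right), "| spectators:", dict(spectators))
```

For this reaction the remaining quarks on the left are u and anti-u, and on the right s and anti-s, with u, d, d as spectators. The net flavor content is the same on both sides, as expected when no weak interaction is involved.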





You may have noticed that some of the heavier mesons listed in Table 14.5
were not included in Figure 14.11, and they cannot be accounted for among the
quark-antiquark combinations listed in Table 14.8. Where do these particles fit in
our scheme?

In 1974, a new meson called $J/\psi$ was discovered at a rest energy of 3.1 GeV.
(It was given different names J and $\psi$ by the two competing experimental groups
that first reported its discovery.) This new meson was expected to decay to lighter
mesons in a characteristic strong interaction time of around $10^{-23}$ s. Instead, its
lifetime was stretched by 3 orders of magnitude to about $10^{-20}$ s, and its decay
products were $e^+$ and $e^-$, which are more characteristic of an electromagnetic
process. Why is the rapid, strong interaction decay path blocked for this particle?
This was soon explained by assuming the $J/\psi$ to be composed of a new quark c,
called the charm quark, and its antiquark $\bar{c}$. The existence of the c quark had
been predicted 4 years earlier as a way to explain the failure to observe the decay
$K^0 \to \mu^+ + \mu^-$, which violates no previously known law but is nevertheless not
observed.
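
To get a feeling for what a thousandfold longer lifetime means, the lifetimes can be converted into energy widths. The sketch below assumes the uncertainty-principle estimate width = ħ/lifetime with ħ ≈ 6.582 × 10⁻²² MeV·s; the function name `decay_width_mev` is hypothetical, and the lifetimes are the order-of-magnitude values quoted in the text.

```python
HBAR_MEV_S = 6.582e-22  # reduced Planck constant in MeV*s


def decay_width_mev(lifetime_s):
    """Energy width (MeV) from the estimate width = hbar / lifetime."""
    return HBAR_MEV_S / lifetime_s


strong_width = decay_width_mev(1e-23)  # typical strong decay: about 66 MeV
jpsi_width = decay_width_mev(1e-20)    # J/psi-like lifetime: about 0.066 MeV

print(f"strong decay width ~ {strong_width:.0f} MeV")
print(f"J/psi-scale width  ~ {jpsi_width * 1000:.0f} keV")
```

A state with a lifetime near $10^{-20}$ s is therefore about a thousand times narrower in energy than a typical strongly decaying meson.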
The c quark, which carries a charge of $+\frac{2}{3}e$, has a property, charm, that operates
somewhat like strangeness. We assign a charm quantum number C = +1 to the
c quark (and assign C = −1 to its antiquark $\bar{c}$). All other quarks are assigned
C = 0. We can now construct a new set of mesons by combining the c quark with
the $\bar{u}$, $\bar{d}$, and $\bar{s}$ antiquarks and by combining the $\bar{c}$ antiquark with the u, d, and s
quarks. Instead of nine spin-0 mesons, there are now 16, and the two-dimensional
graphs of Figures 14.11 and 14.14 must be extended to a third dimension to show
the C axis (Figure 14.17). All of these new mesons, called D, have been observed
in high-energy collision experiments. Baryons containing this new quark have
also been discovered, analogous to the $\Lambda$, $\Sigma$, $\Xi$, and $\Omega$ particles but with an s
quark replaced by a c quark.

In 1977, the same sequence of events was repeated with another meson, $\Upsilon$
(upsilon). The rest energy was determined to be about 9.5 GeV, and again its
decay was slowed to about $10^{-20}$ s and occurred into $e^+ + e^-$ rather than into
mesons. Once again, a new quark was postulated: the b (bottom) quark with a new
quantum “bottomness” number B = −1 and a charge of $-\frac{1}{3}e$. (The letter B is used
to represent baryon number as well as bottomness. It should always be apparent
from the discussion which one is meant.) The $\Upsilon$ is assigned as the combination
$b\bar{b}$. Many new particles containing the b quark have been discovered, including
B mesons (in which a b quark is paired with a different antiquark) and baryons
similar to $\Lambda$, $\Sigma$, and $\Xi$ with a b quark replacing one of the s quarks.

A sixth quark was discovered in 1994 in proton-antiproton collisions at 
Fermilab. These collisions created this new quark and its antiquark, both of which 
decayed into a shower of secondary particles (as in Figure 14.5). By measuring 
the energy and momentum of the secondary particles, the experimenters were 
able to determine the mass of the new quark to be 172 GeV (roughly the mass of 
a tungsten atom). This new quark is known as t (top) and has a new associated 
property of “topness” with a quantum number T = +1.
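
The comparison with a tungsten atom is easy to check. The sketch below assumes the conversion 1 u = 0.9315 GeV and a standard atomic weight of 183.84 u for tungsten; neither number is given in this section, and the variable names are illustrative.

```python
ATOMIC_MASS_UNIT_GEV = 0.9315  # rest energy of 1 u in GeV (assumed value)
TOP_REST_ENERGY_GEV = 172.0    # from the text
TUNGSTEN_MASS_U = 183.84       # standard atomic weight of tungsten (assumed)

# Express the top quark rest energy in atomic mass units.
top_mass_u = TOP_REST_ENERGY_GEV / ATOMIC_MASS_UNIT_GEV
ratio = top_mass_u / TUNGSTEN_MASS_U

print(f"top quark mass ~ {top_mass_u:.1f} u, ratio to tungsten ~ {ratio:.3f}")
```

A single quark thus carries about as much mass as an atom containing 74 protons and about 110 neutrons.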

It may now seem that we are losing sight of our goal to achieve simplicity 
(to add the “bottomness” axis to Figure 14.17 we would need to depict a four- 
dimensional space!) and that we are moving toward replacing a complicated array 
of particles with an equivalently complicated array of quarks. However, there is 
good reason to believe that there are no more than six fundamental quarks. In the 
next section, we discuss how we are indeed on the path to a simple explanation of 
the fundamental particles. 




















FIGURE 14.17 The relationship among electric charge, strangeness, and charm
for the spin-0 mesons.
TABLE 14.10 Properties of the Quarks

Type | Symbol | Antiparticle | Charge (e) | Spin ($\hbar$) | Baryon Number | Rest Energy (MeV) | C | S | T | B
Up | u | $\bar{u}$ | +2/3 | 1/2 | +1/3 | 330 | 0 | 0 | 0 | 0
Down | d | $\bar{d}$ | −1/3 | 1/2 | +1/3 | 330 | 0 | 0 | 0 | 0
Charm | c | $\bar{c}$ | +2/3 | 1/2 | +1/3 | 1500 | +1 | 0 | 0 | 0
Strange | s | $\bar{s}$ | −1/3 | 1/2 | +1/3 | 500 | 0 | −1 | 0 | 0
Top | t | $\bar{t}$ | +2/3 | 1/2 | +1/3 | 172,000 | 0 | 0 | +1 | 0
Bottom | b | $\bar{b}$ | −1/3 | 1/2 | +1/3 | 4700 | 0 | 0 | 0 | −1





























Table 14.10 shows the six quarks and their properties. The masses of the quarks 
cannot be directly determined, because a free quark has yet to be observed. The 
rest energies shown in Table 14.10 are estimates based on the “apparent” masses 
that quarks have when bound in various particles. For example, the observed rest 
energy of the proton is the sum of the rest energies of its three quark constituents 
less the binding energy of the quarks. Since we don’t know the binding energy, 
we can’t determine the rest energy of a free quark. The rest energies shown in 
Table 14.10 are often called those of constituent quarks. 

The quark model does a great deal more than allow us to make geometrical 
arrangements of particles such as Figure 14.17. It can be used to explain many 
observed properties of the particles, such as their masses and magnetic moments, 
and to account for their decay lifetimes and reaction probabilities. Nevertheless, 
a free quark has never been observed, despite heroic experiments to search for
one. How can we be sure that quarks exist? In experiments that scatter high-energy
electrons from protons, we observe more particles scattered at large angles than 
we would expect if the electric charge of the proton were uniformly distributed 
throughout its volume, and from the analysis of the distribution of the scattered 
electrons we conclude that inside the proton are three point-like objects that are 
responsible for the scattering. This experiment is exactly analogous to Rutherford 
scattering, in which the presence of the nucleus as a compact object inside the atom 
was revealed by the distribution of scattered alpha particles at angles larger than 
expected. Like Rutherford’s experiment, the observed cross section depends on 
the electric charge of the object doing the scattering, and from these experiments 
we can deduce charges of magnitude $\frac{1}{3}e$ and $\frac{2}{3}e$ for these point-like objects. These
experiments give clear evidence for the presence of quarks inside the proton. 

We don’t yet know why free quarks have not been observed. Perhaps they 
are so massive that no accelerator yet built has enough energy to liberate one. 
Perhaps the force between quarks increases with distance (in contrast with 
electromagnetism or gravitation, which decrease with separation distance), so 
that an infinite amount of energy would be required to separate a quark from 
a nucleon. Or (as is now widely believed) perhaps the basic theory of quark 
structure forbids the existence of free quarks. 


Quarkonium 


The theoretical analysis of the structure of baryons poses mathematical difficulties 
that are characteristic of all three-body mechanical or quantum-mechanical 
systems. Instead, we can learn a bit about the interactions of quarks from 
examining the properties of two-body systems, especially the quark-antiquark
combinations in mesons. 

We'll look briefly at the combination of a quark with its own antiquark. The 
binding energies of quarks in mesons are very large (hundreds of MeV), so the 
quark-antiquark pairs of the light quarks (u, d, s) must be treated relativistically, 
because the binding energies and thus the kinetic energies in the bound state are 
roughly the same as the rest energies. However, for the more massive quarks (c, 
b, t), the binding energies are small compared with the rest energies and we can 
use nonrelativistic methods (such as the Schrödinger equation) for the analysis.

There is a well-studied analogy for the properties of a quark-antiquark combination.
When a positron travels through matter, before it annihilates it forms an atom-like
bound state with an electron. This bound electron-positron system is called
positronium. The positron and the electron each orbit about their center of mass, in
states that are similar to atomic states in hydrogen. We can label these states by the
values of their total spin angular momentum S and orbital angular momentum L, as
we did for atomic states in Chapter 8. (Don’t confuse these labels with strangeness
and lepton number.) The spins of the two particles can be parallel (hence with
total spin S = 1) or antiparallel (S = 0). The total orbital angular momentum of
the two particles about the center of mass can be L = 0 for s states, L = 1 for p
states, etc. Finally, the system can exist with different radial wave functions that
we can label with principal quantum number n = 1, 2, 3, . . . , exactly as we did in
the hydrogen atom. Figure 14.18a shows some of the bound states in positronium.

The positronium structure is very similar to the bound states in a quark-antiquark
system, which is correspondingly known as quarkonium. Figure 14.18b
shows the quarkonium excited states for the $c\bar{c}$ system, and the states for the $b\bar{b}$
system are shown in Figure 14.18c. There is a great similarity between the excited
states of the two quarkonium systems. Note in particular the states with S = 1
and L = 0 in $c\bar{c}$ and $b\bar{b}$. (These would be equivalent to the 1s, 2s, 3s, and 4s states
in a hydrogenic system. The lowest S = 1, L = 0 $c\bar{c}$ state is the $J/\psi$ meson, and
the corresponding $b\bar{b}$ state is the $\Upsilon$ meson.) Moreover, the overall structure of the
excited states of the quarkonium systems is very similar to that of positronium
(especially the n = 2 states of positronium and $c\bar{c}$).

FIGURE 14.18 The energy levels of (a) positronium ($e^+e^-$), (b) $c\bar{c}$ quarkonium, and (c) $b\bar{b}$ quarkonium. The
atom-like states are labeled with the value of the principal quantum number n. The zero of the energy scale is
at 2980 MeV for $c\bar{c}$ and 9389 MeV for $b\bar{b}$. The n = 2 states with S = 0, L = 0 and S = 0, L = 1 in $b\bar{b}$ have not
yet been discovered.

Knowledge of the excited states of quarkonium allows us to guess at an
effective potential energy for which we can solve the Schrödinger equation
to try to calculate the energies of the states. The similarity with positronium
certainly suggests that we try a Coulomb-like potential energy that depends on
1/r. However, it cannot be an electromagnetic interaction: the electromagnetic
interaction between two quarks separated by a distance of 0.5 fm (a typical size of
a meson) is at most about 1 MeV, which is less than 1% of the energy differences
of the quarkonium excited states shown in Figure 14.18. (Moreover, the $c\bar{c}$
electromagnetic interaction should be 4 times stronger than the $b\bar{b}$ electromagnetic
interaction, but there is no evidence of this in Figure 14.18.) The 1/r interaction
grows weaker with increasing separation, so we must add another term that grows
stronger with separation, which accounts for the failure to produce a free quark.
Several different potential energies have been tried for this additional term, the
simplest being a term that is linear in the separation r. The net effective potential
energy is then of the form

$U(r) = -\frac{a}{r} + br$    (14.8)

The Schrödinger equation can be solved numerically for this potential energy,
with the constants a and b adjusted to give best agreement with experiment. The
constant b turns out to have a value of about 1 GeV/fm. This large value is
consistent with the failure to observe a free quark: to separate the quarks in a
meson even to an atom-sized distance would require about $10^5$ GeV, far greater
than the beam energy of any accelerator.
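
The numbers quoted in this discussion can be checked with a short calculation. The sketch below is an illustration under stated assumptions: it uses the Coulomb constant $e^2/4\pi\epsilon_0$ ≈ 1.44 MeV·fm, quark charges of magnitude 2/3 (c) and 1/3 (b) in units of e, and an atomic size of 0.1 nm = $10^5$ fm. The function names are hypothetical.

```python
COULOMB_CONSTANT_MEV_FM = 1.44  # e^2 / (4 pi epsilon_0) in MeV*fm


def electromagnetic_energy_mev(q1, q2, r_fm):
    """Magnitude of the Coulomb energy for charges q1, q2 (units of e)."""
    return abs(q1 * q2) * COULOMB_CONSTANT_MEV_FM / r_fm


def linear_term_gev(b_gev_per_fm, r_fm):
    """Energy stored in the linear term b*r of Eq. 14.8."""
    return b_gev_per_fm * r_fm


MESON_SIZE_FM = 0.5
ATOM_SIZE_FM = 1.0e5  # 0.1 nm expressed in fm (assumed atomic size)

cc_energy = electromagnetic_energy_mev(2 / 3, -2 / 3, MESON_SIZE_FM)
bb_energy = electromagnetic_energy_mev(-1 / 3, 1 / 3, MESON_SIZE_FM)
separation_energy = linear_term_gev(1.0, ATOM_SIZE_FM)

print(f"c-cbar Coulomb energy at 0.5 fm: {cc_energy:.2f} MeV")
print(f"b-bbar Coulomb energy at 0.5 fm: {bb_energy:.2f} MeV")
print(f"linear term at atomic separation: {separation_energy:.0e} GeV")
```

The electromagnetic energy comes out near 1 MeV for the charm system and four times smaller for the bottom system, while the linear term alone reaches about $10^5$ GeV at atomic separations.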

One of the especially interesting features of the quarkonium excited states
shown in Figure 14.18 is the rough agreement between the energies of the $c\bar{c}$ and
$b\bar{b}$ states. (See especially the four states with S = 1 and L = 0.) This is surprising,
because in a simple two-body hydrogen-like system, the energies should depend
on the masses of the orbiting particles, and the b quark is three times as massive
as the c quark. It would be interesting to continue this comparison for the bound
states of the $t\bar{t}$ system involving the top quark, but the difficulty of producing this
particle in significant quantities (owing to its large mass) has so far prevented a
study of the excited states of the $t\bar{t}$ system.


14.8 THE STANDARD MODEL 


Ordinary matter is composed of protons and neutrons, which are in turn composed
of u and d quarks. Ordinary matter is also composed of electrons. In the radioactive
decay of ordinary matter, electron-type neutrinos are emitted. Our entire world
can thus be regarded as composed of four spin-1/2 particles (and their antiparticles),
which can be grouped into a pair of leptons and a pair of quarks:

$(e, \nu_e)$ and $(u, d)$

Within each pair, the charges of the two particles differ by one unit: −1 and 0,
$+\frac{2}{3}$ and $-\frac{1}{3}$.


When we do experiments with high-energy accelerators, we find new types 
of particles: muons and muon neutrinos, plus mesons and baryons with the new 
properties of strangeness and charm. We can account for the structure of these 
particles with another pair of leptons and another pair of quarks: 


$(\mu, \nu_\mu)$ and $(c, s)$


Once again, the particles come in pairs differing by one unit of charge. 

At even higher energies, we find a new generation of particles consisting 
of another pair of leptons (tau and its neutrino) and a new pair of quarks (top 
and bottom), which permits us to continue the symmetric arrangement of the 
fundamental particles in pairs: 


$(\tau, \nu_\tau)$ and $(t, b)$


Is it possible that there are more pairs of leptons and quarks that have not 
yet been discovered? At this point, we strongly believe the answer to be “No.” 
Every particle so far discovered can be fit into this scheme of 6 leptons and 
6 quarks. Furthermore, the number of lepton generations can be determined by 
the decay rates of the heaviest particles, and a limit of 3 emerges from these 
experiments. Finally, according to present theories the evolution of the universe 
itself would have proceeded differently if there had been more than three types of 
neutrinos. For these reasons, it is generally believed that there are no more than 
three generations of particles. 

The strong force between quarks is carried by an exchanged particle, called 
the gluon, which provides the “glue” that binds quarks together in mesons and 
baryons. (There are actually eight different gluons in the model.) A theory known 
as quantum chromodynamics describes the interactions of quarks and the exchange 
of gluons. In this theory, the internal structure of the proton consists of three 
quarks “swimming in a sea” of exchanged gluons. Like the quarks, the gluons 
cannot be observed directly, but there is indirect evidence of their existence from 
a variety of experiments. 

The theory of the structure of the elementary particles we have described so far 
is known as the Standard Model. It consists of 6 leptons and 6 quarks (and their 
antiparticles), plus the field particles (photon, 3 weak bosons, 8 gluons) that carry 
the various forces. It is remarkably successful in accounting for the properties of 
the fundamental particles, but it lacks the unified treatment of forces we would 
expect from a complete theory. 

The first step toward unification was taken in 1967 with the development of 
the electroweak theory by Stephen Weinberg and Abdus Salam. In this theory, 
the weak and electromagnetic interactions are regarded as separate aspects of the 
same basic force (the electroweak force), just as electric and magnetic forces are 
distinct but part of a single phenomenon, electromagnetism. The theory predicted 
the existence of the W and Z particles; their discovery in 1983 provided a 
dramatic confirmation of the theory. 

The next-higher level of unification would be to combine the strong and 
electroweak forces into a single interaction. Theories that attempt to do this are 
called Grand Unified Theories (GUTs). By incorporating leptons and quarks into 
a single theory, the GUTs explain many observed phenomena: the fractional 
electric charge of the quarks and the difference of one unit of charge between the 
members of the quark and lepton pairs within each generation. The GUTs also 
predict new phenomena, such as the conversion of quarks into leptons, which 
would permit the proton (which we have so far assumed to be an absolutely stable 
particle) to decay into lighter particles with a lifetime of at least $10^{31}$ y. Searches
for proton decay (by looking for evidence of decays in a large volume of matter;
see Figure 14.19) have so far been unsuccessful and have placed lower limits on 
the proton lifetime of at least 10°? y. 

A missing part of the Standard Model is an explanation of why the particles 
have the masses that we observe. A complete theory ought to be able to calculate 
the masses of the particles. It has been proposed that there is a field pervading 
the entire universe and that particles acquire their particular masses as a result 
of the strength of their interactions with this field, somewhat like a particle 
moving through a viscous medium seems to have more inertia and thus a greater 
“effective” mass. This field is known as the Higgs field and the particle that carries 
the field interaction is called the Higgs boson. This particle has been searched for 
but not yet found; estimates of its expected mass are in excess of 100 GeV/$c^2$.
The Large Hadron Collider, the world’s most powerful particle accelerator, is 
currently searching for evidence of the Higgs boson by colliding beams of protons 
at an energy of 7000 GeV. 

Another shortcoming of the Standard Model is that it is based on massless
neutrinos. Although the upper limit (see Table 14.4) on the mass of the electron
neutrino is very small (2 eV), the limits on the other neutrino masses are much
larger. Measurements of the flux of neutrinos reaching Earth from the Sun,
produced in the fusion reactions discussed in Chapter 13, have consistently
revealed a large deficit: the intensity of electron neutrinos observed on Earth
is only about 1/3 of what is predicted based on models of how fusion reactions
occur in the Sun’s interior. Recent measurements at the Sudbury Neutrino
Observatory in Canada have revealed that, although the intensity of electron
neutrinos from the Sun is only 1/3 of the expected value, the total intensity
of all neutrinos (including muon and tau neutrinos) reaching us from the Sun
agrees with the predicted rate. This is very puzzling, because the fusion reactions
in the Sun should produce only electron neutrinos; the reacting particles in the
solar interior are not sufficiently energetic to produce mu and tau leptons. This
mystery has been explained by proposing that the electron neutrinos are produced
in the solar interior at the expected rate, but that during their journey from the
Sun to Earth, the purely electron neutrinos become a mixture of roughly equal
parts electron, muon, and tau neutrinos. This nicely explains why the rate of
electron neutrinos from the Sun appears to be only about 1/3 of what is expected
(the other 2/3 of the electron neutrinos having been converted into muon or tau
neutrinos). This phenomenon of neutrino oscillation (which refers to neutrinos
oscillating from one type to another) can occur only if the neutrinos have mass.
The required masses are very small, well within the experimental limits, but the
neutrino masses are definitely not zero. The Standard Model must be extended to
include nonzero neutrino masses, and the rules for conservation of lepton number
must be modified to allow one type of neutrino to transform into another.

FIGURE 14.19 The Superkamiokande detector system in Japan was
designed to search for proton decay. The water tank, 40 m in diameter
and located 1000 m underground, holds 50,000 tons of water. The tank
is lined with more than 10,000 photomultiplier detectors that respond to
flashes of light that would be emitted when one of the protons in the water
decayed. Here the tank has been partly emptied so that the technicians (in
the boat) can service the photomultipliers.

The search for a consistent explanation of the elementary particles has led
physicists to work with exotic theories. In string theory the particles are replaced
by tiny ($10^{-33}$ cm) strings, whose vibrations give rise to the properties we observe
as particles. These theories exist in spacetimes with 10 or more dimensions, and at
present seem to be far beyond any possible experimental test. Another extension
of the Standard Model is called supersymmetry; this theory proposes that there is
a higher symmetry between the spin-1/2 particles (such as the quarks and leptons)
and particles with integral spin, so that under this theory there would be electrons
and quarks with a spin of 0 and W and Z particles and photons with a spin of
1/2. The masses of these supersymmetric particles are estimated to be very much
larger than those of their ordinary partners, perhaps in the range of 100 GeV/$c^2$, but even
in this range they should be observable through experiments currently planned at
the Large Hadron Collider.

There is so far no conclusive verification for any of the GUTs, nor is there 
a successful theory that incorporates the remaining force, gravity, into a unified 
theory. The quest for unification and its experimental tests remains an active area 
of research in particle physics. 
1. Some conservation laws are based on fundamental properties
of nature, while others are based on systematics of decays
and reactions and have as yet no fundamental basis. Give
the basis for the following conservation laws: energy, linear
momentum, angular momentum, electric charge, baryon
number, lepton number, strangeness.

2. Does the presence of neutrinos among the decay products
of a particle always indicate that the weak interaction is
responsible for the decay? Do all weak interaction decays
have neutrinos among the decay products? Which decay
product indicates an electromagnetic decay?

3. Do all strongly interacting particles also feel the weak
interaction?

4. In what ways would physics be different if there were
another member of the lepton family less massive than the
electron? What if there were another lepton more massive
than the tau?

5. Suppose a proton is moving with high speed, so that
$E \gg mc^2$. Is it possible for the proton to decay, such as
into $n + \pi^+$ or $p + \pi^0$?

6. On planet anti-Earth, antineutrons beta decay into antiprotons.
Is a neutrino or an antineutrino emitted in this decay?

7. List some experiments that might distinguish antineutrons
from neutrons. Among others, you might consider (a) neutron
capture by a nucleus; (b) beta decay; (c) the effect of a
magnetic field on a beam of neutrons.

8. The $\Sigma^0$ can decay to $\Lambda^0$ without changing strangeness, so
it goes by the electromagnetic interaction; the charged $\Sigma^\pm$
decay to p or n by the weak interaction in characteristic
lifetimes of $10^{-10}$ s. Why can’t $\Sigma^\pm$ decay to $\Lambda^0$ by the
strong interaction in a much shorter time?

9. The $\Omega^-$ particle decays to $\Lambda^0 + K^-$. Why doesn’t it also
decay to $\Lambda^0 + \pi^-$?

10. Explain why we do not account for the number of mesons in
decays or reactions with a “meson number” in analogy with
lepton number or baryon number.

11. Consider that leptons and baryons both obey conservation
laws and are both fermions; mesons do not obey a conservation
law and are bosons. Can you think of another particle
(other than a meson) that has integral spin and can be emitted
or absorbed in unlimited numbers?

12. Can antibaryons be produced in reactions between baryons
and mesons?

13. List some similarities and differences between the properties
of photons and neutrinos.

14. Is it reasonable to describe a resonance as a definite particle,
when its mass is uncertain (and therefore variable) by 20%?

15. Why are most particle physics reactions endothermic
(Q < 0)?

16. Although doubly charged baryons have been found, no doubly
charged mesons have yet been found. What would be
the effect on the quark model if a meson with charge +2e
were found? How could such a meson be interpreted within
the quark model?

17. All direct quark transformations must involve a change of
charge; for example, $u \to d$ is allowed (accompanied by the
emission of a $W^+$), but $s \to d$ is not. Can you suggest a
two-step process that might permit the transformation of an
s quark into a d?

18. The decay $K^+ \to \pi^+ + e^+ + e^-$ is at least five orders of
magnitude less probable than the decay $K^+ \to \pi^0 + e^+ + \nu_e$.
Based on Question 17, can you explain why?

19. The D mesons decay to $\pi$ and K mesons with a lifetime
of $10^{-13}$ s. (a) Why is the lifetime so much longer than a
typical strong interaction lifetime? Is a quantum number not
conserved in the decay? (b) What interaction is responsible
for the decay?

20. The $\Delta^*$ baryons are found with electric charges +2, +1, 0,
and −1. Based on the quark model, why do we expect no
$\Delta^*$ with charge −2?

21. Although we cannot observe quarks directly, indirect evidence
for quarks in nucleons comes from the scattering
of high-energy particles, such as electrons. When the de
Broglie wavelength of the electrons is small compared with
the size of a nucleon (~ 1 fm), the electrons appear to be
scattered from massive, compact objects much smaller than
a nucleon. To which phenomenon discussed previously in
this text is this similar? Can the scattering be used to deduce
the mass of the struck object? How does the scattering
depend on the electric charge of the struck object? What
would be the difference between scattering from a particle
of charge e and one of charge $\frac{2}{3}e$?

14.1 The Four Basic Forces

1. Identify the interaction responsible for the following decays
(approximate half-lives are given in parentheses):

(a) $\Delta^* \to p + \pi$ ($10^{-23}$ s)
(b) $\eta \to \gamma + \gamma$ ($10^{-18}$ s)
(c) $K^+ \to \mu^+ + \nu_\mu$ ($10^{-8}$ s)
(d) $\Lambda^0 \to p + \pi^-$ ($10^{-10}$ s)
(e) $\eta' \to \eta + 2\pi$ ($10^{-21}$ s)
(f) $K^0 \to \pi^+ + \pi^-$ ($10^{-10}$ s)

2. What is the range of the $W^-$ particle that is responsible for
the weak interaction of a proton and a neutron?


14.2 Classifying Particles

3. Give one possible decay mode of the following mesons:


_ _ <0 
(a) a (6) p (c) D (@) K 
4. Give one possible decay mode of the following antibaryons:
(a) ii (by x () & (d) = 


5. Suggest a possible decay mode for the $K^0$ meson that
involves the emission of:

(a) $\nu_e$ (b) $\bar{\nu}_e$ (c) $\nu_\mu$ (d) $\bar{\nu}_\mu$

Is it possible to have a decay mode of the $K^0$ that involves
the emission of $\nu_\tau$ or $\bar{\nu}_\tau$?


14.3 Conservation Laws

6. Name the conservation law that would be violated in each
of the following decays:
(a) m+ >et+y 
(b) A° > p+K~ 


Mos nt+y 

(f) Q- > B°4K- 

(c) Q> > D7 4+n7° (g) B° > 0°94 7° 

(d) MS > a7~+nt (hk) wooe ty 

7. Each of the following reactions violates one (or more) of
the conservation laws. Name the conservation law violated
in each case:


(e) 


(a) v.+p>n+et 

(6) p+p—>ptn+kKt 

¢) p+p>ptptA+e’ 

(7) m+n>K-4+/A° 

(e) K>+pon+A? 

8. Supply the missing particle in each of the following decays:
(a) Ko >? +e7 4 

(b) K®° > w®4n94 

(c) nonmtt+n7 + 

9. Each of the reactions below is missing a single particle.
Supply the missing particle in each case.

(a) ptp—> pt A% (d) K>+n—> AP+ 

(6) p+p>n+ (e) V,+tpan+ 

(c) m +p—> S°+K°+ (ff) Ko+poKt+ 


14.4 Particle Interactions and Decays

10. Carry out the calculations of $mc^2$ for the three decays of
Figure 14.6.

Determine the energy uncertainty or width of (a) 7; (b) 7; 
(c) B®; (d) A*. 


14.5 Energy and Momentum in Particle Decays

12. A $\Xi^-$ baryon is produced in a certain reaction with a kinetic
energy of 3642 MeV. If the particle decays after one mean
lifetime, what is the longest possible track this particle could
leave in a detector?

13. Repeat the calculation of Example 14.4 for the case in
which the $\pi$ meson has zero kinetic energy, and show that
the electron energy in this case is less than the maximum
value.


14. Find the Q values of the following decays:

(a) wW®>yty (c) Dt >K>4+nt+4+nt 
(b) ut+—> p+? 

15. Find the Q values of the following decays:

(a) m > +0, (c) D8 > M+y 

(b) K®° > at 4+n7 

16. Find the kinetic energies of each of the two product particles
in the following decays (assume the decaying particle is at
rest):

(a) K° > at4+n7 (b) 37> >n4+77 

17. Find the kinetic energies of each of the two product particles
in the following decays (assume the decaying particle is at
rest):

(a) Q- > AX+K- (6) mt ptt, 

18. A $\Sigma^-$ with a kinetic energy of 0.250 GeV decays into
$\pi^- + n$. The $\pi^-$ moves at 90° to the original direction of
travel of the $\Sigma^-$. Find the kinetic energies of $\pi^-$ and n and
the direction of travel of n.

19. A $K^0$ with a kinetic energy of 276 MeV decays in flight
into $\pi^+$ and $\pi^-$, which move off at equal angles with the
original direction of the $K^0$. Find the energies and directions
of motion of the $\pi^+$ and $\pi^-$.


14.6 Energy and Momentum in Particle Reactions

20. Show that Eq. 14.6 reduces to Eq. 13.14 in the nonrelativistic
limit.

21. Determine the Q values of the following reactions:

(a) K7>+p—> A°4+7n7° 

(b) m*+p—> xo++4+Kt 

() pt+p>ptat+e +e 

22. Determine the Q values of the following reactions:

(a) y+n>2 +p 

(b) K>+p—> Q-+K*++K? 

(c) p+p>pt+xtt+K° 


23. Find the threshold kinetic energy for the following reactions.
In each case the first particle is in motion and the second is
at rest.

(a) ptpon+Ut+K°4+nt 

(b) m +p—> u°+KkK? 


24. Find the threshold kinetic energy for the following reactions.
In each case the first particle is in motion and the second is
at rest.

(a) p+n>p+>7+Kt 

(b) m+ +p>pt+p+n 


14.7 The Quark Structure of Mesons and Baryons

25. Analyze the following reactions in terms of the quark content
of the particles and reduce them to fundamental processes
involving the quarks:

(a) K>-+p>Q-4+K*++K°?® 

(b) m+ +p—> Ut4Kt 

(c) y+tn>a2 +p 
26. Analyze the following reactions in terms of the quark content
of the particles and reduce them to fundamental processes
involving the quarks:

(a) K-+ p> A®°+ n° 

(+) pt+p>ptat+A°+K° 

(c) y+p-> Dt+D°4n 

27. Analyze the following decays in terms of the quark content
of the particles and reduce them to fundamental processes
involving the quarks:

(a) Q> > A +K- (c) mW oyty 





(6) n>pt+e 4+, (7) Dt >K-4+nt4+271 
28. Analyze the following decays in terms of the quark content
of the particles and reduce them to fundamental processes
involving the quarks:

(a) K° > at4+a7 (c) E> > n+n7 

(b) A*t*t > p4+at (d) D> Kt4+a- 

29. Based on Figure 14.17, give the quark content of the six D
mesons.


General Problems

30. Table 14.5 lists the most likely decay mode of the $K^+$ meson;
Example 14.4 gives another possible decay. List four other
possible decays that are allowed by the conservation laws.

31. It is desired to form a beam of $\Lambda^0$ particles to use for the
study of reactions with protons. The $\Lambda^0$ are produced by
reactions at one target and must be transported to another
target 2.0 m away so that at least half of the original $\Lambda^0$
remain in the beam. Find the speed and the kinetic energy
of the $\Lambda^0$ for this to occur.

32. Find a decay mode, other than that listed in Table 14.6,
for (a) $\Omega^-$; (b) $\Lambda^0$; (c) =F that satisfies the applicable
conservation laws.


33. Consider the reaction p + p → p + p + π⁰ discussed in
Example 14.6, but viewed instead from a frame of reference
in which the two protons collide head-on with equal velocities.
(a) At threshold in this frame of reference, the product
particles are formed at rest. Find the proton velocities in this
case. (b) Use the Lorentz velocity transformation to switch to
the laboratory frame of reference in which one of the protons
is at rest, and find the velocity of the other proton. (c) Find
the kinetic energy of the incident proton in the laboratory
frame and compare with the value found in Example 14.6.

34. The D_s⁺ meson (rest energy = 1969 MeV, S = +1, C = +1;
see Figure 14.17) has a lifetime of 0.5 × 10⁻¹² s. (a) Which
interaction is responsible for the decay? (b) Among the
possible decay modes are φ + π⁺, μ⁺ + ν_μ, and K⁺ + K̄⁰.
How do the S and C quantum numbers change in these
three decays? (The φ meson has a spin of 1, a rest energy
of 1020 MeV, and a quark content of ss̄.) (c) Analyze the
three decay modes according to the quark content of the
initial and final particles. (d) Why is the decay into K⁺ +
π⁺ + π⁻ allowed, while the decay into K⁻ + π⁺ + π⁺ is
forbidden?

35. In the decay K⁺ → π⁺ + π⁺ + π⁻ with the initial K meson
at rest, what is the maximum kinetic energy of the pi mesons?

36. A beam of π⁻ mesons with a speed of 0.9980c is incident
on a target of protons at rest. The reaction produces two particles,
one of which is a K⁰ meson that is observed to travel
with momentum 1561 MeV/c in a direction that makes an
angle of 20.6° with the direction of the incident pions.
(a) Find the momentum and the direction of the second
product particle. (b) Find the energy of that particle. (c) Find
the rest energy of the second particle and deduce its identity.











COSMOLOGY: THE ORIGIN AND FATE OF THE UNIVERSE


Today we scan the skies at all wavelengths from the very short (X rays and gamma rays) to 
the very long (radio waves). New and unexpected discoveries have occurred at these 
wavelengths: quasars, pulsars, supernovas, black holes — all of which suggest that the 
universe is not at all static and eternal, as was once believed, but instead is active, evolving, 
and teeming with radiation. Van Gogh’s painting The Starry Night suggests exactly that view, 
even though it was painted in 1889, long before any of these discoveries were made. 
In the short time of a few hundred years, developments in astronomy have taken
us from the belief that the Earth and its human population were the center of the 
universe, to a role that approaches insignificance. Before the 16th century, it was 
widely believed that the planets, Sun, Moon, and stars revolved about a central 
Earth. By the early 20th century, astronomers had discovered that we inhabit 
one minor star of a vast number in our galaxy, and that the universe contains an 
equally vast number of other galaxies. 

Gravity is the dominant force that determines the structure of the present 
universe, but Newton’s theory is insufficient to explain a number of observations 
of the motion of celestial objects. For this purpose we need a different theory, 
the general theory of relativity, which was proposed by Albert Einstein in 1916. 
Although the mathematics of this theory is beyond the level of this text, we can 
summarize some of its features and discuss its experimental predictions and their 
verification. Like the special theory, the general theory of relativity offers us a 
new way of thinking about space and time. 

In this chapter, we briefly survey the field of cosmology, the study of the 
universe on the large scale, including its origin, evolution, and future. For this 
study we must rely not only on relativity (special and general) and quantum theory, 
but also on fundamental results from atomic and molecular physics, statistical 
physics, thermodynamics, nuclear physics, and particle physics. 

We begin with three discoveries that fundamentally altered our concept of the 
universe: it is expanding, it is filled with electromagnetic radiation, and most of 
its mass is mysteriously hidden from our view. We show how these discoveries 
have been incorporated, using results from general relativity, into a theory of 
the origin of the universe known as the Big Bang theory. We then consider other 
measurements that support this theory, and we conclude with some speculations 
on the future of the universe. 


15.1 THE EXPANSION OF THE UNIVERSE 


The evidence for the expansion of the universe comes from the change in 
wavelength of the light emitted by distant galaxies. In Chapter 2, we analyzed 
a similar effect as the relativistic Doppler shift (Eq. 2.22), which we can write 
in terms of wavelength as 


\lambda' = \lambda \sqrt{\frac{1 + v/c}{1 - v/c}} = \lambda \frac{1 + v/c}{\sqrt{1 - v^2/c^2}}    (15.1)


where v represents the relative velocity between the source of the light and the
observer. Here λ′ is the wavelength we measure on Earth and λ is the wavelength
emitted by the moving star or galaxy in its own rest frame.

The light emitted by a star such as the Sun has a continuous spectrum. As light 
passes through the star’s atmosphere, some of it is absorbed by the gases in the 
atmosphere, so the continuous emission spectrum has a few dark absorption lines 
superimposed (see Figure 6.15). Comparison between the known wavelengths of 
these lines (measured on Earth for sources at rest relative to the observer) and 
the Doppler-shifted wavelengths allows the speed of the star to be deduced from 
Eq. 15.1. 
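
As an illustration of this procedure, Eq. 15.1 can be solved for v/c: squaring gives (λ′/λ)² = (1 + v/c)/(1 − v/c), so v/c = [(λ′/λ)² − 1]/[(λ′/λ)² + 1]. The short Python sketch below applies this result. The function name and the observed wavelength are assumptions chosen only for illustration; the rest wavelength is roughly that of one of the calcium absorption lines.

```python
C_KM_S = 2.998e5  # speed of light in km/s


def speed_from_wavelengths(lam_rest, lam_obs):
    """Return v/c from Eq. 15.1 (positive = receding, i.e. a red shift).

    lam_rest: wavelength measured for a source at rest (any length unit)
    lam_obs:  Doppler-shifted wavelength measured on Earth (same unit)
    """
    ratio_sq = (lam_obs / lam_rest) ** 2
    return (ratio_sq - 1.0) / (ratio_sq + 1.0)


# Illustrative numbers only: a calcium line near 393.4 nm,
# with an assumed (made-up) observed wavelength of 413.1 nm.
beta = speed_from_wavelengths(393.4, 413.1)
print(f"v/c = {beta:.4f}, v = {beta * C_KM_S:.0f} km/s")
```

With these made-up numbers the recessional speed comes out near 1.5 × 10⁴ km/s. A blue shift (observed wavelength shorter than the rest wavelength) gives a negative result, corresponding to a source that is approaching.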

Of the stars in our galaxy, some are found to be moving toward us, with 
their light shifted toward the shorter wavelengths (blue), and others are moving 
away from us, with their light shifted toward the longer wavelengths (red). The
average speed of these stars relative to us is about 30 km/s (10⁻⁴c). The change
in wavelength for these stars is very small. Light from nearby galaxies, those of 
our “local” group, again shows either small blue shifts or small red shifts. 

However, when we look at the light from distant galaxies, we find it to 
be systematically red shifted, and by a large amount. Some examples of these 
measurements are shown in Figure 15.1. We do not see a comparable number of 
red and blue shifts, as we would expect if the galaxies were in random motion. 
All of the galaxies beyond our local group seem to be moving away from us. 

The cosmological principle asserts that the universe must look the same from 
any vantage point, and so we must conclude that any other observer in the universe 
would draw the same conclusion: The galaxies would be observed to recede from 
every point in the universe. 


Hubble’s Law 


In the 1920s, astronomer Edwin Hubble was using the 100-inch telescope on 
Mount Wilson in California to study the wispy nebulae. By resolving individual 
stars in the nebulae, Hubble was able to show that they are galaxies like the 
Milky Way, composed of hundreds of billions of stars. When Hubble measured 
the wavelength shifts of the light from the galaxies and deduced their speeds, he 
made two remarkable conclusions: the galaxies are moving away from us, and 
the farther away a galaxy is from us, the faster it is moving. This proportionality 
between the speed of the galaxy and its distance d is known as Hubble’s law: 


v = H_0 d    (15.2)


The proportionality constant H₀ is known as the Hubble parameter.

Figure 15.2a shows a plot of Hubble’s data for the deduced speeds against the 
distance. Although the points scatter quite a bit (due primarily to uncertainties in 
the distance measurements), there is a definite indication of a linear relationship. 
(Hubble’s distance calibration was incorrect, so the labels on the horizontal axis 
do not correspond to the actual distances to the galaxies.) More modern data
based on observing supernovas in distant galaxies are shown in Figure 15.2b.
There is again clear evidence for a linear relationship, and the slope of the line
gives a value of the Hubble parameter of about 72 km/s/Mpc*, within a range
of about ±10%. The Hubble parameter can also be determined from a variety of
other cosmological experiments. These agree with the supernova data, and the 
best current value is 





H_0 = 72 \, \frac{\text{km/s}}{\text{Mpc}}





The uncertainty in this value is on the order of ±4%.

The Hubble parameter has the dimension of inverse time. As we show later,
H₀⁻¹ is a rough measure of the age of the universe. The best value of H₀ gives an
age of 14 × 10⁹ y. If the speed of recession has been changing, the true age can
be less than H₀⁻¹.
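
The unit conversion behind this age estimate can be sketched in a few lines of Python. The conversion factors (1 Mpc = 3.084 × 10¹⁹ km, from the parsec value in the footnote below, and 1 y ≈ 3.156 × 10⁷ s) and the function name are assumptions of this sketch rather than part of the text.

```python
KM_PER_MPC = 3.084e19       # 1 Mpc = 10^6 pc, with 1 pc = 3.084e13 km
SECONDS_PER_YEAR = 3.156e7  # approximate length of one year in seconds


def hubble_time_years(h0_km_s_per_mpc):
    """Return 1/H0 in years for H0 given in km/s/Mpc."""
    h0_per_second = h0_km_s_per_mpc / KM_PER_MPC  # H0 in units of 1/s
    return 1.0 / h0_per_second / SECONDS_PER_YEAR


age = hubble_time_years(72.0)
print(f"1/H0 = {age:.2e} y")
```

For H₀ = 72 km/s/Mpc this gives about 1.4 × 10¹⁰ y, consistent with the age quoted above.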


*A parsec, pc, is a measure of distance on the cosmic scale; it is the distance that corresponds to one
angular second of parallax. Because parallax is due to the Earth’s motion around the Sun, the parallax
angle 2α is the diameter 2R of the Earth’s orbit divided by the distance d to the star or galaxy. Thus
α = R/d radians, which gives 1 pc = 3.26 light-years = 3.084 × 10¹³ km. One megaparsec, Mpc, is
10⁶ pc.







FIGURE 15.1 Red shifts of galaxies. 
(a) The horizontal band in the cen- 
ter shows a continuous spectrum with 
two dark lines superimposed (vertical 
arrow near left side), which repre- 
sent absorption by calcium. Above 
and below the absorption spectrum 
are emission spectra for calibration. 
The recessional speed of this galaxy 
(which is in our local group) is 
1200 km/s, so the red shift is very
small. (b) The absorption spectrum of 
a galaxy with a recessional speed of 
15,000 km/s. The red shift of the two 
calcium lines (indicated by the hor- 
izontal arrow) is significant. (c) The 
red shift of a galaxy with a recessional 
speed of 22,000 km/s. (Data courtesy 
Hale Observatories.) 
FIGURE 15.2 (a) Hubble’s original data showing the linear relationship between the recessional speed of a
galaxy and its distance from Earth. (b) Modern data showing Hubble’s law. Data are based on observations of
supernovas in distant galaxies using the Hubble Space Telescope. The solid line represents a Hubble parameter
of 72 km/s/Mpc, and the dashed lines show the limits corresponding to ±7 km/s/Mpc. [Data from W. L.
Freedman et al., Astrophysical Journal 553, 47 (2001).]


How does the Hubble law show that the universe is expanding? Consider the
unusual universe represented by the three-dimensional coordinate system shown
in Figure 15.3a, where each point represents a galaxy. With the Earth at the origin,
we can determine the distance d to each galaxy. If this universe were to expand,
with all the points becoming further apart, as in Figure 15.3b, the distance to each
galaxy would be increased to d′. Suppose the expansion were such that every
dimension increased by a constant ratio k in a time t; that is, x′ = kx, and so forth.
Then d′ = kd, and a given galaxy moves away from us by a distance d′ − d in a
time t, so its apparent recessional speed is

v = \frac{d' - d}{t} = \frac{(k - 1)\,d}{t}

If we compare two galaxies 1 and 2,

\frac{v_1}{v_2} = \frac{d_1}{d_2}    (15.4)

a relationship identical with Hubble’s law, Eq. 15.2. Thus, in an expanding
universe, it is perfectly natural that the further away from us a galaxy might be,
the faster we observe it to be receding.

Notice also from Figure 15.3 that this is true no matter which point we happen
to choose as our origin. From any point in the “universe” of Figure 15.3, the other
points would be observed to satisfy Eq. 15.4 and thus also Hubble’s law. We can
further demonstrate this with two analogies. If we glue some spots to a balloon
(Figure 15.4) and then inflate it, every spot observes all other spots to be moving
away from it, and the farther away a spot is from any point, the faster its separation
grows. For a three-dimensional analogy, consider the loaf of raisin bread shown
in Figure 15.5 rising in an oven. As the bread rises, every raisin observes all the
others to be moving away from it, and the speed of recession increases with the
separation.

Edwin Hubble (1889-1953, United States). His observational work with
large telescopes revealed the existence of galaxies, and he was the first to
measure their size and distance. Hubble’s discovery of the recessional motion
of the galaxies was one of the most exciting and important in the history
of astronomy.
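
The claim that every observer sees the same velocity-distance law can be checked numerically. The sketch below is a toy model under stated assumptions: 50 randomly placed "galaxies," a uniform stretch x′ = kx with k = 1.1 in a time t = 1 (arbitrary units), and a hypothetical helper function `recession_speeds`. It computes the apparent speed (d′ − d)/t of every other point as seen from two different origins and compares the ratio v/d.

```python
import math
import random


def recession_speeds(points, origin, k, t):
    """Return (distance, apparent speed) pairs as seen from `origin`.

    Every coordinate of every point is stretched by the factor k during
    the time t, so each separation d becomes d' and the apparent speed
    is (d' - d) / t.
    """
    pairs = []
    stretched_origin = [k * c for c in origin]
    for p in points:
        if p is origin:
            continue  # skip the observer's own galaxy
        d = math.dist(p, origin)
        d_new = math.dist([k * c for c in p], stretched_origin)
        pairs.append((d, (d_new - d) / t))
    return pairs


random.seed(0)  # reproducible toy universe
galaxies = [tuple(random.uniform(-100.0, 100.0) for _ in range(3))
            for _ in range(50)]
k, t = 1.1, 1.0

for origin in (galaxies[0], galaxies[17]):
    ratios = [v / d for d, v in recession_speeds(galaxies, origin, k, t)]
    # The smallest and largest v/d agree: one "Hubble parameter" for all.
    print(min(ratios), max(ratios))
```

From either origin the ratio v/d is the same for every galaxy and equals (k − 1)/t, which plays the role of the Hubble parameter in this toy universe.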
FIGURE 15.3 The expansion of a coordinate space, showing that the
apparent speed of recession depends on the distance; d₂ is greater than d₁,
and d₂ increases faster than d₁.

FIGURE 15.4 As a balloon is inflated, every observer on the
surface experiences a velocity-distance relationship of the
form of the Hubble law.


The correct interpretation of the cosmological redshifts requires the techniques 
of general relativity, which we discuss later in this chapter. According to general 
relativity, the shift in wavelength is caused by a stretching of the entire fabric of 
spacetime. Imagine small photos of galaxies glued to a rubber sheet. As the sheet is 
stretched, the distance between the galaxies increases, but they are not “in motion” 
according to the terms we usually use in physics to describe motion. However, 
the stretching of the space between the galaxies causes the wavelength of a light 
signal from one galaxy to increase by the total amount of the stretching before it 
is received at another galaxy. This is very different from the usual interpretation 
of the Doppler formula (Eq. 15.1). (In fact, for some galaxies the wavelength 
shift is so large that the special relativity formula would imply a recessional 
speed greater than the speed of light!) At low speeds, the Doppler interpretation 
of the redshift (that is, calculating a speed from the Doppler formula and using 
that speed in Hubble’s law) gives results that correspond with those based on an 
expansion of spacetime. However, for very large cosmological redshifts, a more
correct analysis must be based on the stretching model:

\frac{\lambda'}{\lambda} = \frac{R_0}{R}    (15.5)

where R₀ represents a “size” or distance scale factor of the universe at the present
time and R represents a similar factor at the time the light was emitted.
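
As a brief numerical illustration of Eq. 15.5, the sketch below converts a measured wavelength ratio into the relative size R/R₀ of the universe at the time the light was emitted. The function name and the sample wavelength ratios are assumptions made for illustration only.

```python
def size_at_emission(lam_emitted, lam_observed):
    """Return R/R0 from Eq. 15.5, lambda'/lambda = R0/R.

    lam_emitted:  wavelength in the rest frame of the source (lambda)
    lam_observed: wavelength received today (lambda')
    """
    if lam_emitted <= 0 or lam_observed <= 0:
        raise ValueError("wavelengths must be positive")
    return lam_emitted / lam_observed


# Illustrative stretch factors lambda'/lambda (not measured data).
for stretch in (1.1, 2.0, 5.0):
    ratio = size_at_emission(1.0, stretch)
    print(f"lambda'/lambda = {stretch}: R/R0 = {ratio:.2f}")
```

In this picture, light received with its wavelength doubled was emitted when the distance scale of the universe was half its present value.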









FIGURE 15.5 Another system in
which the Hubble law is valid.




George Gamow (1904-1969, United States). His significant contributions
to nuclear physics (theories of alpha decay, beta decay, and nuclear
structure), astrophysics (nucleosynthesis, stellar structure), and
cosmology (the Big Bang theory) place him among the first rank of scientists.
Gamow was also one of the most successful writers of popular science, to
which he brought unusual and amusing perspectives.





The expansion of the universe has been widely accepted since Hubble’s 
discoveries in the 1920s. There are, however, two interpretations of this expansion. 
(1) If the galaxies are separating, long ago they must have been closer together. 
The universe was much denser in its past history, and if we look back far enough 
we find a single point of infinite density. This is the “Big Bang” hypothesis, 
developed in 1948 by George Gamow and his colleagues. (2) The universe has 
always had about the same density it does now. As the galaxies separate, additional 
matter is continuously created in the empty space between the galaxies, to keep 
the density more or less constant. This is the “Steady State” hypothesis, proposed 
also in 1948 by astronomer Fred Hoyle and others. New galaxies created from 
this new matter would make the universe look the same not only from all vantage 
points, but also at all times in the present and future. (To keep the density constant, 
the rate of creation need be only about one hydrogen atom per cubic meter every 
billion years.) 

Both hypotheses had their supporters, and during the 1940s and 1950s the 
experimental evidence did not seem to favor either one over the other. In the 1960s, 
the new field of radio astronomy revealed the presence of a universal background 
radiation in the microwave region, which is believed to be the remnant radiation 
from the Big Bang. This single observation has propelled the Big Bang theory to 
the forefront of cosmological models. 


15.2 THE COSMIC MICROWAVE BACKGROUND RADIATION





When a gas expands adiabatically, it cools. The same is true for the universe: the 
expansion is accompanied by cooling. As we go back in time, we find a hotter, 
denser universe. Far enough back in time, the universe would have been too 
hot for stable matter to form. Its composition was then a “gas” of particles and 
photons. The unstable particles eventually decayed to stable ones, and the stable 
particles eventually clumped together to form matter. The photons that filled 
the universe remained, but their wavelengths were stretched by the continuing 
expansion. Today those photons have a much lower temperature, but they still 
uniformly fill the universe. 

The wavelength spectrum of those photons is that of an isolated object
(blackbody) emitting thermal radiation at the temperature T that characterizes the
universe at a particular time. The wavelengths change as the universe expands,
but the radiation retains an ideal thermal spectrum at a temperature that decreases
with time. In the 1940s, the Big Bang cosmologists (Gamow and others) predicted
that this “fireball” would today be at a temperature of the order of 5 to 10 K; such
photons would have a typical energy kT of the order of 10⁻³ eV or a wavelength
of order 1 mm, in the microwave region of the spectrum.

The properties of this background radiation can be described using the 
formulas for thermal radiation we developed in Section 10.6. The number of 
photons dN in the energy interval dE at E (that is, with energies between E and 
E+dE) was given by Eq. 10.38. Writing that equation as the number per unit 
volume (number density), we get 


\frac{N(E)\,dE}{V} = \frac{8\pi E^2}{(hc)^3} \, \frac{1}{e^{E/kT} - 1} \, dE    (15.6)
To find the total number of photons of all energies per unit volume we integrate
Eq. 15.6 over energy:


\frac{N}{V} = \frac{1}{V} \int_0^\infty N(E)\,dE = \frac{8\pi}{(hc)^3} \int_0^\infty \frac{E^2\,dE}{e^{E/kT} - 1} = \frac{8\pi}{(hc)^3} (kT)^3 \int_0^\infty \frac{x^2\,dx}{e^x - 1}    (15.7)








where we have substituted x = E/kT. The definite integral is a standard form with 
a value approximately equal to 2.404. Equation 15.7 shows that the total number 
of photons per unit volume is proportional to the cube of the temperature, and 
evaluating the constants we find 


N/V = (2.03 \times 10^7 \ \text{photons/m}^3 \cdot \text{K}^3) \, T^3    (15.8)


We can write Eq. 10.41 for the energy density (energy per unit volume) in the 
same form by evaluating the constants: 


U = \frac{8\pi^5 k^4}{15 (hc)^3} T^4 = (4.72 \times 10^3 \ \text{eV/m}^3 \cdot \text{K}^4) \, T^4    (15.9)


and the mean (average) energy per photon at temperature T is obtained from the
ratio of Eqs. 15.9 and 15.8:

E_m = \frac{U}{N/V} = (2.33 \times 10^{-4} \ \text{eV/K}) \, T    (15.10)
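
The numerical coefficients in Eqs. 15.8 to 15.10 can be checked with a short calculation. The sketch below is one way to do so, under two assumptions that are not part of the text: the constants are taken as k = 8.617 × 10⁻⁵ eV/K and hc = 1.2398 × 10⁻⁶ eV·m, and the definite integral in Eq. 15.7 is evaluated from the series obtained by expanding 1/(eˣ − 1) in powers of e⁻ˣ, which gives 2(1 + 1/2³ + 1/3³ + ···).

```python
import math

K_EV = 8.617e-5      # Boltzmann constant in eV/K (assumed value)
HC_EV_M = 1.2398e-6  # hc in eV*m (assumed value)

# Integral of x^2/(e^x - 1) from 0 to infinity, summed term by term:
# each term of the expansion contributes 2/n^3.
integral = 2.0 * sum(1.0 / n**3 for n in range(1, 100001))

# Coefficient of T^3 in Eq. 15.8, in photons/(m^3 K^3)
number_coeff = 8.0 * math.pi * (K_EV / HC_EV_M) ** 3 * integral

# Coefficient of T^4 in Eq. 15.9, in eV/(m^3 K^4)
energy_coeff = 8.0 * math.pi**5 * K_EV**4 / (15.0 * HC_EV_M**3)

# Coefficient of T in Eq. 15.10, in eV/K
mean_energy_coeff = energy_coeff / number_coeff

print(f"integral          = {integral:.3f}")
print(f"N/V coefficient   = {number_coeff:.3g}")
print(f"U coefficient     = {energy_coeff:.3g}")
print(f"E_m coefficient   = {mean_energy_coeff:.3g}")
```

The results agree with the coefficients quoted in Eqs. 15.8 to 15.10 to within rounding.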

We now look at the experimental evidence for the existence of this microwave 
radiation and the determination of its temperature. From Eq. 10.42 we see that 
the measurement of the radiant energy density at any wavelength is enough for 
a determination of the temperature T, although to demonstrate that the radiation
actually has an ideal thermal spectrum requires measurement over a range of 
wavelengths. 

The first experimental evidence for this radiation was obtained in a 1965 
experiment by Arno Penzias and Robert Wilson, who used a microwave antenna 
tuned to a wavelength of 7.35 cm. At this wavelength they recorded an annoying 
“hiss” from their antenna that could not be eliminated, no matter how much care 
they took in refining the measurement. After painstaking efforts to eliminate the 
“noise,” they concluded that it was coming from no identifiable source and was 
striking their antenna from all directions, day and night, summer and winter. From 
the radiant energy at that wavelength they deduced a temperature of 3.1 ± 1.0 K,
and it was later concluded that the radiation was the present remnant of the Big 
Bang “fireball.” For this experiment, Penzias and Wilson shared the 1978 Nobel 
Prize in physics. 

Since that original experiment there have been many additional studies, at 
various wavelengths in the range 0.05 to 100 cm, all giving about the same
temperature. The most recent measurements were made with the Cosmic Background
Explorer (COBE) satellite, which was launched into Earth orbit in 1989, and the
Wilkinson Microwave Anisotropy Probe (WMAP) satellite, which was launched 
into solar orbit in 2001. Previously, no precise data from Earth-bound observations 
were available below a wavelength of 1 cm because of atmospheric absorption. 
The COBE and WMAP satellites were able to obtain very precise data on the 
intensity of the background radiation in the wavelength range between 1 cm and 
0.05 cm (0.0001 eV and 0.0025 eV). 
FIGURE 15.6 Energy density of the microwave background. The data points are from the
COBE satellite, and the error bars have been multiplied by a factor of 400 to make
them visible. The solid curve is the expected thermal radiation spectrum (Eq. 10.39)
for a temperature of 2.725 K. (Data from Legacy Archive for Microwave Background
Data Analysis, NASA Office of Space Science.)


The results from the COBE satellite are summarized in Figure 15.6. The data 
points fall precisely on the solid line, which is calculated from Eq. 10.39 for a 
temperature T = 2.725 K. For these spectacularly beautiful data, experimenters 
John Mather and George Smoot were awarded the 2006 Nobel Prize in physics. 

Other experiments show that the radiation has a uniform intensity in all 
directions. It comes from no particular source, but instead fills the universe today 
as it did in the early times just after the Big Bang. 

Using the deduced temperature of 2.7 K, we can calculate from Eqs. 15.8 to
15.10 that there are about 4.0 × 10⁸ of these photons in every cubic meter of space,
that they contribute to the universe an energy density of about 2.5 × 10⁵ eV/m³
(about half the rest energy of an electron), and that each photon has an average
energy of about 0.00063 eV. The number of photons is particularly important,
because for nearly all of the last 14 × 10⁹ y the ratio of the number of nucleons
(protons and neutrons) to photons has been almost constant. This has important
consequences for the Big Bang cosmology.
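
These three numbers follow directly from the coefficients in Eqs. 15.8 to 15.10. A minimal sketch of the arithmetic is shown below; the function name and the value used for the electron rest energy (0.511 MeV) are assumptions of the sketch.

```python
N_COEFF = 2.03e7   # photons/(m^3 K^3), from Eq. 15.8
U_COEFF = 4.72e3   # eV/(m^3 K^4), from Eq. 15.9
E_COEFF = 2.33e-4  # eV/K, from Eq. 15.10
ELECTRON_REST_ENERGY_EV = 0.511e6  # assumed value, in eV


def photon_gas(temperature_k):
    """Return (number density, energy density, mean photon energy)."""
    number_density = N_COEFF * temperature_k**3   # photons per m^3
    energy_density = U_COEFF * temperature_k**4   # eV per m^3
    mean_energy = E_COEFF * temperature_k         # eV per photon
    return number_density, energy_density, mean_energy


n, u, e_mean = photon_gas(2.7)
print(f"N/V = {n:.1e} photons/m^3")
print(f"U   = {u:.1e} eV/m^3")
print(f"E_m = {e_mean:.5f} eV")
print(f"U / (electron rest energy) = {u / ELECTRON_REST_ENERGY_EV:.2f}")
```

The last line compares the energy in one cubic meter with the rest energy of a single electron, which is the comparison made in the text.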


15.3 DARK MATTER 


Figure 15.7 shows spiral galaxies that are similar to our Milky Way galaxy, in 
which about 10¹¹ stars are bound together by the gravitational force. The diameter
of a typical galaxy might be 10-50 kpc (0.3-1.5 × 10¹⁸ km). Many galaxies
have this spiral structure, with a bright central region (containing most of the 
galaxy’s mass) and several spiral arms in a flat disk. The entire structure rotates 
about an axis perpendicular to the plane of the disk. The Sun, which is in one of 
the spiral arms of our galaxy at a distance of 8.5 kpc from the center (about 2/3 
of the radius of the disk), has a tangential velocity of 220 km/s. At this speed, it 
takes about 240 million years for a complete rotation; during the lifetime of the 
solar system of about 4.5 billion years, the Sun has made about 20 revolutions. 

FIGURE 15.7 Spiral galaxies similar to the Milky Way, viewed from two different perspectives: one normal to the plane
and one along the plane.

Because the stars in the galaxy are bound by the gravitational force, we can use
Kepler’s laws to analyze the motion. We assume that the gravitational force on
the Sun is due primarily to the dense region at the center of the galaxy; the total
mass of the other stars in the spiral arm is much smaller than the central mass, so
they make a negligible contribution to the force on the Sun. Kepler’s third law
relates the period T of the orbit to the radius:


T^2 = \left( \frac{4\pi^2}{GM} \right) r^3    (15.11)

With T = 2πr/v, where v is the tangential velocity, we obtain

v = \sqrt{\frac{GM}{r}}    (15.12)


Here M refers to the mass contained within the region of radius r. The Sun’s
tangential velocity suggests that a mass equivalent to 10¹¹ solar masses lies within
the Sun’s orbit.
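
The rotation period and the enclosed mass quoted for the Sun can be reproduced from T = 2πr/v and from Eq. 15.12 solved for M, that is, M = v²r/G. The sketch below does this arithmetic; the values of G, the solar mass, and the conversion factors are standard figures assumed for the sketch rather than taken from the text.

```python
import math

G = 6.674e-11            # gravitational constant, N m^2/kg^2 (assumed)
M_SUN_KG = 1.989e30      # solar mass in kg (assumed)
M_PER_KPC = 3.084e19     # 1 kpc = 10^3 pc, with 1 pc = 3.084e16 m
SECONDS_PER_YEAR = 3.156e7

r = 8.5 * M_PER_KPC      # Sun's distance from the galactic center, m
v = 220e3                # Sun's tangential speed, m/s

period_years = 2.0 * math.pi * r / v / SECONDS_PER_YEAR
enclosed_mass_solar = v**2 * r / G / M_SUN_KG  # Eq. 15.12 solved for M

print(f"period        = {period_years:.2e} y")
print(f"enclosed mass = {enclosed_mass_solar:.2e} solar masses")
```

The period comes out near 2.4 × 10⁸ y and the enclosed mass near 10¹¹ solar masses, in agreement with the values stated above.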

According to this model, we expect stars beyond the Sun to have tangential 
velocities that decrease with increasing radius like r^(-1/2). (The planets in the solar
system follow this expectation to very high precision.) However, we observe that 
for the rotation of the galaxy v is constant or perhaps increases slightly for stars 
beyond the Sun (Figure 15.8). 

Other spiral galaxies show the same effect. The tangential speeds of stars in 
distant galaxies can be measured by the Doppler shift of their light. If we are 
viewing a galaxy along the plane of the disk, then one side will always be moving 
toward us and the other will always be moving away from us. The difference 
between the Doppler shifts of the light from the two sides of the galaxy then tells 
us about its rotational speed, independently of the net motion of the entire galaxy. 
From this measurement, we can determine how the tangential velocity of the 
galaxy depends on the distance from its center. A typical set of results is shown 
in Figure 15.9. Once again, the velocity fails to follow the expected relationship 
and instead remains constant throughout the visible part of the galaxy. 

These results are not consistent with Kepler’s law, which is based on a large 
central mass attracting each star toward the center of the galaxy. On the contrary, 
to explain a velocity that is constant as a function of radius we must have a 
mass M that increases linearly with r (see Eq. 15.12). However, this explanation 
is inconsistent with visual observations of the galaxies, which clearly show that 
most of the light, and therefore presumably most of the mass, is concentrated in 
the central region. 
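
The contrast between the two pictures can be made concrete with Eq. 15.12. The sketch below is illustrative only: it takes the Sun's values (220 km/s at 8.5 kpc) as a reference point, assumes standard values for G and the solar mass, and uses hypothetical function names. It tabulates the speed expected at larger radii if all the mass were central, and the enclosed mass M = v²r/G required if the speed instead stays constant.

```python
G = 6.674e-11         # gravitational constant, N m^2/kg^2 (assumed)
M_SUN_KG = 1.989e30   # solar mass in kg (assumed)
M_PER_KPC = 3.084e19  # meters per kiloparsec


def keplerian_speed_km_s(r_kpc, v_ref_km_s=220.0, r_ref_kpc=8.5):
    """Speed expected from Eq. 15.12 if M is fixed: v scales as r^(-1/2)."""
    return v_ref_km_s * (r_ref_kpc / r_kpc) ** 0.5


def enclosed_mass_solar(r_kpc, v_km_s=220.0):
    """Mass within radius r needed for a given speed: M = v^2 r / G."""
    v = v_km_s * 1e3
    r = r_kpc * M_PER_KPC
    return v * v * r / (G * M_SUN_KG)


for r_kpc in (8.5, 17.0, 34.0):
    print(f"r = {r_kpc:5.1f} kpc: "
          f"Keplerian v = {keplerian_speed_km_s(r_kpc):6.1f} km/s, "
          f"mass for constant v = {enclosed_mass_solar(r_kpc):.2e} M_sun")
```

Each doubling of the radius lowers the Keplerian speed by a factor of √2, whereas holding the speed constant requires the enclosed mass to double.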

To resolve this dilemma, it has been concluded that there is a large quantity 
of invisible matter in galaxies—matter that must be present in the galaxy to 
supply the gravitational force, but that does not give off any light (or other 
electromagnetic radiation). To supply the required gravitational force, this dark 
matter must have a mass at least 10 times the mass of the visible matter in the 
galaxy. That is, more than 90% of the matter in the galaxy is in some unknown 
and invisible form. This dark matter permeates the region of space occupied by 
the galaxy and may surround it in the form of a halo. 

Galaxies are observed in gravitationally bound clusters of typically 100
members. The size of a cluster is perhaps 1 Mpc, about 100 times as large as a
typical galaxy. Because these clusters rotate about their common center, we can
perform similar measurements to compare the rotational speed of a galaxy with
its distance from the center. As with individual galaxies, the clusters follow the
dependence shown in Figure 15.9, suggesting that there is more matter in the
clusters than we can account for from the galaxies alone. We conclude that dark
matter also surrounds clusters of galaxies.

FIGURE 15.8 Tangential velocities of stars in our galaxy, determined from
the Doppler shift of their light. The solid line is the prediction based on
Kepler’s third law, Eq. 15.12.

FIGURE 15.9 The tangential rotational velocity of a distant galaxy as
a function of the distance from its center. The solid line is the expected
r^(-1/2) dependence at distances beyond the central concentration of stars.

Vera Rubin (1928-, United States). An observational astronomer, she has
made pioneering discoveries about the motion of stars and galaxies. By
observing the Doppler shifts of stars in galaxies, she deduced that their
rotational velocities are not consistent with attraction only by a large
concentration of mass at the galactic center. Rubin’s work has been among the
leading contributions to understanding the existence and amount of dark
matter in the universe. She was awarded the National Medal of Science in 1993.

Support for the existence of dark matter comes from the observation of light 
from distant galaxies that passes by a cluster of galaxies on its way to Earth. This 
light is deflected by the gravitational field of the cluster in a process known as 
“gravitational lensing.” From the amount of the deflection of the light, it is possible 
to deduce the quantity of matter in the cluster. The results of these observations 
show that there is much more matter in the cluster than we would deduce from 
the luminous matter alone, suggesting the presence of relatively large amounts of 
dark matter. A dramatic illustration of this effect occurs in the colliding galaxies 
of the “Bullet Cluster,” first analyzed in 2006, in which the distribution of ordinary 
matter (revealed by the glowing interstellar gas) is different from the distribution 
of dark matter (revealed by the gravitational lensing effect). 

What kinds of objects make up this dark matter? Speculations about its 
nature are divided loosely into two categories: MACHOs (Massive Compact Halo 
Objects) and WIMPs (Weakly Interacting Massive Particles). Possible MACHOs 
include massive black holes, neutron stars, burnt-out white dwarf stars, or brown 
dwarf stars (Jupiter-sized objects of too small a mass to become a star). The 
WIMPs include neutrinos, magnetic monopoles, and other exotic types of stable 
elementary particles produced during the Big Bang. The major difference between 
the two types of objects is that MACHOs are made from baryons (protons and 
neutrons, like ordinary matter), while WIMPs are made from some other, more 
exotic type of non-baryonic matter. Current theories suggest that most of the dark 
matter is of the non-baryonic form, but (aside from neutrinos) no examples of this 
type of matter have yet been produced in any laboratory on Earth. 


15.4 THE GENERAL THEORY OF RELATIVITY 


The interpretation of the observational data describing our universe must be 
done using methods from the general theory of relativity, which is in essence a 
theory of gravitation developed by Albert Einstein between 1911 and 1915. The 
mathematical level of this theory is beyond the level of this text, but we will try 
to appreciate how the theory helps us to understand the structure and evolution of 
the universe. 

The special theory of relativity arose from a thought experiment of Einstein’s 
in which he imagined trying to catch up with a light beam. The general theory 
also arose from a thought experiment. Here are Einstein’s words: 


I was sitting in a chair in the patent office at Bern when all of a sudden 
a thought occurred to me: If a person falls freely he will not feel his own 
weight. This simple thought made a deep impression on me. It impelled me 
toward a theory of gravitation. 


Figure 15.10 illustrates a freely falling person in two situations: in the Earth’s
gravity and in interstellar space where the gravitational field is negligibly weak.
In both cases the person is in an isolated chamber and therefore unable to use
outside objects to deduce the motion of the chamber. From within the chamber
the two cases look exactly equivalent; no measuring instrument operating entirely
within the chamber can distinguish between the two cases. An acceleration a = g
in a gravitational field g is equivalent to an acceleration of 0 in a negligible
gravitational field.

FIGURE 15.10 The effects of freely falling appear identical from within the chamber in
(a) the Earth’s gravity and (b) interstellar space.

FIGURE 15.11 The effects appear identical when the chamber is (a) at rest in a uniform
gravitational field and (b) accelerating in interstellar space.

It appears that an acceleration is able to “cancel out” the effects of a gravitational
field. Let us go one step further and ask whether an acceleration can produce the
effects of a gravitational field. Consider the situations illustrated in Figure 15.11.
In one case the observer is at rest near the Earth, where the gravitational field
is g. In the other case, the observer is in empty space where the gravitational
field is negligibly small, but the rocket engines are firing so that the chamber has
an acceleration a = −g. There are various experiments in the chamber: a scale
displays the weight of the observer (actually, the normal force exerted on the 
observer by the scale), a ball drops to the floor, a mass stretches a spring, and 
a pendulum oscillates. All of these experiments give identical results in the two 
chambers. Once again, there is no experiment that can be done within the chamber 
to distinguish the two cases. 

This leads us to the principle of equivalence: 


There is no local experiment that can be done to distinguish between the 
effects of a uniform gravitational field in a nonaccelerating inertial frame 
and the effects of a uniformly accelerating (noninertial) reference frame. 


By “local” we mean that the experiments must be done within the chamber, and
also that the chamber must be sufficiently small that the gravitational field is
uniform. Near the surface of the Earth, for example, not all g vectors inside the
chamber would be parallel; they point toward the center of the Earth, so there
would be a slight angle between the g vectors on opposite sides of the chamber. If
we make the chamber small, this effect is negligible and the g vectors everywhere
in the chamber, like the a vectors in the accelerating chamber, are parallel to one
another.

FIGURE 15.12 A source S emits a light wave that is recorded by a detector D in a
chamber that is accelerating upward.

The principle of equivalence appears in a slightly different (and weaker) 
form in introductory physics, where it is stated in terms of the equivalence of 
inertial and gravitational mass. That is, the mass m that appears in the expression 
F = ma (inertial mass) is identical to the mass m that appears in the expression 
F = GMm/r² (gravitational mass). It follows from this form of the principle
of equivalence that all objects, regardless of their masses, fall with the same 
acceleration in the Earth’s gravity. This was first tested by Galileo in the famous 
(and perhaps apocryphal) experiment in which he dropped two different masses 
from the top of the leaning tower of Pisa and observed them to fall at the same rate. 
In recent years other more precise experiments have established the equivalence 
of gravitational and inertial mass to about 1 part in 10¹¹.

Einstein realized that the principle of equivalence applied not only to mechan- 
ical experiments but to all experiments, even ones based on electromagnetic 
radiation. Consider the arrangement shown in Figure 15.12. At the top of the 
chamber is a light source that emits a wave of frequency f. At the bottom of the 
chamber and a distance H away is a detector that observes the wave and measures 
its frequency. When the light wave is emitted in the accelerating chamber, the 
source has speed v, which we assume to be small compared with the speed of 
light c. When it is detected, after a time of flight t ≈ H/c, the floor is moving with
a speed v + at. In effect there is a relative speed Δv = at between the source and
the detector, so there is a Doppler shift in the frequency given by Eq. 2.22:

$$ f' = f \sqrt{\frac{1 + \Delta v/c}{1 - \Delta v/c}} \approx f(1 + \Delta v/c) \tag{15.13} $$

or, in terms of the frequency difference Δf = f′ − f,

$$ \frac{\Delta f}{f} = \frac{\Delta v}{c} = \frac{at}{c} = \frac{aH}{c^2} \tag{15.14} $$


Now let us compare the result of this experiment with a similar one done in 
a chamber at rest in a uniform gravitational field g. If the results of the two 
experiments are to be identical (as required by the principle of equivalence), there 
must be a frequency shift given by Eq. 15.14 with a = g: 

$$ \frac{\Delta f}{f} = \frac{gH}{c^2} \tag{15.15} $$

The principle of equivalence thus predicts a change in frequency of a light wave
falling in the Earth’s gravity.

In 1959, R. V. Pound and G. A. Rebka allowed 14.4-keV photons from the
radioactive decay of ⁵⁷Co to fall down the Harvard tower, a distance of 22.6 m. The
expected fractional change in frequency, Δf/f = gH/c², was 2.46 × 10⁻¹⁵; that
is, to detect the effect, they had to measure the frequency or energy of the photon
at the bottom of the tower to a precision of about 1 part in 10¹⁵! The Mössbauer
effect (Section 12.9) makes it possible to achieve such a level of precision,
and the measured result was Δf/f = (2.57 ± 0.26) × 10⁻¹⁵, consistent with the
equivalence principle. Similar experiments based on comparisons between the
frequency of radiation emitted by satellites and received by ground stations have
confirmed the predictions of the principle of equivalence to a precision of about
1 part in 10⁴.
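
The size of the Pound-Rebka effect can be checked with a few lines of arithmetic.
The sketch below evaluates Eq. 15.15 for the tower height quoted above and compares
the prediction with the quoted measurement. The values of g and c are standard
values assumed here; they are not stated in the text.

```python
# Fractional gravitational frequency shift, Eq. 15.15: delta_f / f = g H / c^2
g = 9.80        # m/s^2, free-fall acceleration near the Earth's surface (assumed)
H = 22.6        # m, height of the Harvard tower quoted in the text
c = 2.998e8     # m/s, speed of light (assumed standard value)

shift = g * H / c**2
print(f"predicted delta_f/f = {shift:.3g}")

# Compare with the measured value and uncertainty quoted in the text.
measured, uncertainty = 2.57e-15, 0.26e-15
n_sigma = abs(measured - shift) / uncertainty
print(f"measurement differs from prediction by {n_sigma:.2f} standard deviations")
```

The prediction and the measurement differ by less than one quoted standard
deviation, which is what "consistent with the equivalence principle" means here.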

Because the Global Positioning System (GPS) relies on frequency measure- 
ments on the surface of the Earth from transmitters in orbiting satellites, its 
accuracy depends on applying a correction due to the gravitational frequency shift 
predicted by general relativity. Without this correction, errors in the GPS locating 
system of roughly 10 km per day would accumulate. 
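
A rough estimate shows why the accumulated error is of this size. The sketch below
applies the potential-difference form of the frequency shift to a GPS satellite. The
orbit radius (about 26,600 km from the center of the Earth), the Earth's radius, and
the value of GM for the Earth are assumed typical values that do not appear in the
text. Only the gravitational shift is included; the orbital motion of the satellite
contributes a smaller special-relativistic shift of opposite sign that is ignored
here.

```python
GM_EARTH = 3.986e14      # m^3/s^2, G times the Earth's mass (assumed)
R_EARTH = 6.371e6        # m, mean radius of the Earth (assumed)
R_ORBIT = 2.656e7        # m, assumed radius of a GPS orbit
C = 2.998e8              # m/s, speed of light
SECONDS_PER_DAY = 86400.0

def potential(r):
    """Newtonian gravitational potential (energy per unit mass) at radius r."""
    return -GM_EARTH / r

# Potential of the source (satellite) minus potential of the detector (ground).
delta_V = potential(R_ORBIT) - potential(R_EARTH)
frac_shift = delta_V / C**2                    # fractional frequency shift
clock_offset = frac_shift * SECONDS_PER_DAY    # seconds gained per day
range_error_km = clock_offset * C / 1000.0     # equivalent distance error

print(f"fractional shift = {frac_shift:.3g}")
print(f"clock offset per day = {clock_offset * 1e6:.1f} microseconds")
print(f"equivalent range error = {range_error_km:.1f} km per day")
```

Under these assumptions the uncorrected clock offset corresponds to a distance error
of order 10 km per day.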

Note that in Eq. 15.15 the frequency shift depends on the difference in
gravitational potential ΔV (potential energy per unit mass) between the source
and the detector:

$$ \Delta V = \frac{\Delta U}{m} = \frac{mgH}{m} = gH \tag{15.16} $$


For the satellite, even though the gravitational field through which the radiation 
travels is not uniform, the same conclusion holds: the frequency shift depends 
on the difference in gravitational potential between the source and the observer. 
Consider, for example, light leaving the surface of a star of mass M and radius R. 
The gravitational potential at the surface is V = —GM/R. If the light is observed 
on the Earth, where the gravitational potential is negligible compared with that of 
the star, the frequency shift is 

$$ \frac{\Delta f}{f} = \frac{\Delta V}{c^2} = -\frac{GM}{Rc^2} \tag{15.17} $$

Photons climbing out of a star’s gravitational field lose energy and are therefore 
shifted to smaller frequencies or longer wavelengths (red shifted). This effect is 
difficult to observe for two reasons: (1) the motion of the star causes a Doppler 
shift that is generally greater than the gravitational shift, and (2) the spectral lines 
are Doppler broadened by the thermal motion of the atoms near the surface of a 
star (see Section 10.4). Nevertheless, the effect has been confirmed for a few stars 
including the Sun. 


Example 15.1

The Lyman α line in the hydrogen spectrum has a wavelength of 121.5 nm. Find the
change in wavelength of this line in the solar spectrum due to the gravitational
shift.

Solution

From Eq. 15.17, we have

$$ \frac{\Delta\lambda}{\lambda} = -\frac{\Delta f}{f} = \frac{GM}{Rc^2} = \frac{(6.67 \times 10^{-11}\ \mathrm{N \cdot m^2/kg^2})(1.99 \times 10^{30}\ \mathrm{kg})}{(6.96 \times 10^{8}\ \mathrm{m})(3.00 \times 10^{8}\ \mathrm{m/s})^2} = 2.12 \times 10^{-6} $$

The shift in wavelength is

$$ \Delta\lambda = (2.12 \times 10^{-6})(121.5\ \mathrm{nm}) = 0.26\ \mathrm{pm} $$

This shift in wavelength is small in comparison with effects such as the thermal
broadening of the Sun’s spectral lines (see Problem 6).


FIGURE 15.13 (a) The path through flat spacetime of a particle moving at constant
velocity. (b) The path through flat spacetime of an accelerating particle. (c) The
path of a particle through spacetime curved by matter.
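
The arithmetic of Example 15.1 is easy to reproduce. The sketch below wraps the
magnitude of Eq. 15.17 in a small function (the function name is ours, chosen for
illustration) and uses the same rounded constants as the example.

```python
G = 6.67e-11      # N m^2/kg^2, gravitational constant
C = 3.00e8        # m/s, speed of light
M_SUN = 1.99e30   # kg, mass of the Sun
R_SUN = 6.96e8    # m, radius of the Sun

def gravitational_redshift(mass, radius):
    """Fractional wavelength shift GM/(R c^2) for light leaving a star's surface."""
    return G * mass / (radius * C**2)

frac = gravitational_redshift(M_SUN, R_SUN)
lyman_alpha_nm = 121.5
delta_lambda_nm = frac * lyman_alpha_nm

print(f"delta_lambda / lambda = {frac:.3g}")
print(f"shift of Lyman alpha  = {delta_lambda_nm * 1000:.2f} pm")
```

The same function can be applied to any star for which the mass and radius are
known, provided the field is weak enough that GM/Rc² is much less than 1.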


Space and Time in General Relativity 


From special relativity we learn that the laws of physics must be the same in 
all inertial frames, and that there is no preferred inertial frame relative to which 
it is possible to determine the absolute velocity of an observer. From this point 
of view, it seems that an accelerated (noninertial) frame is a preferred frame, 
because it is possible to determine the absolute acceleration. In the general theory, 
Einstein sought to remove this restriction, so that motion would be relative for 
all observers, even accelerated ones. The principle of equivalence, which tells us 
that we can’t distinguish between acceleration and a gravitational field, removes 
acceleration from its privileged role. 

Ultimately, general relativity is a theory of geometry. The motion of a particle 
is determined by the properties of the space and time coordinates through which 
it moves. Because space and time are intimately coupled in relativity (see, 
for example, the Lorentz transformation in Section 2.5), we regard them as 
components of a combined coordinate system called spacetime. 

The equivalence between accelerated motion and gravity suggests a relationship 
between spacetime coordinates and gravity. In the classical description, we would 
say that the presence of matter sets up a gravitational field, which then determines 
how objects move in response to that field. According to general relativity, we 
say that the presence of matter (and energy) causes spacetime to warp or curve; 
the motion of particles is then determined by the shape of the coordinate system. 
It is sometimes said that “Geometry tells matter how to move, and matter tells 
geometry how to curve.” From the configuration of mass and energy, general 
relativity gives us a procedure for calculating the curvature of spacetime, and the 
motion of a particle or a light beam then follows directly. 

Let’s consider the simple case of a particle that moves in only one 
dimension—for example, a bead sliding without friction on a straight wire. We 
can plot the motion of the bead on an xt coordinate system, which is the two- 
dimensional spacetime of the bead. For example, if the bead moves with constant 
velocity along the wire from position x_A at time t_A to position x_B at time t_B, the
motion in spacetime is represented by the straight line shown in Figure 15.13a. 

Now suppose the wire is mounted vertically in an accelerating chamber in a 
gravity-free environment. To an observer in the chamber, the bead will appear to 
accelerate downward as the chamber is accelerated upward. The path in spacetime 
is now curved, as suggested by Figure 15.13b.

If the acceleration is replaced by the equivalent gravitational field, the motion 
of the bead, as observed from inside the chamber, is exactly the same—the bead 
appears to accelerate downward. General relativity describes this situation as a 
change in the shape of spacetime; the presence of matter (which classically we 
would describe as the source of the gravitational field) distorts the xt spacetime as 
indicated in Figure 15.13c. If we imagine the spacetime coordinate system as a 
grid laid out on a rubber sheet, the gravitating matter stretches the sheet, and the 
particle moves from A to B along the most direct path in the curved spacetime. 

It is convenient to define the spacetime interval ds, in effect the separation 
between two events (such as the particle passing through successive points) in 
two-dimensional spacetime, as 


$$ (ds)^2 = (c\,dt)^2 - (dx)^2 \tag{15.18} $$

This quantity is invariant under the Lorentz transformation, as you can prove by
substituting dx′ and dt′ from Eq. 2.23. The trajectory of the particle in spacetime
can be regarded as a collection of infinitesimal intervals. The particle is merely
following the contour of spacetime, so the interval serves both to define the
trajectory and to represent the shape of spacetime. The extension to three spatial
dimensions gives an interval

$$ (ds)^2 = (c\,dt)^2 - (dx)^2 - (dy)^2 - (dz)^2 \tag{15.19} $$
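
The invariance of the two-dimensional interval can also be checked numerically. The
sketch below applies the standard Lorentz transformation for relative motion along
x to an arbitrary pair of displacements (the sample values of dx and dt are
hypothetical) and compares the interval before and after the transformation.

```python
import math

C = 2.998e8  # m/s, speed of light

def lorentz(dx, dt, u):
    """Transform displacements (dx, dt) to a frame moving with speed u along x."""
    gamma = 1.0 / math.sqrt(1.0 - (u / C) ** 2)
    return gamma * (dx - u * dt), gamma * (dt - u * dx / C**2)

def interval_squared(dx, dt):
    """Two-dimensional spacetime interval (ds)^2 = (c dt)^2 - (dx)^2."""
    return (C * dt) ** 2 - dx ** 2

dx, dt = 100.0, 1.0e-6          # hypothetical separation: 100 m and 1 microsecond
s2 = interval_squared(dx, dt)

results = []
for beta in (0.1, 0.5, 0.9):    # relative speed as a fraction of c
    dx_p, dt_p = lorentz(dx, dt, beta * C)
    results.append(interval_squared(dx_p, dt_p))
    print(f"u = {beta}c: dx' = {dx_p:.2f} m, dt' = {dt_p:.3e} s, "
          f"(ds')^2 = {results[-1]:.4f} m^2")

print(f"original (ds)^2 = {s2:.4f} m^2")
```

Although dx′ and dt′ change from frame to frame, the combination (c dt′)² − (dx′)²
agrees with the original value to within rounding error.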


To characterize a “curved” four-dimensional spacetime, we might write the 
interval as 


$$ (ds)^2 = g_0 (c\,dt)^2 - g_1 (dx)^2 - g_2 (dy)^2 - g_3 (dz)^2 \tag{15.20} $$


where the four coefficients g_i describe the curvature of the spacetime and its
deviation from a Euclidian nature (for which all g_i = 1).

The interval of Eq. 15.19 is characteristic of our familiar Euclidian space, 
which we call “flat.” Figure 15.14 summarizes some of the characteristics of 
that space: a straight line is the shortest distance between two points, the sum of 
the angles of a triangle is 180°, parallel lines never meet, the ratio between the 
circumference and the diameter of a circle is π, and so forth.

Figure 15.15 shows a curved non-Euclidian geometry, the surface of a sphere. 
Here the shortest distance between points is an arc of a great circle, the sum 
of the angles of a triangle is greater than 180°, parallel lines can meet, and the 
ratio between the circumference and the diameter of a circle is less than π. The
saddle-shaped geometry shown in Figure 15.16 has a different kind of curvature,
in which the ratio between the circumference and the diameter of a circle is greater
than π.
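
These statements about circles can be made quantitative. On a sphere of radius R, a
circle whose radius r is measured along the surface has circumference 2πR sin(r/R).
On an idealized surface of constant negative curvature (an assumption made here; the
saddle of Figure 15.16 need not have constant curvature) the corresponding result is
2πR sinh(r/R). The sketch below evaluates the ratio of circumference to diameter in
both cases.

```python
import math

def ratio_on_sphere(r, R):
    """Circumference/diameter for a circle of surface radius r on a sphere of radius R."""
    x = r / R
    return math.pi * math.sin(x) / x

def ratio_on_saddle(r, R):
    """Same ratio on a surface of constant negative curvature with scale R."""
    x = r / R
    return math.pi * math.sinh(x) / x

R = 1.0
for r in (0.001, 0.1, 0.5, 1.0):
    print(f"r/R = {r:5.3f}: sphere {ratio_on_sphere(r, R):.6f}, "
          f"flat {math.pi:.6f}, saddle {ratio_on_saddle(r, R):.6f}")
```

For circles much smaller than R both ratios approach π, which is the sense in which
a small enough region of a curved surface looks flat.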

To appreciate the importance of curved spacetime, consider the experiment 
illustrated in Figure 15.17. A light beam is emitted by a source in the chamber and 
travels across the chamber to the opposite wall. If the chamber is in an inertial 
frame and free from gravitational fields, the beam travels horizontally across the 
chamber and strikes the opposite wall at the same height above the floor as the 
source. (This holds even if the chamber moves at constant velocity, as you can 
prove using special relativity.) Observers inside and outside the chamber agree 
on this conclusion. 

If the chamber accelerates, the situation is different. Suppose the light is 
emitted when the chamber is at rest, relative to a particular inertial frame. 
The light beam then has no transverse component of velocity in this frame 
and moves horizontally. As the chamber accelerates, the light beam acquires 
no transverse velocity component, but the chamber’s velocity increases. To the 
inertial observer, the beam travels along a horizontal straight line, but the chamber 
accelerates forward so that the beam strikes the opposite wall at a lower height 
than the source. To an observer in the chamber, the beam appears to follow the 
curved path shown in Figure 15.17a and strikes the wall at a lower height than 
the source. 
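
It is instructive to estimate how far below the source the beam strikes the wall.
In the inertial frame the light crosses a chamber of width W in a time t ≈ W/c,
during which the chamber (starting from rest) moves a distance at²/2. The sketch
below evaluates this for an assumed chamber width of 3 m and an acceleration equal
to g; both numbers are illustrative choices, not values from the text.

```python
C = 2.998e8    # m/s, speed of light
a = 9.80       # m/s^2, acceleration of the chamber (assumed equal to g)
W = 3.0        # m, width of the chamber (hypothetical)

t = W / C                  # time for the light to cross the chamber
drop = 0.5 * a * t**2      # distance the chamber moves during the crossing

print(f"crossing time = {t:.3e} s")
print(f"beam strikes the wall {drop:.2e} m below the height of the source")
```

The displacement is many orders of magnitude smaller than the diameter of an atom,
which is why the bending of light is not noticed in laboratory-sized chambers.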

According to the principle of equivalence, the observer in the chamber should
record the same outcome if the chamber is at rest in a uniform gravitational field
(Figure 15.17b). General relativity explains this observation through the curvature
of spacetime in the vicinity of the mass that is responsible for the gravitational
field. The light beam is merely seeking out the shortest possible path in the
curved spacetime, just like an ant crawling along a line on the spherical surface
of Figure 15.15. All paths in the curved spacetime are curved.

FIGURE 15.17 (a) According to an observer in an accelerating chamber, the light
beam follows a curved path. (b) An observer at rest in a uniform gravitational
field finds the same outcome.

It is tempting to seek an alternative explanation for the outcome shown in 
Figure 15.17. For instance, we can assume each photon in the light beam to have 
an effective mass m = E/c² and then calculate its trajectory in the gravitational
field as we would that of any classical particle of mass m. However, as we discuss 
in the next section, this method gives results that do not agree with observations 
for the path of photons in a gravitational field. The curvature of spacetime, which 
provides the correct explanation, is an inescapable consequence of the principle 
of equivalence. 

General relativity gives a relationship between curvature and the density of 
mass and energy in space, which can be written symbolically as 


$$ \text{curvature of space} = \frac{8\pi G}{c^4}\,(\text{mass-energy density}) \tag{15.21} $$


Note that this expression incorporates gravitation (Newton’s constant G) and 
special relativity (the speed of light c). If no matter or energy is present, the 
right-hand side is zero; as a result, the curvature is zero and space is flat. In the 
limit of classical kinematics (c → ∞) and in the limit of weak gravitational fields
(G → 0), space is nearly flat and we can safely use the Newtonian gravitational
theory. This is equivalent to saying that if we take a small enough region of the
sphere of Figure 15.15, or if we increase its radius to a sufficiently large value,
the geometry is approximately Euclidian. Just as classical kinematics can be
regarded as the limiting case of special relativity (for low speeds), so can classical
gravitation be regarded as the limiting case of general relativity (for weak fields).
In calculating the orbit of an Earth satellite or the trajectory of a space probe to
Mars, Newton’s theory gives entirely satisfactory results. Close to the Sun and to
compact or massive stars, the curving of space can lead to observable effects, as
we discuss in the next section.


15.5 TESTS OF GENERAL RELATIVITY


Newtonian gravitation and Einstein’s general relativity each give predictions that
can be tested against experiment, but under most circumstances the differences
between the two predictions are extremely small. At the surface of the Earth,
space is curved by only about 1 part in 10⁹; even at the surface of the Sun, the
curvature is only about 1 part in 10⁶.

Nevertheless, there are experiments we can do that are precise enough to detect
the difference between flat spacetime and curved spacetime. In this section we
discuss several of these experiments.
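
One rough way to see how small these effects are is to evaluate the dimensionless
ratio GM/Rc², the same combination that sets the gravitational frequency shift in
Eq. 15.17. Treating this ratio as a measure of "how curved" space is at the surface
of a body is an assumption made here for illustration; the masses and radii are
standard values.

```python
G = 6.67e-11   # N m^2/kg^2, gravitational constant
C = 3.00e8     # m/s, speed of light

# (mass in kg, radius in m); standard values assumed for illustration
bodies = {
    "Earth": (5.97e24, 6.37e6),
    "Sun": (1.99e30, 6.96e8),
}

def weak_field_parameter(mass, radius):
    """Dimensionless ratio GM/(R c^2) at the surface of a body."""
    return G * mass / (radius * C**2)

strength = {name: weak_field_parameter(m, r) for name, (m, r) in bodies.items()}
for name, value in strength.items():
    print(f"{name}: GM/(R c^2) = {value:.2e}")
```

The ratio is of order 10⁻⁹ for the Earth and 10⁻⁶ for the Sun, so in both cases
departures from Newtonian gravitation are small corrections.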


Deflection of Starlight


When a beam of light from a star passes close to the Sun, it is deflected from
its original direction, as shown in Figure 15.18. The star appears to be displaced
from its true position by an angle θ.

FIGURE 15.18 A light beam passing near the Sun is deflected. To an observer on
Earth, the star at A appears to be at B.

It is possible to analyze this situation using Newtonian gravitation and special
relativity by assigning the photons in the beam an effective mass m = E/c²
and assuming them to be deflected by the Newtonian gravitational force. The
experiment then looks very much like Rutherford scattering, and by analogy with
the Rutherford scattering formula (see Problem 9) it is possible to calculate the
deflection angle (in radians):

$$ \theta = \frac{2GM}{Rc^2} \tag{15.22} $$

where M is the mass of the Sun and R is its radius. Substituting the numbers gives
θ = 0.87″ as the prediction of special relativity and Newtonian gravitation.

General relativity gives a different view. Spacetime in the vicinity of the Sun
is curved, and the light beam is simply following the most direct path along
the curved spacetime (Figure 15.19 is a two-dimensional representation of this
effect). According to general relativity, the expected deflection is 1.74″, exactly
twice the value predicted by the Newtonian formula.

FIGURE 15.19 The path of a light beam from a star through curved spacetime.

Measuring this effect requires the observation of a beam of light, such as from
a star, that passes near the edge of the Sun. Starlight near the Sun can be observed
only during a total solar eclipse. In 1919, just a few years after Einstein completed
his general theory, two expeditions of British astronomers traveled to Africa and
to South America to observe the solar eclipse and to measure the apparent changes
in positions of stars whose light grazed the Sun. Their results for the deflection
angles, 1.98″ ± 0.18″ and 1.69″ ± 0.45″, gave strong support for the new general
theory. In the years since those early results, this experiment has been repeated at
nearly every total solar eclipse, and the overall agreement with general relativity
is within 10%. Radio emission from quasars has also been used to confirm this
effect, and here the agreement with general relativity is within 2%.

These experimental results give a clear distinction between Newtonian gravity
(even with special relativity included) and general relativity.

FIGURE 15.20 An electromagnetic signal travels between Earth and Venus at
superior conjunction.

FIGURE 15.21 Path of signal between Earth and Venus in curved spacetime.

FIGURE 15.22 The elliptical orbit of a planet about a star.

FIGURE 15.23 Precession of the perihelion (greatly exaggerated). After each orbit,
the perihelion advances by an angle Δφ.
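
The two predictions are easily evaluated. The sketch below computes the Newtonian
deflection of Eq. 15.22 for light grazing the Sun, doubles it to obtain the general
relativistic value, and expresses each 1919 measurement quoted above as a number of
standard deviations from each prediction. The solar mass and radius are the rounded
values used in Example 15.1.

```python
import math

G = 6.67e-11      # N m^2/kg^2, gravitational constant
C = 3.00e8        # m/s, speed of light
M_SUN = 1.99e30   # kg
R_SUN = 6.96e8    # m

def to_arcsec(angle_rad):
    """Convert an angle from radians to seconds of arc."""
    return math.degrees(angle_rad) * 3600.0

theta_newton = to_arcsec(2 * G * M_SUN / (R_SUN * C**2))   # Eq. 15.22
theta_gr = 2 * theta_newton                                # twice the Newtonian value

print(f"Newtonian prediction:          {theta_newton:.2f} arcsec")
print(f"general relativity prediction: {theta_gr:.2f} arcsec")

# 1919 eclipse results quoted in the text: (value, uncertainty) in arcsec
for value, sigma in ((1.98, 0.18), (1.69, 0.45)):
    print(f"measured {value} +/- {sigma}: "
          f"{abs(value - theta_gr) / sigma:.1f} sigma from GR, "
          f"{abs(value - theta_newton) / sigma:.1f} sigma from Newton")
```

With these rounded constants the general relativistic value comes out as 1.75″,
within rounding of the 1.74″ quoted above.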


Delay of Radar Echoes 


When a line joining Earth and another planet (Venus, for example) passes through 
the Sun, the situation is known as “superior conjunction” and is illustrated in 
Figure 15.20. Based on the orbits of Earth and Venus, we can calculate how long 
it takes a radar signal sent from Earth to be reflected from Venus and return to 
Earth (about 20 minutes). Near superior conjunction, the signal passes close to the 
Sun, and therefore, according to general relativity, it does not travel in a Euclidian 
straight line, but instead follows a path through curved spacetime (Figure 15.21). 
It therefore takes the signal a bit longer than the expected time to make the round 
trip (think of the time intervals as extended as the beam passes close to the Sun, 
thus lengthening the time to travel the path). This time delay is expected to be 
about 10⁻⁴ s, and it has been confirmed to within a few percent (the limit on
precision being uncertainties in the surface of Venus, since we don’t know if the 
signal is being reflected from a mountain or a valley). More precise experiments 
were done in the late 1970s using a signal sent between Earth and the Viking 
landers on Mars. In this case, the result was consistent with general relativity to 
within about 0.1%. Signals from NASA’s Cassini spacecraft, which entered orbit 
around Saturn in 2004, have provided the most sensitive test of the time delay, 
agreeing with the predictions of general relativity to within 0.002%. 


Precession of Perihelion of Mercury 


Consider a simple planetary system, shown in Figure 15.22, consisting of a single 
planet in orbit about a star of mass M such as the Sun. According to Newtonian 
gravitation, the orbit is a perfect ellipse with the star at one focus. The equation 
of the ellipse is 


$$ r = r_{\min} \frac{1 + e}{1 + e \cos\phi} \tag{15.23} $$

where r_min is the minimum distance between planet and star and e is the
eccentricity of the orbit (the degree to which the ellipse is noncircular; e = 0 for a
circle). When r = r_min, the planet is said to be at perihelion; this occurs regularly,
at exactly the same point in space, whenever φ = 0, 2π, 4π, .... According to
general relativity, the orbit is not quite a closed ellipse; the curved spacetime
near the star causes the perihelion direction to precess somewhat, as shown in
Figure 15.23. After completing one orbit, the planet returns to r_min, but at a
slightly different φ. The difference Δφ can be computed from general relativity,
according to which the orbit is

$$ r = r_{\min} \frac{1 + e}{1 + e \cos(\phi - \Delta\phi)} \tag{15.24} $$

where

$$ \Delta\phi = \frac{6\pi GM}{c^2 r_{\min} (1 + e)} \tag{15.25} $$


For the Sun, 6πGM/c² = 27.80 km, and thus even for the smallest value of r_min
(for Mercury, 46 × 10⁶ km) Δφ is of order 10⁻⁶ rad, an extremely small quantity.
However, this effect is cumulative; that is, it builds up orbit after orbit, and after N
orbits, the perihelion has advanced by NΔφ. We usually express this precession
in terms of the total precession per century (per 100 Earth years), and some
representative values are shown in Table 15.1.
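
The general relativity entries of Table 15.1 follow directly from Eq. 15.25. The
sketch below uses the value 6πGM/c² = 27.80 km quoted above together with the
number of orbits per century, the eccentricity, and the perihelion distance listed
in the table for the five planets.

```python
import math

K_SUN_KM = 27.80                              # 6 pi G M / c^2 for the Sun, in km
ARCSEC_PER_RAD = math.degrees(1.0) * 3600.0   # seconds of arc in one radian

# planet: (orbits per century N, eccentricity e, r_min in km), from Table 15.1
planets = {
    "Mercury": (415.2, 0.206, 46.0e6),
    "Venus": (162.5, 0.0068, 107.5e6),
    "Earth": (100.0, 0.017, 147.1e6),
    "Mars": (53.2, 0.093, 206.7e6),
    "Jupiter": (8.43, 0.048, 740.9e6),
}

def precession_per_orbit(e, r_min_km):
    """Perihelion advance per orbit in radians, Eq. 15.25."""
    return K_SUN_KM / (r_min_km * (1.0 + e))

precession = {
    name: n * precession_per_orbit(e, r_min) * ARCSEC_PER_RAD
    for name, (n, e, r_min) in planets.items()
}
for name, value in precession.items():
    print(f"{name:8s} {value:6.2f} arcsec per century")
```

The small differences from the tabulated values (for example, 42.9″ rather than
43.0″ for Mercury) reflect the rounding of the input data.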

The expected precessions are very small, of the order of seconds of arc per 
century, but nevertheless have been measured with great accuracy; for the three 
planets closest to the Sun, and for the asteroid Icarus, the measured values are 
in agreement with the predictions of general relativity. In the best case, the 
agreement is within about 1%. 

These experiments are very difficult to do because (except for Mercury and 
Icarus) the eccentricities are small and locating the perihelion is difficult. A 
more serious problem is that other effects, not associated with general relativity, 
also cause an apparent precession of the perihelion. In the case of Mercury, the 
observed precession is actually about 5601″ per century; of that, 5026″ are due to
the precession of the Earth’s equinox (a classical Newtonian effect of the spinning
Earth) and 532″ are due to the gravitational pull of the other planets on Mercury
(also a classical Newtonian effect). Only the difference of 43″ is due to general
relativity. 


Gravitational Radiation 


Just as an accelerated charge emits electromagnetic radiation that travels with the
speed of light, an accelerated mass emits gravitational radiation that also travels
with the speed of light. In effect, gravity waves are ripples that travel through
spacetime. Waves produced by such motions as the planets around the Sun are
exceedingly weak and beyond any hope of detection. Cataclysmic events in the
universe, such as supernova explosions, and highly accelerated systems, such as
compact binary objects, may produce observable gravitational waves. Detection
of these waves would provide another important confirmation of the general
relativity theory.


TABLE 15.1 Precession of Perihelia

                                                           NΔφ (arc seconds per century)
Planet    N (orbits per century)   e        r_min (10⁶ km)   General Relativity   Observed
Mercury   415.2                    0.206     46.0            43.0                 43.1 ± 0.5
Venus     162.5                    0.0068   107.5             8.6                  8.4 ± 4.8
Earth     100.0                    0.017    147.1             3.8                  5.0 ± 1.2
Mars       53.2                    0.093    206.7             1.4
Jupiter     8.43                   0.048    740.9             0.06
Icarus     89.3                    0.827     27.9            10.0                  9.8 ± 0.8

In analogy with the effect of a passing electromagnetic wave on a charge, a 
passing gravitational wave could be detected by its effect on matter. Several anten- 
nas have been built to search for gravity waves, but no conclusive experimental 
evidence has yet been obtained. Indirect evidence has come from the observations 
of the change in the orbital period of a binary pulsar (see next section). Interfer- 
ometric techniques are being used to build new detectors to search for gravity 
waves. The Laser Interferometer Gravitational-wave Observatory (LIGO), which 
began operation in 2001, consists of two installations (located in the states of 
Washington and Louisiana) whose interferometer arms are 4km in length. A 
passing gravitational wave would cause a small change in the length of one arm 
relative to the other, which would be detected through a change in the fringe 
pattern similar to that of the Michelson interferometer (Section 2.2). 


15.6 STELLAR EVOLUTION AND BLACK HOLES 


Although the large gravitational field of the Sun has provided several good tests 
of general relativity, the most stringent tests will come from measurements in 
even larger gravitational fields, where the curvature of spacetime is significantly 
greater. Such large gravitational fields can occur following the collapse of a star 
into a more compact object: a white dwarf, a neutron star, or a black hole. 

We considered the collapse of an ordinary star like the Sun to a white dwarf star
as an example of the application of Fermi-Dirac statistics in Section 10.7. As the
supply of hydrogen fuel in a star begins to be used up, the star will contract because
the radiation pressure that opposes gravitational collapse is reduced. Eventually
the stable white dwarf stage can be reached, at which the Pauli principle applied
to the electrons prevents further collapse. The average density of a white dwarf
star such as Sirius B is about 10⁹ kg/m³, which is about 10⁶ times the average
density of the Sun.





FIGURE 15.24 The radio signals from two different pulsars. The top signal is the
record of the first pulsar discovered. (Both panels show signal versus time in
seconds; the marked pulse spacings are 1.34 s in the top panel and 0.714 s in the
bottom panel.)

If the star has a mass greater than about 1.4 solar masses (the Chandrasekhar
limit), the gravitational force is sufficient to overcome the Pauli repulsion of the
electrons, and further collapse can occur. For a star of this mass, the Fermi energy
of the electrons (Eq. 10.50) is 0.30 MeV. Higher-energy electrons in the tail of
the Fermi-Dirac distribution will have sufficient energy to produce the inverse
beta-decay reaction:

$$ e^- + p \rightarrow n + \nu_e $$

for which the threshold energy is 0.782 MeV, not too far above E_F. This reaction
removes some electrons from the star, reducing the effects of Pauli repulsion, and
allowing the star to collapse a bit. The Fermi energy increases, pushing more
electrons above the 0.782-MeV threshold, resulting in more electrons being lost,
and so on, until all (or very nearly all) of the electrons vanish. The star is now
composed of neutrons, instead of protons and electrons. The Pauli repulsion of the
electrons no longer can oppose gravitational collapse, and so the star contracts until
the Pauli principle applied to the neutrons (which also obey Fermi-Dirac statistics)
prevents further collapse. As we calculated in Section 10.7, a neutron star of 1.5
solar masses would have a radius of 11 km and a density of about 5 × 10¹⁷ kg/m³.
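
The density quoted for the neutron star follows from its mass and radius alone. The
sketch below treats the star as a uniform sphere (a simplifying assumption) and
uses 1.99 × 10³⁰ kg for the solar mass.

```python
import math

M_SUN = 1.99e30          # kg, mass of the Sun

mass = 1.5 * M_SUN       # kg, neutron star of 1.5 solar masses
radius = 11.0e3          # m, radius of 11 km quoted in the text

volume = (4.0 / 3.0) * math.pi * radius**3
density = mass / volume  # mean density of a uniform sphere

print(f"mean density = {density:.2e} kg/m^3")
```

The result, about 5 × 10¹⁷ kg/m³, is roughly eight orders of magnitude larger than
the white dwarf density mentioned earlier in this section.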

Are these neutron stars merely figments of the physicist’s imagination or do 
they really exist? In 1967, radio astronomers at Cambridge University discovered 
an unusual signal among their observations—a regular pulsation, such as is shown 
in Figure 15.24, with a period of 1.34 s. No previously known astronomical object 
could produce such sharp and regular pulses, and at first the Cambridge group 
suspected that they might have discovered signals from an extraterrestrial intelli- 
gent civilization. (The object emitting the pulses was at first called LGM-1; LGM 
stands for “Little Green Men.”) This notion was later discarded (unfortunately) 
and the object became known as a pulsar. Since 1967, hundreds of other pulsars 
have been discovered; all have extremely regular periods typically in the range 
0.01–1 s.

The connection between pulsars and neutron stars was made soon after their 
discovery. The collapse of a rotating star to a neutron star causes the neutron star 
to rotate much more rapidly. Angular momentum is conserved during the collapse 
(see Problem 11), so the rotational angular velocity increases as the rotational 
inertia decreases. A relatively slow rotation rate of the original star can become a 
very rapid rotation rate for the neutron star. 
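
A simple estimate shows how large the increase can be. If the star is modeled as a
uniform sphere that keeps all of its mass during the collapse, its rotational
inertia is proportional to R², so conservation of angular momentum gives a final
period P_f = P_i (R_f/R_i)². The initial radius and rotation period below are
hypothetical Sun-like values chosen only for illustration; a real collapse involves
mass loss and a nonuniform star, so the result is an order-of-magnitude guide at
best.

```python
SECONDS_PER_DAY = 86400.0

R_initial = 7.0e8                        # m, assumed initial radius (Sun-like)
R_final = 11.0e3                         # m, neutron star radius from the text
P_initial = 25.0 * SECONDS_PER_DAY       # s, assumed initial rotation period

# I * omega is conserved and I is proportional to M R^2 for a uniform sphere,
# so the period scales as the square of the radius when M is unchanged.
P_final = P_initial * (R_final / R_initial) ** 2

print(f"initial period = {P_initial:.3g} s")
print(f"final period   = {P_final:.3g} s")
```

Even a rotation period of several weeks is reduced to a small fraction of a second
under these assumptions.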

The intense magnetic field of such a rapidly rotating object traps any emitted 
charged particles and accelerates them to high speeds, especially near the magnetic 
poles, where they give off radiation (Figure 15.25). As the star rotates, this beam 
of emitted radiation sweeps around like a searchlight or a lighthouse, and we see 
a pulse of radiation whenever the beam sweeps through the Earth. The observed 
interval between the pulses is, according to this interpretation, the rotational 
period of the neutron star. 

If this explanation of a pulsar as a rotating neutron star is correct, we ought to
see the pulsars slowing down somewhat, as the radiated energy is compensated
by a decrease in the neutron star’s rotational kinetic energy. This effect has been
seen for nearly all pulsars, and amounts to about 1 part in $10^9$ per day.

Pulsars have now been observed at many different wavelengths (optical, X
ray, γ ray, radio) and with such great precision of timing that the slowing down
of $10^{-9}$ per day is easily observable.

Although the exact mechanism of the collapse of a star to a neutron star is not 
yet understood, we suspect that the violent explosions known as supernovas leave 
a neutron star as a remnant. In 1054, Chinese astronomers observed a supernova 
explosion (which they called a “guest star”) that was visible in the daytime over 
many days. Today we see the expanding shell of that explosion as the Crab 
Nebula (Figure 15.26). At the center of the Crab Nebula is a pulsar, rotating with 
a frequency of 30 Hz. It is remarkable that none of the many photographs of the 
Crab that were taken before 1967 revealed this pulsar blinking on and off every 
0.033 s; all of these photographs were taken over long exposure times, and so 
the pulsations were not observable. When careful measures are taken, however, 
the blinking effect can be seen quite clearly (Figure 15.27). This suggests that, at 
least in this instance, pulsars may be identified as supernova remnants.

FIGURE 15.25 Charged particles trapped by the magnetic field lines of a neutron
star are given large accelerations near the magnetic poles, from which a
directional radiation beam emerges. If this radiation beam intercepts the Earth
as the neutron star rotates, we see it as a pulse of radiation.

FIGURE 15.26 The Crab Nebula, remnant of a supernova observed in the year 1054.

FIGURE 15.27 The visible pulsar of the Crab Nebula. The two exposures show the
pulsar blinking on and off relative to the other stars in the photograph.


A Binary Pulsar 


In 1974, an unusual pulsar was discovered. The period of the pulsar was measured 
to be 59 ms, making it among the fastest observed up to that time. More surprising, 
the pulse rate appeared to be slowing down by about 0.1% per hour, but later was 
observed to be increasing by about the same amount. It was quickly realized that 
the decrease and increase of the pulse rate could be explained as a Doppler shift 
if the neutron star were moving first away from and later toward the Earth. To 
move in this way, the pulsar must be in orbit around an unseen companion. Thus 
we have a pulsar as part of a binary star system, or a “binary pulsar.” 

The orbital period of the binary system was determined to be about 8 hours. 
This is an extremely short period; for example, it is more than 250 times shorter 
than the orbital period of Mercury, the fastest moving planet in our solar system. 
To have such a short orbital period, the pulsar must be orbiting very close to its 
companion (which is believed to be another neutron star). At such close distances, 
the curvature of spacetime is large and the effects of general relativity should 
be measurable. In effect, the binary pulsar provides us with a “general relativity 
laboratory.” 
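
To see how close "very close" is, Kepler's third law gives a rough estimate of the orbital separation. This is a Newtonian sketch only. The total mass of 2.8 solar masses (two neutron stars of about 1.4 solar masses each) and Mercury's 88-day period are assumptions introduced here, not values from the text.

```python
import math

G = 6.674e-11        # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 1.989e30     # solar mass, kg

def orbital_separation(total_mass_kg, period_s):
    """Semimajor axis from Kepler's third law: a^3 = G*M*T^2 / (4*pi^2)."""
    return (G * total_mass_kg * period_s**2 / (4 * math.pi**2)) ** (1 / 3)

total_mass = 2.8 * M_SUN   # ASSUMPTION: two neutron stars, ~1.4 solar masses each
period = 8 * 3600.0        # s, "about 8 hours" from the text

a = orbital_separation(total_mass, period)
mercury_ratio = (88 * 24) / 8   # ASSUMPTION: Mercury's period taken as 88 days

print(f"separation ~ {a:.1e} m")
print(f"Mercury's period is ~{mercury_ratio:.0f} times longer")
```

With these inputs the separation comes out at a few solar radii, which is why the curvature of spacetime along the orbit is far larger than anything available in the solar system.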

Among the general relativity effects that have been observed in the binary 
pulsar system is the delay of the pulses due to the curvature of spacetime. This 
situation is similar to Figures 15.20 and 15.21, except that the pulsar (instead of 
Venus) is the origin of the signals and the curvature is due to the companion 
star (instead of the Sun). An effect analogous to the precession of Mercury’s 
perihelion has also been observed (except that, in the case of a star, the point of 
closest approach is called periastron rather than perihelion). The periastron of the 
binary pulsar changes by 4.23° per year, about 35,000 times more rapidly than 
that of Mercury; the change of the periastron is known to an accuracy of about 1
part in $10^6$, three orders of magnitude more precisely than Mercury’s.

The most remarkable observation from the binary pulsar is the gradual decrease of
its orbital period, which general relativity explains as caused by the emission of
gravitational radiation. Because of its large centripetal acceleration (its orbital
speed is about 0.001c), the binary pulsar should emit gravitational waves, and the
energy radiated away is compensated by a loss in the orbital energy. This loss
shortens the orbital period by 76 μs per year, or 67 ns per orbit, and the measured
change in the orbital period confirmed the prediction of general relativity to within 1%. In the
absence of direct observation, this is the strongest evidence yet obtained for the 
existence of gravitational radiation. For the discovery of the binary pulsar and its 
contributions to the study of gravitation, Joseph Taylor and Russell Hulse were 
awarded the 1993 Nobel Prize in physics. 
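
The yearly and per-orbit figures for the orbital decay can be checked against each other. The short sketch below assumes an orbital period of 7.75 hours; the text says only "about 8 hours," so this more precise value is an assumption introduced here.

```python
SECONDS_PER_YEAR = 3.156e7

def period_change_per_orbit(change_per_year_s, orbital_period_s):
    """Spread a yearly change in orbital period over the orbits in one year."""
    orbits_per_year = SECONDS_PER_YEAR / orbital_period_s
    return change_per_year_s / orbits_per_year

# ASSUMPTION: orbital period of 7.75 h (the text quotes "about 8 hours")
per_orbit = period_change_per_orbit(76e-6, 7.75 * 3600.0)
print(f"{per_orbit * 1e9:.0f} ns per orbit")
```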


Black Holes 


A neutron star is not the ultimate fate of the collapse of massive stars. Stars with 
masses less than two or three solar masses probably do end up as white dwarf 
stars or neutron stars. For more massive stars, the gravitational force is strong 
enough to overcome even the Pauli principle applied to the neutrons, and there 
is nothing to prevent the material in the star from suffering complete collapse 
down to a single point in space. To understand gravitational collapse, we must 
turn again to general relativity. 

Within a year after Einstein’s 1916 publication of the general theory, Karl
Schwarzschild worked out the solutions to the equations for the curvature of
spacetime near a spherically symmetric mass M. In spherical coordinates $(r, \theta, \phi)$,
the spacetime interval for this solution is

$$(ds)^2 = \left[1 - \frac{2GM}{c^2 r}\right](c\,dt)^2 - \frac{(dr)^2}{\left[1 - \dfrac{2GM}{c^2 r}\right]} - r^2 (d\theta)^2 - r^2 \sin^2\theta\,(d\phi)^2 \tag{15.26}$$

(Note that in the classical and zero-gravity limits $c \to \infty$ and $G \to 0$, the two
factors in square brackets become 1 and so disappear, leaving us with an interval in
spherical coordinates that is the exact analog of the three-dimensional interval expressed in
Cartesian coordinates, Eq. 15.19.) The radial part of this solution (the dr term) has
what appears to be a serious problem: the factor in the denominator can become
zero for a particular r, causing that term in the equation to “blow up.” This occurs
when r has the value

$$r_S = \frac{2GM}{c^2} \tag{15.27}$$

which is known as the Schwarzschild radius. None of the physical coordinates
actually “blows up” at $r = r_S$, and an object falling toward M would notice no
change in its motion as it crossed the Schwarzschild radius.

For external observers watching the falling object, the situation is very different.
As the object falls, general relativity predicts that its clocks would appear to run
ever more slowly, stopping completely when the object reaches $r_S$. The object
appears to be frozen forever at that location! While the object is falling, the light
it emits becomes increasingly red shifted, and the red shift becomes infinite at
$r = r_S$, so the object disappears from view! The outside observer can obtain no
information about the object once it passes through the Schwarzschild radius. For
that reason, the Schwarzschild radius is often called the event horizon; no external
observer can see into that horizon.

The falling object does encounter one crisis at $r_S$. At any time before crossing
$r_S$, the object could reverse its fall and escape from the gravitational pull of M (for
example, by firing its rockets). Once it passes $r_S$, no escape is possible. Inside $r_S$, the
escape speed exceeds the speed of light, and nothing (not even light) can escape.
No travel or communication is permitted from inside $r_S$ to the outside world.
However, the object inside $r_S$ can continue to exert a gravitational force on external
objects or, in the language of general relativity, to curve spacetime beyond $r_S$.
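
The slowing of clocks seen by a distant observer can be illustrated with the factor $1 - r_S/r$ that multiplies $(c\,dt)^2$ in the Schwarzschild interval. The sketch below treats the simpler case of a clock held at rest at radius r, which is an assumption made here for illustration (the falling object of the text is also moving); the function name is likewise introduced here.

```python
import math

def clock_rate(r_over_rs):
    """Proper time per unit coordinate time for a clock at rest at
    r = r_over_rs * r_S, i.e. sqrt(1 - r_S/r)."""
    if r_over_rs <= 1.0:
        raise ValueError("a clock can be held at rest only outside r_S")
    return math.sqrt(1.0 - 1.0 / r_over_rs)

for x in (10.0, 2.0, 1.1, 1.001):
    rate = clock_rate(x)
    # Light emitted at r is received far away with its wavelength
    # stretched by the inverse of the same factor.
    print(f"r = {x:g} r_S: clock rate {rate:.4f}, wavelength stretch {1 / rate:.1f}")
```

Both the clock rate and the received frequency go to zero as r approaches $r_S$, consistent with the frozen, infinitely red-shifted picture described above.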

An object whose mass M lies totally within the corresponding radius $r_S$ is said
to be a black hole. To form such an object requires that matter be compressed
to exceptional densities. Table 15.2 shows some values of $r_S$ for representative
objects. For the Earth to become a black hole, we would need to compress it into
a sphere of radius less than 1 cm, and the Sun would become a black hole only if
compressed to a 3-km radius! Nevertheless, it has been speculated that black holes
are the end products of the collapse of massive stars, and that tiny (atom-sized) 
black holes may have been formed by the extreme densities and pressures in the 
early universe. 
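
The Earth and Sun values follow directly from $r_S = 2GM/c^2$. A minimal sketch, using standard values for the masses (these are not given in this section and are supplied here):

```python
G = 6.674e-11   # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8     # speed of light, m/s

def schwarzschild_radius(mass_kg):
    """Schwarzschild radius r_S = 2GM/c^2, in meters."""
    return 2 * G * mass_kg / C**2

# Standard masses, supplied here for illustration
r_earth = schwarzschild_radius(5.972e24)   # Earth
r_sun = schwarzschild_radius(1.989e30)     # Sun

print(f"Earth: {r_earth * 100:.2f} cm")
print(f"Sun:   {r_sun / 1000:.2f} km")
```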

Far from a black hole, the gravitational field is Newtonian in character; the 
effects of curved spacetime are small, and the black hole cannot be distinguished 
from any other gravitating object. Close to a black hole (or close to any massive, 
compact object), the effects of curved spacetime can become significant. The 
discovery of a massive black hole could therefore provide another “laboratory” for 
testing the predictions of general relativity where the effects may be substantially 
larger than in the vicinity of the Sun. 

Many stars are members of binary systems, in which two stars orbit about their 
common center of mass. In many cases, a visible star appears to orbit with an 
invisible companion, and gases from the visible star emit intense X rays as they 
are accelerated toward the invisible companion. It is believed that these invisible 
companions in binary systems are black holes. Many such systems have been 
observed in our galaxy. 

By observing the rotational motion of galaxies, it is possible to deduce the
mass at the galactic center. For some galaxies, this mass turns out to be greater
than $10^9$ times the mass of the Sun. No known phenomenon other than a black
hole permits so much mass to be concentrated in so small a region. Similar
evidence derived from rotational motions suggests that even our near neighbor,
the Andromeda galaxy, has a black hole of a few million solar masses at its center.
Radio emissions from the center of our own Milky Way galaxy also suggest a
black hole of a few million solar masses.

TABLE 15.2 Black Hole Event Horizons

A surprising development in black hole theory occurred in 1974, when Stephen
Hawking showed that black holes could be sources of particle emission. According
to quantum mechanics, particle-antiparticle pairs can spontaneously appear, as
long as they exist for a short enough time that the uncertainty principle is not
violated. That is, the particles can “borrow” an energy of $2mc^2$ as long as the
loan is repaid (the particles vanish) within a time of at most $\Delta t \sim \hbar/2mc^2$. If
a particle-antiparticle pair arises outside the event horizon of a black hole, the black
hole’s gravitational field can provide the energy necessary to repay the loan so that the
particle and antiparticle can become real. Usually the particle and antiparticle
fall back into the black hole, restoring the energy balance. However, one of the
members of the pair may have enough energy to escape into the outside world.
The black hole thus appears to be emitting particles. In the process, the black hole
loses mass. The rate of mass loss is inversely proportional to the square of the mass of the black
hole; massive black holes that result from the collapse of stars emit particles at
too low a rate to be observed. However, tiny black holes of atomic or nuclear size, 
which may have been formed in the early evolution of the universe, could be very 
bright sources of radiation. 
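
To get a feeling for how brief the energy "loan" is, the uncertainty estimate $\Delta t \sim \hbar/2mc^2$ can be evaluated for particular pairs. The sketch below uses standard rest energies for the electron and the proton, which are supplied here as examples rather than taken from this section.

```python
HBAR = 1.0546e-34   # reduced Planck constant, J s
EV = 1.602e-19      # joules per electron-volt

def pair_lifetime(rest_energy_mev):
    """Longest time a virtual pair can exist: hbar / (2 m c^2), in seconds."""
    borrowed_energy_j = 2 * rest_energy_mev * 1e6 * EV
    return HBAR / borrowed_energy_j

dt_electron = pair_lifetime(0.511)   # electron-positron pair
dt_proton = pair_lifetime(938.3)     # proton-antiproton pair

print(f"electron pair: {dt_electron:.1e} s")
print(f"proton pair:   {dt_proton:.1e} s")
```

The heavier the pair, the shorter the time available for the gravitational field to make the particles real.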

Black holes provide both a fertile area for theoretical speculation and a 
challenge for the skill of experimenters. It has been suggested that material that 
falls into a black hole reappears at another time and place in the universe, or 
perhaps in another universe. Thus a black hole, if this speculation is correct, could 
be used for time travel or to travel between different universes. Other proposals 
suggest harnessing a black hole as an energy source. It has been estimated that, if 
black holes are indeed the end products of the evolution of massive stars, there 
could be as many as $10^8$ massive black holes in our galaxy, which makes it likely
that many black holes are within our observational reach. Or, perhaps we will 
someday observe a minihole ending its existence with a burst of radiation. As 
we continue to refine our ability to study the skies at visible, X-ray, and γ-ray
wavelengths, black holes will figure prominently in our investigations. 


15.7 COSMOLOGY AND GENERAL RELATIVITY 


General relativity can be applied to calculate the properties of the universe as a 
whole. For this case, the mass-energy density term in Eq. 15.21 must describe the 
entire universe. We are not interested in the “local” variations in density on a scale 
of galactic size, but rather in the average density of the entire universe, evaluated 
over a distance that is large compared with the spacing between galaxies. (In 
a similar way, when we speak of the density of a solid we are interested not 
in the variations on the atomic scale but rather in the average density of the 
entire material, evaluated over a distance that is large compared with the spacing 
between atoms.) The density of the universe is not a constant; it changes with
time as the universe expands.

Solving the equations of general relativity for the large-scale structure of the
universe gives the following result, which is known as the Friedmann equation:

$$\left(\frac{dR}{dt}\right)^2 = \frac{8\pi}{3}\,G\rho R^2 - kc^2 \tag{15.28}$$

Here R(t) represents the size or distance scale factor of the universe at time t,
and $\rho$ represents the total mass-energy density at the same time. (The density is
expressed in mass units, such as kg/m$^3$, even if it represents radiation.)

The constant k that appears in Eq. 15.28 specifies the overall geometrical
structure of the universe: $k = 0$ if the universe is flat, like Figure 15.14; $k = +1$
if the universe is curved and closed, like Figure 15.15; $k = -1$ if the universe
is curved and open, like Figure 15.16. When $k = +1$, the distance factor R(t) is
directly related to the size or “radius” of the universe, but its meaning is not so
apparent when $k = 0$ or $k = -1$, because in both of the latter cases the universe
is infinite in extent. In these cases R(t) should be regarded as a scale factor that
represents the expansion of space; the absolute magnitude of R in this case is not
significant, and only its variation with time is of interest, because any particular
length (such as the distance between two galaxies) will vary with time just as R
does.

To solve Eq. 15.28, we must therefore specify the constant k. On the large 
scale, our universe seems quite close to being flat (as we discuss in Section 15.10), 
and we therefore take $k = 0$. This simplifies the mathematics and gives results
that do not differ greatly from those obtained with $k = +1$, so for rough
estimates our calculation should be acceptable.

The density $\rho$ in Eq. 15.28 must include both the matter and the radiation present
in the universe. The present universe is dominated by matter; the contribution of
radiation to the total density is negligible. As the universe expands, the amount of
matter remains constant but the volume increases like $R^3$. Thus the matter density
$\rho_m$ decreases with increasing R according to $\rho_m \propto R^{-3}$. Putting this result into
Eq. 15.28 and integrating, we find

$$R(t) = A t^{2/3} \tag{15.29}$$

where A is a constant. Using this result to eliminate R from Eq. 15.28, we obtain

$$t = \frac{1}{\sqrt{6\pi G \rho_m}} \tag{15.30}$$
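
The integration for the matter-dominated case is short enough to write out. The following is a sketch, assuming $k = 0$ and $R = 0$ at $t = 0$, and writing the conserved quantity as $\rho_m R^3 = C$ (the symbol C is a label introduced here, not one used in the text):

```latex
\begin{align*}
\left(\frac{dR}{dt}\right)^2 &= \frac{8\pi G}{3}\,\rho_m R^2 = \frac{8\pi G C}{3R}
  && (\rho_m R^3 = C,\ k = 0) \\
R^{1/2}\,dR &= \sqrt{\frac{8\pi G C}{3}}\;dt
  \quad\Longrightarrow\quad \tfrac{2}{3}\,R^{3/2} = \sqrt{\frac{8\pi G C}{3}}\;t \\
R(t) &= \left(6\pi G C\right)^{1/3} t^{2/3} \equiv A\,t^{2/3} \\
\rho_m &= \frac{C}{R^3} = \frac{1}{6\pi G t^2}
  \quad\Longrightarrow\quad t = \frac{1}{\sqrt{6\pi G \rho_m}}
\end{align*}
```

The constant C drops out of the final relation, which is why the age depends only on the present matter density.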


In contrast, the early universe was dominated by radiation; the mass density of
the matter was negligible. From Eq. 10.42 we see that the energy density of the
radiation depends on $d\lambda/\lambda^5$. All wavelengths scale with R, so we have $d\lambda \propto R$
and $\lambda^5 \propto R^5$. Thus the energy density of radiation $\rho_r$ decreases with increasing
R according to $\rho_r \propto R^{-4}$. Inserting this result into Eq. 15.28 and integrating, we
obtain

$$R(t) = A' t^{1/2} \tag{15.31}$$

where $A'$ is a constant, and so

$$t = \sqrt{\frac{3}{32\pi G \rho_r}} \tag{15.32}$$

The Hubble parameter can be defined in terms of the time variation of the scale
factor:

$$H = \frac{1}{R}\frac{dR}{dt} \tag{15.33}$$

As the universe evolves, the value of H changes. Its present value $H_0$ is revealed
in experiments with Hubble’s law (Eq. 15.2).

If the universe has been expanding at a constant rate ($R \propto t$), then $H^{-1}$ is the
age of the universe. In the two cases we derived above, the age is less than $H^{-1}$.
A matter-dominated universe expanding since t = 0 has an age of $\frac{2}{3}H^{-1}$, while a
radiation-dominated universe has an age of $\frac{1}{2}H^{-1}$. In either case we can take $H^{-1}$
as a rough measure of the age at any time.
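
These three age estimates are easy to put into years. The sketch below assumes a present Hubble parameter of 70 km/s per Mpc; that number is an illustrative assumption introduced here, not a value quoted in this section.

```python
KM_PER_MPC = 3.0857e19      # kilometers in one megaparsec
SECONDS_PER_YEAR = 3.156e7

def hubble_time_years(h0_km_s_mpc):
    """Return 1/H0 in years for H0 given in km/s per Mpc."""
    h0_per_second = h0_km_s_mpc / KM_PER_MPC
    return 1.0 / h0_per_second / SECONDS_PER_YEAR

H0 = 70.0   # ASSUMPTION: illustrative present value, km/s per Mpc
t_hubble = hubble_time_years(H0)

ages = {
    "constant expansion (R ~ t)": t_hubble,
    "matter dominated (R ~ t^(2/3))": (2 / 3) * t_hubble,
    "radiation dominated (R ~ t^(1/2))": 0.5 * t_hubble,
}
for model, age in ages.items():
    print(f"{model}: {age / 1e9:.1f} billion years")
```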

We can therefore characterize the universe by several parameters: a shape 
parameter k, which describes whether it is flat or curved, open or closed; a radius 
or scale parameter R(t), which measures the size of the universe as a function 
of time; the density $\rho$, which represents both matter and energy, and which is
also a function of time; the Hubble parameter H, which is proportional to the 
rate of expansion; and also a deceleration parameter q, which tells us the rate 
at which the expansion is slowing down (see Problem 12). The challenge to the 
observational astronomer is to obtain data on the distribution and motion of the 
stars and galaxies that can be analyzed to obtain values for these parameters. 


15.8 THE BIG BANG COSMOLOGY 


The present universe is characterized by a relatively low temperature and a low 
density of particles. Its structure and evolution are dominated by the gravitational 
force. Because the universe has been expanding and cooling, in the distant past 
it must have been characterized by a higher temperature and a greater density of 
particles. Let us imagine we could run the cosmic clock backward and examine 
the universe at earlier times, even before the formation of stars and galaxies. At 
some point in its history, the temperature of the universe must have been high 
enough to ionize atoms; at that time the universe consisted of a plasma of electrons 
and positive ions, and the electromagnetic force was important in determining the 
structure of the universe. At still earlier times, the temperature was hot enough 
that collisions between the ions would have knocked loose individual nucleons, 
so the universe consisted of electrons, protons, and neutrons, along with radiation. 
In this era the strong nuclear force was important in determining the evolution of 
the universe. At still earlier times the weak interaction played a significant role. 
If we try to go back still further, we reach a time when the matter of the 
universe consisted only of quarks and leptons. Because we have never observed 
a free quark, we don’t know much about their individual interactions, and so we 
can’t describe this very early state of the universe. If someday we are able to 
understand the interactions of free quarks, we can penetrate this barrier and look 
to still earlier times. Eventually we reach a fundamental barrier when the universe
had an age of only $10^{-43}$ s, which is known as the Planck time (see Problem 26).
Before this time, quantum theory and gravity are hopelessly intertwined, and none
of our present theories gives us any clue about the structure of the universe.
Later than the Planck time, but still before the condensation of bulk matter, the
universe consisted of particles, antiparticles, and radiation in approximate thermal
equilibrium at temperature T. The universe at this time was radiation-dominated:
the energy density of the radiation exceeded the energy density of the matter.
In a radiation-dominated universe, we can use Eq. 15.32 to find a relationship
between the temperature and the age. Inserting the radiation density from Eq. 15.9,
remembering to convert to mass units such that $\rho_r = U/c^2$, and evaluating all
numerical factors, we obtain


$$T = \frac{1.5 \times 10^{10}\ \mathrm{s^{1/2} \cdot K}}{t^{1/2}} \tag{15.34}$$

where the temperature T is in K and the time t is in seconds. This equation relates
the age of the early universe to its temperature.
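
The numerical factor in this relation can be recovered from the fundamental constants. The sketch below assumes that the radiation energy density is that of photons alone, $U = aT^4$ with $a = 4\sigma/c$ (taken here to be the content of Eq. 15.9), and substitutes $\rho_r = U/c^2$ into $t = \sqrt{3/(32\pi G\rho_r)}$. The function and variable names are introduced here for illustration.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
C = 2.998e8        # speed of light, m/s
SIGMA = 5.670e-8   # Stefan-Boltzmann constant, W m^-2 K^-4
K_EV = 8.617e-5    # Boltzmann constant, eV/K

# ASSUMPTION: photon-only energy density U = a*T**4, with a = 4*sigma/c
RADIATION_CONSTANT = 4 * SIGMA / C   # J m^-3 K^-4

def temperature_coefficient():
    """Return B in T = B / sqrt(t), obtained from
    t = sqrt(3 / (32*pi*G*rho_r)) with rho_r = a*T**4 / c**2."""
    return (3 * C**2 / (32 * math.pi * G * RADIATION_CONSTANT)) ** 0.25

B = temperature_coefficient()
print(f"B = {B:.2e} s^(1/2) K")

# Temperature and mean photon energy at a few representative ages
for t in (1e-6, 1e-2, 1.0, 6.0):
    T = B / math.sqrt(t)
    print(f"t = {t:g} s: T = {T:.1e} K, kT = {K_EV * T / 1e6:.2g} MeV")
```

The coefficient agrees with the rounded value $1.5 \times 10^{10}$ to within about 1%; including other relativistic species would change it somewhat.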

The radiation of the early universe consisted of high-energy photons, whose
average energy at the temperature T can be roughly estimated as kT, where k is
the Boltzmann constant. The interactions between the radiation and the matter
can be represented by two processes:

photons → particle + antiparticle
particle + antiparticle → photons

That is, photons can engage in pair production, in which their energy becomes
the rest energy of a particle-antiparticle pair, or a particle and antiparticle can
annihilate into photons. In each case, the energy of the photons must be at least as
large as the rest energy of the particle and antiparticle.


Example 15.2

(a) At what temperature is the thermal radiation in the universe energetic
enough to produce nucleons and antinucleons? (b) What is the age of the
universe when it cools to that temperature?

Solution
(a) Let us consider the formation of proton-antiproton or neutron-antineutron
pairs by photons:

$$\gamma + \gamma \to p + \bar{p} \quad \text{and} \quad \gamma + \gamma \to n + \bar{n}$$

To produce these reactions, the photons must have an energy at least as great
as the nucleon rest energy, or about 940 MeV. The temperature of the photons
must then be

$$T = \frac{E}{k} = \frac{mc^2}{k} = \frac{940\ \text{MeV}}{8.6 \times 10^{-5}\ \text{eV/K}} = 1.1 \times 10^{13}\ \text{K}$$

(b) From Eq. 15.34 we can find the age of the universe when the photons have
this temperature:

$$t = \left(\frac{1.5 \times 10^{10}\ \mathrm{s^{1/2} \cdot K}}{T}\right)^2 = \left(\frac{1.5 \times 10^{10}\ \mathrm{s^{1/2} \cdot K}}{1.1 \times 10^{13}\ \text{K}}\right)^2 = 2 \times 10^{-6}\ \text{s}$$

That is, at times earlier than 2 μs, the universe was hot enough for the photons
to produce nucleon-antinucleon pairs, but after 2 μs the photons were not
energetic enough to do so. The annihilation reaction continues to occur, but
after this time nucleon-antinucleon pair production ceases.

In this calculation we are using average photon energies as estimates. Photons
in the tail of the thermal spectrum are sufficiently energetic to produce
nucleon-antinucleon pairs even after 2 μs, but on the average the photons have
too little energy. More precisely, we could state that the rate of
nucleon-antinucleon pair production drops rapidly at around 2 μs and becomes
negligible at times much greater than 2 μs.
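
The same two steps apply to any particle species: convert the rest energy to a temperature with $T = mc^2/k$, then convert the temperature to an age with $t = (1.5 \times 10^{10}\ \mathrm{s^{1/2} \cdot K}/T)^2$. A minimal sketch follows; the function name is introduced here, and the electron rest energy of 0.511 MeV is a standard value supplied as a second test case.

```python
K_EV_PER_K = 8.617e-5   # Boltzmann constant, eV/K
COEFF = 1.5e10          # s^(1/2) K, numerical factor in T = COEFF / sqrt(t)

def pair_production_cutoff(rest_energy_mev):
    """Temperature (K) at which kT equals the rest energy, and the age (s)
    at which a radiation-dominated universe has cooled to it."""
    temperature = rest_energy_mev * 1e6 / K_EV_PER_K
    age = (COEFF / temperature) ** 2
    return temperature, age

T_nucleon, t_nucleon = pair_production_cutoff(940.0)
T_electron, t_electron = pair_production_cutoff(0.511)

print(f"nucleons:  T = {T_nucleon:.1e} K, t = {t_nucleon:.1e} s")
print(f"electrons: T = {T_electron:.1e} K, t = {t_electron:.1f} s")
```

The nucleon case reproduces the 2 μs of this example. The electron case gives a few seconds, which sets the time scale on which electron-positron pair production shuts off.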
Let us now look at some of the major developments in the evolution of the
universe.


$t = 10^{-6}$ s  We begin the story at a time of 1 μs. From Eq. 15.34 we find
$T = 1.5 \times 10^{13}$ K or kT = 1300 MeV. The scale factor is smaller than that of
the present universe by the red shift, $2.7\ \text{K} / 1.5 \times 10^{13}\ \text{K} = 1.8 \times 10^{-13}$. If the
universe were closed and finite, its radius would be smaller than the present
observable radius ($10^{26}$ m) by this factor, so the universe at that time is about
the present size of the solar system ($10^{13}$ m). At 1 μs, the universe consists of $p$,
$\bar{p}$, $n$, $\bar{n}$, $e^-$, $e^+$, $\mu^-$, $\mu^+$, $\pi^0$, $\pi^-$, $\pi^+$, and perhaps other particles, plus photons,
neutrinos, and antineutrinos. Because both pair production and annihilation can
occur, the number of particles is roughly equal to the number of antiparticles
for each species. Furthermore, the number of photons is roughly equal to the
number of nucleons, which is in turn roughly equal to the number of electrons.
The relative number of neutrons and protons is determined by three factors:





1. The Boltzmann factor $e^{-\Delta E/kT}$. Protons have less rest energy than neutrons, so
there are more of them at any given temperature. The energy difference $\Delta E$
is $(m_n - m_p)c^2 = 1.3$ MeV, so the neutron-to-proton ratio can be expressed
as $e^{-1.5 \times 10^{10}/T}$ with T in kelvins. For $T \sim 10^{13}$ K, this ratio is very nearly 1,
but it becomes different from 1 as T approaches $10^{10}$ K.

2. Nuclear reactions. Reactions such as $n + \nu_e \rightleftharpoons p + e^-$ and $n + e^+ \rightleftharpoons p + \bar{\nu}_e$
can go in either direction and tend to make it easy for protons to turn into
neutrons or neutrons into protons, as long as there are plenty of $e^-$, $e^+$, $\nu_e$,
and $\bar{\nu}_e$ around.

3. Neutron decay. The neutron half-life is about 10 min, which is going to be
important only at later times. For t < 1 s, there has not yet been enough time
for an appreciable number of neutrons to decay.


At t = 1 μs, all three of these factors keep the neutron-to-proton ratio very close
to 1.


$t = 10^{-2}$ s  Between $10^{-6}$ s and $10^{-2}$ s, the temperature drops from $1.5 \times
10^{13}$ K (kT = 1300 MeV) to $1.5 \times 10^{11}$ K (kT = 13 MeV), and the distance scale
factor increases by a factor of 100. The photons have on the average too little
energy (13 MeV) to produce pions and muons, and because the pion and muon
lifetimes are much shorter than $10^{-2}$ s, they have decayed into electrons, positrons,
and neutrinos. Pair production of nucleons and antinucleons no longer occurs,
but nucleon-antinucleon annihilation continues. As we discuss later, there is a very
slight imbalance of matter over antimatter of perhaps 1 part in $10^9$. During this
interval, all of the antimatter and most (99.9999999%) of the matter is annihilated.
Pair production of electrons and positrons can still occur, so the universe consists of
$p$, $n$, $e^-$, $e^+$, photons, and neutrinos. The neutron-to-proton ratio remains about 1.


$t = 1$ s  Between $10^{-2}$ s and 1 s, the temperature drops to $1.5 \times 10^{10}$ K (kT =
1.3 MeV). In this interval, the Boltzmann factor, which determines the
neutron-to-proton ratio, becomes different from 1; by t = 1 s, the nucleons consist of about
73% protons and 27% neutrons. During this period, the influence of the neutrinos
has been decreasing; to convert a proton to a neutron by capturing an antineutrino
($\bar{\nu}_e + p \to n + e^+$) requires an antineutrino of at least 1.8 MeV, above the mean
neutrino energy (1.3 MeV) at this temperature. This begins the time of “neutrino
decoupling,” when the interactions of matter and primordial neutrinos no longer
occur. From this time on, the neutrinos continue to fill the universe, cooling
along with the expansion of the universe. These primordial neutrinos presently
have roughly the same density as the microwave photons, but a slightly lower
temperature (about 2 K).


$t = 6$ s  Between 1 s and 6 s ($T = 6 \times 10^9$ K or kT = 0.5 MeV), the average
photon energy decreases and becomes insufficient to produce electron-positron
pairs. Electron-positron annihilation continues, and as a result all of the positrons
and nearly all (99.9999999%) of the electrons are annihilated. The electrons
have too little energy to convert protons to neutrons ($e^- + p \to n + \nu_e$ no longer
occurs), and so the only remaining weak interaction process that influences the
relative number of protons and neutrons is the radioactive decay of the neutron,
which has a half-life of 10 minutes and so has not appreciably occurred by this
time. The nucleons are now about 84% protons and 16% neutrons, or about 5
times as many protons as neutrons.

After t = 6 s, the universe consists of some number N of
protons, the same number N of electrons, and about 0.2N neutrons. There are no
remaining positrons or antinucleons. Because particle-antiparticle annihilation
has substantially reduced the number of nucleons while the number of photons
remained stable, there are about $10^9 N$ photons (and about the same number of
neutrinos).


Example 15.3

Estimate the relative number of neutrons and protons among the nucleons at
t = 1 s.

Solution
At this time, the temperature is $1.5 \times 10^{10}$ K. The neutron-to-proton ratio is
determined by the Boltzmann factor, $e^{-\Delta E/kT}$, where $\Delta E$ is the neutron-proton
rest energy difference. The exponent in the Boltzmann factor is

$$\frac{\Delta E}{kT} = \frac{1.3\ \text{MeV}}{(8.62 \times 10^{-5}\ \text{eV/K})(1.5 \times 10^{10}\ \text{K})} = 1.0$$

so the ratio of neutrons to protons is

$$\frac{N_n}{N_p} = e^{-\Delta E/kT} = e^{-1.0} = 0.37$$

The relative number of protons is then

$$\frac{N_p}{N_p + N_n} = \frac{1}{1 + N_n/N_p} = \frac{1}{1 + 0.37} = 0.73$$

The nucleons consist of 73% protons and 27% neutrons.
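
The same Boltzmann estimate can be repeated at other temperatures to see how the ratio departs from 1 as the universe cools. The sketch below assumes the neutrons and protons remain in thermal equilibrium, which holds only while the weak-interaction reactions are still fast (roughly up to t = 1 s); the function names are introduced here.

```python
import math

K_EV_PER_K = 8.617e-5   # Boltzmann constant, eV/K
DELTA_E_EV = 1.3e6      # neutron-proton rest energy difference, eV

def neutron_to_proton_ratio(temperature_k):
    """Equilibrium ratio N_n/N_p = exp(-dE/kT)."""
    return math.exp(-DELTA_E_EV / (K_EV_PER_K * temperature_k))

def proton_fraction(temperature_k):
    """Fraction of nucleons that are protons: 1 / (1 + N_n/N_p)."""
    return 1.0 / (1.0 + neutron_to_proton_ratio(temperature_k))

for T in (1.5e13, 1.5e11, 1.5e10):
    ratio = neutron_to_proton_ratio(T)
    print(f"T = {T:.1e} K: N_n/N_p = {ratio:.3f}, "
          f"protons = {100 * proton_fraction(T):.0f}%")
```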





15.9 THE FORMATION OF NUCLEI AND ATOMS 


Let’s review developments in the Big Bang cosmology up to t = 6 s. (1) A hot,
dense universe, full of photons and elementary particles of all varieties, has cooled
to below $10^{10}$ K. (2) Most of the unstable particles have decayed away. (3) All
of the original antimatter and most of the original matter annihilated one another,
leaving a small number of protons, an equal number of electrons, and about
one-fifth as many neutrons. (4) Neutrinos, which have about the same density
as photons, decoupled at about 1 s and will continue cooling as the universe
expands.

As the neutrons and protons collide with one another, it is possible to form a
deuteron ($^2$H nucleus):

$$n + p \to {}^2\mathrm{H} + \gamma$$

but the high density of photons can also produce the inverse reaction:

$$\gamma + {}^2\mathrm{H} \to n + p$$


We recall from Chapter 12 that the deuteron binding energy is 2.22 MeV. In order 
to have any appreciable buildup of deuterons, the photons present must first cool 
until their energies are below 2.22 MeV; otherwise the deuterons will be broken 
up as quickly as they can be formed. The energy 2.22 MeV corresponds to a
temperature $T = 2.5 \times 10^{10}$ K, and we therefore might expect deuterons to be
formed as soon as the temperature drops below $2.5 \times 10^{10}$ K. However, this does
not happen. The radiation does not have a single energy, but rather has a thermal 
spectrum. A small fraction of the photons has energies above 2.22 MeV, and these 
photons continue to break apart the deuterons (Figure 15.28). 

Before matter-antimatter annihilation occurred, there were about as many
photons as nucleons and antinucleons, but after t = 0.01 s, the ratio of nucleons
to photons is about $10^{-9}$; about $\frac{1}{6}$ of the nucleons are neutrons. If the fraction
of photons above 2.22 MeV is greater than $\frac{1}{6} \times 10^{-9}$, there will be at least one
energetic photon per neutron, which effectively prevents deuteron formation. Our
next job is to calculate to what temperature the photons must cool before fewer
than $\frac{1}{6} \times 10^{-9}$ of them are above 2.22 MeV.

The number density of thermal photons was given by Eq. 15.6. We expect that
the temperature must be much less than $2.5 \times 10^{10}$ K, and so we are interested in
the distribution where $E \gg kT$, for which it is approximately

$$\frac{N(E)\,dE}{V} = \frac{8\pi E^2}{(hc)^3}\, e^{-E/kT}\, dE \tag{15.35}$$

and the total number density above some energy $E_0$ is determined by integrating
the number density from $E_0$ to $\infty$: $N_{E>E_0}/V = \int_{E_0}^{\infty} N(E)\,dE/V$, which can be
shown to be

$$\frac{N_{E>E_0}}{V} = 8\pi \left(\frac{kT}{hc}\right)^3 e^{-E_0/kT} \left[\left(\frac{E_0}{kT}\right)^2 + 2\left(\frac{E_0}{kT}\right) + 2\right] \tag{15.36}$$

Equation 15.8 gives the total number density of photons, and thus the fraction f
above $E_0$ is $(N_{E>E_0}/V)/(N/V)$, which can be evaluated to be

$$f = \frac{N_{E>E_0}}{N} = 0.42\, e^{-E_0/kT} \left[\left(\frac{E_0}{kT}\right)^2 + 2\left(\frac{E_0}{kT}\right) + 2\right] \tag{15.37}$$

For $f = \frac{1}{6} \times 10^{-9}$, corresponding to the number needed to prevent deuteron
formation, Eq. 15.37 gives $E_0/kT = 28$. With $E_0$ = 2.22 MeV, the required
temperature is thus about $9 \times 10^8$ K; when $T > 9 \times 10^8$ K, the number of photons
with E > 2.22 MeV is greater than the number of neutrons, and deuteron ($^2$H)
formation is prevented. When T drops below $9 \times 10^8$ K (which occurs at about
t = 250 s), deuterons can be produced.
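
The value $E_0/kT = 28$ cannot be obtained in closed form, but it is easy to find numerically. The sketch below solves $f(x) = \frac{1}{6} \times 10^{-9}$ by bisection, with $x = E_0/kT$ and $f(x) = 0.42\,e^{-x}(x^2 + 2x + 2)$. The prefactor 0.42 is taken to be the ratio of $8\pi(kT/hc)^3$ to the total Planck photon density, and the bisection routine and its bracketing interval are choices made here for illustration.

```python
import math

K_EV_PER_K = 8.617e-5   # Boltzmann constant, eV/K
COEFF = 1.5e10          # s^(1/2) K, numerical factor in T = COEFF / sqrt(t)

def fraction_above(x):
    """Fraction of thermal photons with E > E0, where x = E0/kT (x >> 1)."""
    return 0.42 * math.exp(-x) * (x * x + 2 * x + 2)

def solve_for_x(target, lo=1.0, hi=100.0):
    """Bisection; fraction_above decreases monotonically for x > 0."""
    for _ in range(100):
        mid = 0.5 * (lo + hi)
        if fraction_above(mid) > target:
            lo = mid    # still too many energetic photons: need larger x
        else:
            hi = mid
    return 0.5 * (lo + hi)

x = solve_for_x(1e-9 / 6)
T = 2.22e6 / (x * K_EV_PER_K)   # temperature at which E0 = 2.22 MeV = x*kT
t = (COEFF / T) ** 2            # age, using the radiation-dominated relation

print(f"E0/kT = {x:.1f}, T = {T:.1e} K, t = {t:.0f} s")
```

Because the exponential dominates, the result is insensitive to the exact neutron fraction: changing the target by a factor of 2 shifts $E_0/kT$ by less than 1.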


FIGURE 15.28 The thermal radiation spectrum N(E) as a function of energy. The
photons above $E_0$ = 2.22 MeV are sufficiently energetic to break apart
deuterium nuclei.

From 6 s to 250 s, very little (except expansion and the corresponding
temperature decrease) happens in the universe, but after t = 250 s things happen
very quickly. Deuterons form and then react with the many protons and neutrons
available to give 


"H+p > *He+y and *H+n > 7H+y 


The energies of formation of these nuclei are, respectively, 5.49 MeV and 
6.26 MeV, well above the 2.22 MeV threshold of the deuteron formation. If the 
photons are not energetic enough to break apart the deuterons, they are certainly 
not energetic enough to break apart *He and 7H. The final steps in the formation 
of the heavier nuclei are 


[He +n > *He+y and 3H+p > *He+y 


There are no stable nuclei with $A = 5$, so no further reactions of this sort are 
possible. Nor is it possible to have $^4$He + $^4$He reactions because $^8$Be is highly 
unstable. (It would be possible to form stable $^6$Li and $^7$Li, but these are made in 
very small quantities relative to H and He; from Li further reactions are possible, 
such as $^7$Li + $^4$He $\rightarrow$ $^{11}$B, and so forth, but these occur in still smaller quantities. 
The end products $^2$H and He, along with the leftover original protons, make up 
about 99.9999% of the nuclei after the era of nuclear reactions.) 

By $t = 250$ s, the original 16% neutrons present at $t = 6$ s had beta-decayed to 
about 12%, leaving 88% protons. Most of the $^2$H, $^3$H, and $^3$He were “cooked” 
into heavier nuclei, so we can assume the universe to be composed mostly of 
$^1$H and $^4$He nuclei. Of the $N$ nucleons present at $t = 250$ s, 12% ($0.12N$) were 
neutrons and $0.88N$ were protons. The $0.12N$ neutrons combined with $0.12N$ 
protons, forming $0.06N$ $^4$He, and leaving $0.88N - 0.12N = 0.76N$ protons. The 
universe then consisted of $0.82N$ nuclei, of which $0.06N$ (7.3%) were $^4$He and 
$0.76N$ (92.7%) were protons. Helium is about four times as massive as hydrogen, 
so by mass the universe is about 24% helium. 
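
The bookkeeping in this paragraph can be written out as a short calculation. This is an illustrative sketch only; the function name `helium_abundance` is hypothetical, and it makes the same simplifying assumptions as the text (every neutron ends up in $^4$He, and a helium nucleus has four times the mass of a proton).

```python
def helium_abundance(neutron_fraction):
    """Return (number fraction, mass fraction) of 4He among the nuclei.

    neutron_fraction is the fraction of all nucleons that are neutrons
    when nuclear reactions begin (0.12 in the text).
    """
    n = neutron_fraction
    helium = n / 2.0            # each 4He nucleus uses 2 neutrons (and 2 protons)
    protons = (1.0 - n) - n     # protons left over after pairing with neutrons
    nuclei = helium + protons   # total number of nuclei, in units of N
    number_fraction = helium / nuclei
    mass_fraction = 4.0 * helium / (4.0 * helium + protons)
    return number_fraction, mass_fraction


number_fraction, mass_fraction = helium_abundance(0.12)
print(f"He by number: {number_fraction:.1%}, He by mass: {mass_fraction:.0%}")
```

Changing the argument shows how sensitive the predicted helium abundance is to the neutron fraction that survives until $t = 250$ s.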

At this point the universe began a long and uneventful period of cooling, during 
which the strong interactions ceased to be of importance. 

The final step in the evolution of the primitive universe is the formation of 
neutral hydrogen and helium atoms from the $^1$H, $^2$H, $^3$He, and $^4$He nuclei and the 
free electrons. In the case of hydrogen, this takes place when the photon energy 
drops below 13.6 eV; otherwise any atoms that might happen to form will be 
immediately ionized by the radiation. There are still about $10^9$ photons for every 
proton, and so we must wait for the radiation to cool until the fraction of photons 
above 13.6 eV is less than about $10^{-9}$. We can solve Eq. 15.37 for $f = 10^{-9}$ 
to obtain $E_0/kT = 26$. With $E_0 = 13.6$ eV, the corresponding temperature is 
$T = 6070$ K, which occurs at time $t = 6.1 \times 10^{12}$ s = 190,000 y. These final 
estimates are actually not quite correct. We have been considering only the energy 
density of radiation present in the universe. As the universe cools, the contribution 
of the matter to the total energy density becomes more significant, and so the 
temperature drops more slowly than we would estimate. This contribution may 
increase this time by about a factor of 2 to about 380,000 y, and the radiation 
temperature is decreased by about a factor of $\sqrt{2}$, to $T = 4300$ K. 
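
The radiation-only estimate can be reproduced with a few lines of code. The sketch below solves Eq. 15.37 for $f = 10^{-9}$ by fixed-point iteration on $x = \ln[0.42(x^2 + 2x + 2)/f]$ and then converts the temperature to a time. Two things are assumptions here rather than statements from this section: the temperature-time relation is taken to be $T = (1.5 \times 10^{10}\ \mathrm{s^{1/2} \cdot K})/t^{1/2}$, a coefficient chosen because it reproduces the times quoted in the text, and the names `solve_x` and `EQ_15_34_COEFF` are hypothetical.

```python
import math

K_BOLTZMANN_EV = 8.617332e-5  # Boltzmann constant in eV/K
EQ_15_34_COEFF = 1.5e10       # s^(1/2) K; assumed radiation-era coefficient
SECONDS_PER_YEAR = 3.156e7


def solve_x(f_target, x=20.0, iterations=50):
    """Solve f = 0.42 e^-x (x^2 + 2x + 2) for x by fixed-point iteration.

    Rearranged as x = ln(0.42 (x^2 + 2x + 2) / f); the right-hand side varies
    slowly with x near the root, so the iteration converges quickly.
    """
    for _ in range(iterations):
        x = math.log(0.42 * (x * x + 2.0 * x + 2.0) / f_target)
    return x


x_atom = solve_x(1e-9)                      # E0/kT for one ionizing photon per proton
T_atom = 13.6 / (x_atom * K_BOLTZMANN_EV)   # E0 = 13.6 eV
t_atom = (EQ_15_34_COEFF / T_atom) ** 2     # radiation-only estimate of the time
print(f"E0/kT = {x_atom:.1f}, T = {T_atom:.0f} K, "
      f"t = {t_atom:.1e} s = {t_atom / SECONDS_PER_YEAR:.2e} y")
```

Because the text rounds $E_0/kT$ to 26, the unrounded solution gives a slightly lower temperature and a slightly later time than the 6070 K and 190,000 y quoted above; the matter correction described in the paragraph is not included in this sketch.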

After neutral atoms have formed, there are virtually no charged particles left 
in the universe, and the radiation field is not energetic enough to ionize the atoms. 
This is the time of the decoupling of the radiation field from the matter, and now 
electromagnetism, the third of the four basic forces, is no longer important in 
shaping the evolution of the universe. The large-scale development of the universe 
is from this point governed only by gravity. 

FIGURE 15.29 Evolution of the universe according to the Big Bang cosmology. The blue line shows the temperature and time 
in the radiation-dominated era before decoupling. The most important reactions in each era are shown. 

The time after $t$ = 380,000 y has been comparatively uneventful, at least from 
the point of view of cosmology. Density fluctuations of the hydrogen and helium 
triggered the condensation of galaxies, and then first-generation stars were born. 
Supernova explosions of the material from these stars permitted the formation of 
second generation systems, among which planets formed from the rocky debris. 

Meanwhile, the decoupled radiation field, unaffected by the gravitational 
coming and going of matter, began the long journey that eventually took it, cooled 
again by a factor of 1600, to the radio telescopes of 20th-century Earth. 

The details of the Big Bang cosmology are summarized in Figure 15.29. It 
is a remarkable story, all the more so because we can understand most of its 
details—with the possible exception of the first instant—with nothing more than 
some basic theories of modern physics, most of which we can study (on a much 
smaller scale!) in our laboratories on Earth. 


15.10 EXPERIMENTAL COSMOLOGY 


Far from being a science that involves only speculations about the distant past or the 
indefinite future, cosmology has in recent decades become a precise experimental 
science, involving observational results from high-resolution observatories on 
Earth and in space, as well as laboratory measurements of nuclear and particle 
properties that provide insight into cosmological phenomena. Here are a few of 
the observations and their implications. 

Matter and Antimatter 


In the early universe, there was roughly one nucleon and one antinucleon for each 
photon. If the numbers of nucleons and antinucleons had been exactly equal, there 
would have been either complete annihilation of both (in which case we would 
not be around to comment on the outcome) or else the clumping of matter and 
antimatter into galaxies and antigalaxies. Our telescopes can’t tell the difference 
between galaxies made of matter and antimatter (because both emit the same 
light), but if there were large quantities of antimatter in the universe we should 
occasionally find a galaxy and an antigalaxy colliding, and their annihilation 
would light up the sky. We observe many galaxies in the process of colliding with 
their neighbors, but none show the intense annihilation radiation that would signal 
a matter-antimatter collision. Our conclusion is that the universe is made of matter 
and contains no significant concentrations of antimatter. For every 1,000,000,000 
nucleons in the early universe there were 999,999,999 antinucleons; following 
the annihilation all of the antinucleons disappeared, leaving 1 out of the original 
$10^9$ nucleons to make up the current universe. 

After the matter-antimatter annihilation in the early universe, the ratio of the 
number of remaining nucleons to photons was about $10^{-9}$. This number, which 
has remained constant since the annihilation era, is deduced from the measurement 
of the relative density of $^2$H and $^3$He in sites such as “first-generation” stars or 
interstellar gas, where no significant additional amounts of those atoms have 
been subsequently produced by fusion. The observed relative abundances of these 
atoms give a nucleon to photon ratio in the range of 5 to $7 \times 10^{-10}$. 

Where did the $10^{-9}$ excess of matter over antimatter in the early universe 
come from? We don’t yet know the answer to this question, but evidence gathered 
in particle physics experiments may provide a clue. The first indication of an 
asymmetry between matter and antimatter was a 1964 experiment studying the 
decay of the neutral K meson, which shows a difference in behavior between $K^0$ 
and $\bar{K}^0$ only at the very low level of 1 part in $10^3$ in the weak interaction or 
thus at a level of $10^{-10}$ relative to the strong interaction. (J. W. Cronin and V. L. 
Fitch received the 1980 Nobel Prize in physics for their work on this experiment.) 
Following the discovery of the b quark, it was hypothesized that the $B^0$ meson 
would show a similar effect, and so accelerators were built in the U.S. and Japan 
that produced the $B^0$ in large enough quantities to verify the asymmetry. Beginning 
in about 2001, scientists at these accelerators announced results that verified the 
matter-antimatter asymmetry previously observed only in the $K^0$ decay. 

The distinction between matter and antimatter occurred at an early stage in 
the evolution of the universe, during the quark-antiquark era. The Grand Unified 
Theories (GUTs) include this asymmetry between quarks and antiquarks in a 
natural way, although there is as yet no accepted version of the GUTs that yields 
a convincing explanation for the $K^0$ and $B^0$ experiments. 


Helium Abundance 


Much of the matter in the universe has been formed and reformed, and so has 
lost its “memory” of the Big Bang. There is, however, “first generation” matter 
in stars and galaxies that should show the roughly 24% helium abundance that 
characterized the formation of matter. 

A variety of experiments suggests that the abundance of helium in the universe 
is 23 to 27% by mass, in excellent agreement with our rough estimate of 24%. 
These experiments include the emission of visible light from gas clouds near stars 
and the emission of radio waves by interstellar gas, both of which permit us to 
compare the amounts of hydrogen and helium present. In addition, the dynamics 
of stellar formation depends on the initial hydrogen and helium concentrations; 
present theories permit us to estimate their ratio from the observed properties of 
stars. The 24% abundance seems to be rather constant throughout the universe, as 
we would expect if it were predetermined by the Big Bang. (Not enough helium 
has been produced by nuclear fusion in stars in the last $14 \times 10^9$ y to change this 
ratio significantly.) 

In fact (and here physics comes nearly full circle, from the very old and large to 
the very new and small), the early helium abundance is a function of the conditions 
before $10^{-6}$ s, when quarks and leptons filled the universe. The evolutionary rate 
in this era depends on the number of different kinds of quarks and leptons that 
can participate in reactions. It has been calculated that the helium abundance 
is probably not consistent with the existence of more than three generations of 
quarks and leptons. It is remarkable that extrapolations to an unobservable state 
of the universe can yield such insight into the fundamental structure of matter. 


The Horizon Problem 


Our telescopes permit us to look outward by about 10 billion light-years in any 
direction. No matter in what direction we look, the universe (which we are 
viewing as it looked 10 billion years ago) appears pretty much the same—the 
same types of galaxies and the same temperature of the background radiation. 
This is surprising, because regions that we observe in opposite directions are 
separated by 20 billion light years, while the universe is only 14 billion years old. 
If the universe had been expanding throughout its history at a uniform rate, those 
opposite regions of the sky could never have been connected by any signal and 
thus had no way to achieve the common characteristics that we now observe. 
(Imagine finding a block of copper that had been assembled from a random 
collection of copper atoms. If all parts of the block were at the same temperature, 
you would conclude that the block had been in existence for a long enough time 
for thermal energy to propagate throughout its volume. If we learned that the time 
since the block’s assembly was less than that propagation time, it would be very 
puzzling to explain the achievement of thermal equilibrium in so short a time.) 

This paradox is solved by a hypothesis called “inflation,” which proposes that 
in the early universe rather than a constant expansion rate there was a sudden 
rapid growth (by perhaps 50 orders of magnitude) in a short interval of time 
between $10^{-35}$ s and $10^{-32}$ s. Before the time of inflation, the size of the universe 
was less than the distance through which distant parts could exchange energy 
since the time of the Big Bang. Thus all parts of the universe were able to 
achieve a common set of characteristics. After inflation, the size of the universe 
exceeded the maximum range of communication signals, but the homogeneous 
characteristics had already been achieved. The inflationary hypothesis thus neatly 
solves the horizon problem. 


The Flatness Problem 

For a flat universe ($k = 0$), we can combine Eqs. 15.28 and 15.33 to give 

$$\rho_{\rm cr} = \frac{3H^2}{8\pi G} = 0.97 \times 10^{-26}\ \mathrm{kg/m^3} \tag{15.38}$$

This is the critical density corresponding to a flat universe. If the density is greater 
than this critical value, the universe is closed, and if it is less than this value, the 
universe is open. In discussing the density of the universe, it is useful to define 
the ratio between the actual density and this critical value: 

$$\Omega = \frac{\rho}{\rho_{\rm cr}} \tag{15.39}$$

As the universe expands, the gravitational interaction among its components slows 
the expansion rate. If $\Omega > 1$, the gravitational interaction will eventually halt and 
reverse the expansion. If $\Omega < 1$, the expansion will continue until the components 
are separated by infinite distances. If $\Omega$ is exactly equal to one, the expansion will 
also continue forever, but the components will arrive at their infinite separation 
just as they lose the last bit of kinetic energy. 
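
As a numerical check on the critical density $\rho_{\rm cr} = 3H^2/8\pi G$, the sketch below evaluates it for an assumed present-day Hubble parameter. The value $H_0 = 72$ km/s/Mpc is an assumption made for this example (it is not stated in this section), as are the function names; $G$ is taken from the table of constants in the appendix.

```python
import math

G = 6.67384e-11          # gravitational constant, N m^2 / kg^2
METERS_PER_MPC = 3.0857e22
H0_KM_S_MPC = 72.0       # assumed Hubble parameter, km/s per Mpc


def critical_density(h0_km_s_mpc):
    """Critical density 3 H^2 / (8 pi G) in kg/m^3 for H given in km/s/Mpc."""
    h0_si = h0_km_s_mpc * 1.0e3 / METERS_PER_MPC  # convert to 1/s
    return 3.0 * h0_si ** 2 / (8.0 * math.pi * G)


def density_parameter(density, h0_km_s_mpc):
    """Omega: ratio of the actual density to the critical density."""
    return density / critical_density(h0_km_s_mpc)


rho_cr = critical_density(H0_KM_S_MPC)
print(f"critical density = {rho_cr:.2e} kg/m^3")
```

Because $\rho_{\rm cr}$ scales as $H^2$, a 5% change in the assumed Hubble parameter shifts the critical density by about 10%.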

By carefully measuring the variations in the temperature of regions of the 
microwave background, the WMAP satellite and other experiments have concluded 
that $\Omega$ is very close to 1, probably within 1%. It is of considerable interest 
to know whether $\Omega$ is exactly equal to 1 (for a flat universe), or just happens to be 
very close to 1 (for an open or closed universe). 

By way of analogy, consider a projectile that is thrown upward from the surface 
of the Earth (ignore the gravity of the Sun and all other objects). The parameter 
$\Omega$ in effect measures the ratio between the gravitational potential energy and 
the kinetic energy: $\Omega = |U_{\rm grav}|/K$. If the initial value of $\Omega$ is greater than 1, the 
gravitational energy exceeds the kinetic energy, so the projectile will rise to a 
maximum height and then fall back to Earth. When it reaches its maximum height, 
$K = 0$ and $\Omega$ becomes infinite. During the entire ascent, the value of $\Omega$ increases 
because the kinetic energy decreases more rapidly than the magnitude of the 
gravitational energy. If the projectile is launched so that $\Omega < 1$, there is more than 
enough kinetic energy to overcome the Earth’s gravity, and the projectile will 
escape the pull of the Earth. When it reaches infinite separation, $\Omega = 0$ because 
$U_{\rm grav} = 0$. During its entire outward journey, $\Omega$ decreases from its initial value 
and approaches zero. If we choose the initial velocity such that $\Omega = 1$, there is 
just enough energy to escape, and the projectile reaches infinite separation with 
$K = 0$. Throughout the entire journey, $\Omega$ remains exactly 1. 
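
The behavior of the projectile's $\Omega$ follows directly from energy conservation, and a short sketch makes the three cases concrete. With $|U_{\rm grav}| \propto 1/r$ and total energy fixed at launch, one finds $\Omega(r) = (r_0/r)/[(1/\Omega_0 - 1) + r_0/r]$, where $r_0$ is the launch radius and $\Omega_0$ the launch value. The function name `omega_at` is hypothetical, and the model ignores everything except the Earth's gravity, as in the analogy.

```python
def omega_at(r_over_r0, omega0):
    """Omega = |U_grav|/K at radius r for a projectile launched from r0.

    In units of |U0| (the launch potential energy magnitude):
        |U(r)| = r0/r
        K(r)   = (1/omega0 - 1) + r0/r      (energy conservation)
    Returns infinity at or beyond the turning point, where K <= 0
    (radii the projectile never actually reaches).
    """
    u = 1.0 / r_over_r0
    k = (1.0 / omega0 - 1.0) + u
    if k <= 0.0:
        return float("inf")
    return u / k


for omega0 in (1.1, 1.0, 0.9):
    values = [omega_at(r, omega0) for r in (1.0, 2.0, 5.0, 10.0)]
    print(omega0, [f"{v:.3g}" for v in values])
```

Starting slightly above 1, $\Omega$ runs away toward infinity as the projectile nears its maximum height; starting slightly below 1, it falls toward zero; starting at exactly 1, it never changes.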

For the projectile as well as for the evolution of the universe, the conclusions 
are identical: If $\Omega = 1$ initially, it remains exactly 1 always, but if either $\Omega > 1$ 
or $\Omega < 1$, it grows further away from 1. If the early universe had $\Omega = 1.000001$, 
after the passage of 14 billion years $\Omega$ would have grown very large; similarly, if 
the initial value of $\Omega$ were 0.999999, by now it would be very close to 0. It has 
been calculated that for $\Omega$ to be within 1% of 1 today, it must have originally been 
in the range 1 + 10~°*. Here again the inflation hypothesis is essential. Prior to 
the inflation era, the universe may have been open, flat, or closed, with any value 
of $\Omega$. During inflation, the universe grew by so many orders of magnitude that the 
curvature became flat, much as the surface of a balloon becomes nearly flat when it 
is inflated by many orders of magnitude. As a result, $\Omega$ was indeed very close to 1 
just after inflation, and we continue today to observe an $\Omega$ that remains close to 1. 


The Composition and Age of the Universe 


During the past 10 years, researchers have made enormous strides in determining 
the composition of the universe and its age. These determinations are based mostly 
on measurements of the properties of the cosmic microwave background radiation, 
in particular its geometrical distribution and its polarization properties. Among 
the most detailed experiments are those of the WMAP satellite (2001) and the 
Boomerang balloon flights over Antarctica (1998 and 2003). These experiments, 
along with others, indicate that the universe is very nearly flat; that is, $\Omega = 1.00$ 
(so that $\rho = \rho_{\rm cr}$) to within about 1%. They are also able to measure the relative 
densities of the various components of the mass-energy content of the universe. 
Ordinary baryonic matter contributes about 4.6% of the critical density, and dark 
(nonbaryonic) matter contributes 23%. Together, the two kinds of matter make 
up 28% of the critical density. If the density of the universe is equal to the critical 
density, what makes up the other 72%? 

Beginning in about 1998, two teams of researchers were investigating the 
Hubble law by studying the exploding stars known as supernovas in the most 
distant galaxies (corresponding to large redshifts, with $\Delta\lambda/\lambda$ close to 0.9). Both 
teams found systematic and consistent departures from the Hubble law. The 
supernovas in these distant galaxies were fainter than expected, indicating that 
the galaxies were 10 to 15% farther from us than the Hubble law predicts. The 
research groups concluded that a mysterious force is accelerating the expansion 
of the universe. Generally we would expect that the expansion would be slowing 
down, owing to the gravitational interactions of its components. What could be 
responsible for increasing the rate of the expansion? 

This unknown interaction is called “dark energy,” and it represents the missing 
72% of the composition of the universe. Although there are several theories about 
the nature of the dark energy, there is no convincing explanation of its origin or 
its role in the physical world. It has been suggested that the dark energy density 
does not diminish as the universe expands, so that as the densities of matter (both 
baryonic and non-baryonic) decrease with the expansion, eventually the dark 
energy begins to dominate and accelerates the expansion. We live at a time when 
this acceleration is dominant (which has occurred for about the past 5 billion years). 

Another finding based on the observation of the background radiation is the age 
of the universe. There is general agreement that the age is $13.7 \times 10^9$ years, with an 
uncertainty of about 1%. The accelerated expansion solves a problem associated 
with the Hubble age. In a matter-dominated universe (which our universe has been 
for most of its existence), the present Hubble age should be $\tfrac{2}{3}H_0^{-1}$, which works 
out to be about $9 \times 10^9$ years. If the expansion has been accelerating, then the 
actual age can be greater than the Hubble age, which certainly seems to be the case. 
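
The Hubble-age arithmetic can be sketched as follows. The Hubble parameter $H_0 = 72$ km/s/Mpc is an assumed value for this example (this section does not state it), and the function names are hypothetical. The factor 2/3 is the matter-dominated result quoted in the text.

```python
METERS_PER_MPC = 3.0857e22
SECONDS_PER_YEAR = 3.156e7
H0_KM_S_MPC = 72.0  # assumed Hubble parameter, km/s per Mpc


def hubble_time_years(h0_km_s_mpc):
    """1/H0 expressed in years, for H0 given in km/s/Mpc."""
    h0_si = h0_km_s_mpc * 1.0e3 / METERS_PER_MPC  # convert to 1/s
    return 1.0 / h0_si / SECONDS_PER_YEAR


def matter_dominated_age_years(h0_km_s_mpc):
    """Age (2/3)/H0 of a flat universe that has always been matter dominated."""
    return (2.0 / 3.0) * hubble_time_years(h0_km_s_mpc)


hubble_time = hubble_time_years(H0_KM_S_MPC)
matter_age = matter_dominated_age_years(H0_KM_S_MPC)
print(f"1/H0 = {hubble_time:.2e} y, (2/3)/H0 = {matter_age:.2e} y")
```

With this assumed $H_0$, the matter-dominated estimate comes out near $9 \times 10^9$ years, well short of the $13.7 \times 10^9$ years inferred from the background radiation, which is the discrepancy that accelerated expansion resolves.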

It is fitting that this story ends where it began, with Einstein. When Einstein 
produced the general relativity theory in 1916 (a decade before Hubble’s work), 
it was widely believed that the universe was static. In order for the equations of 
general relativity to allow a static solution, Einstein introduced into his equations 
an additional term, called the cosmological constant. After learning of Hubble’s 
discovery of the expansion of the universe, Einstein called the introduction of the 
cosmological constant his “greatest blunder.” It now appears that the cosmological 
constant is one possible explanation for the dark energy, and the term has been 
restored to the equations. 

The increasing role of the dark energy suggests a sad fate for the universe. 
An open universe will expand forever, but in an accelerating open universe each 
observer’s horizon will shrink increasingly rapidly. The most distant galaxies will 
separate from us at speeds greater than the speed of light (which is not a violation 
of special relativity, because no signals are being exchanged). The light from 
these distant galaxies will not be able to reach us, and the galaxies will gradually 
disappear. Future astronomers might be able to observe only local galaxies and 
might learn nothing of the expansion or the properties of the universe! 

1. If we were to measure the equivalence of gravitational and 
inertial mass, would we show that $m_{\rm inertial} = m_{\rm gravitational}$ or 
merely that $m_{\rm inertial} \propto m_{\rm gravitational}$? 

2. Do tidal effects distinguish between Newtonian gravity and 
curved spacetime? What would be the shape of a drop of 
liquid following a path in a curved spacetime? Can such a 
drop distinguish between a uniform gravitational field and a 
uniform acceleration? 

3. Suppose that the first measurement of deflection of starlight 
during a solar eclipse had been done after 1905, when the 
special theory of relativity was introduced, but before 1916, 
when the general theory was introduced. What would have 
been the effect of this measurement on the special theory? 

4. If we could make a precise comparison of light from the 
Sun with light from the Moon, would the moonlight be red 
shifted, blue shifted, or unshifted relative to sunlight? 

5. What difficulties might arise in the Pound and Rebka 
experiment on the gravitational red shift if the temperature of the 
source or the absorber varied? 

6. Why are the abundances of Li, Be, and B so small? 

7. Can we look out into the distant universe without also 
looking back into time? 

8. Is Hubble’s parameter a constant? Does it vary over large 
distances of space? Over long intervals of time? 

9. Explain why the age of the universe must be less than $H^{-1}$. 

10. Why is it difficult to obtain precise values for the Hubble 
parameter? 

11. All natural processes are governed by the rule that the 
entropy must increase; the increase of entropy, as the 
universe “runs down,” defines for us a direction of time. If the 
universe begins to contract and therefore to heat up, will 
the entropy of natural processes therefore decrease? Will 
the inhabitants of that universe observe time to be running 
backward? 

12. The hydrogen in the universe contains a small fraction of 
deuterium. Assuming the deuterium originated in the Big 
Bang, what era of the Big Bang would we learn about by 
measuring the deuterium abundance? Can we accomplish 
this measurement using terrestrial hydrogen? What 
properties of deuterium could we use to determine its presence in 
distant regions of the galaxy? 

13. Between $t = 1$ s and $t = 6$ s, the neutron fraction should 
drop from 27 to 8%; instead it drops only to about 16%. 
Why don’t more neutrons turn into protons during this era? 
Is it as difficult for protons to turn into neutrons? 

14. If we were able to observe the neutrinos from the early 
universe, would they have a spectrum determined by the 
Planck distribution? 


Problems 

15.1 The Expansion of the Universe 

1. Use Hubble’s law to estimate the wavelength of the 590.0 nm 
sodium line as observed emitted from galaxies whose distance 
from us is (a) $1.0 \times 10^6$ light-years; (b) $1.0 \times 10^9$ 
light-years. 

2. The light from a certain galaxy is red-shifted so that the 
wavelength of one of its characteristic spectral lines is doubled. 
Assuming the validity of Hubble’s law, calculate the 
distance to this galaxy. 

15.2 The Cosmic Microwave Background Radiation 

3. (a) Taking $u(E) = E\,N(E)$ as the energy density of the thermal 
radiation, with $N(E)$ given in Eq. 15.6, differentiate 
to find the energy at which the maximum of the radiation 
energy spectrum occurs. (b) Evaluate the peak photon energy 
of the 2.7-K microwave background. 

4. Starting with Eqs. 15.7 and 10.41, show how to evaluate the 
numerical constants that appear in Eqs. 15.8 and 15.9. 

15.3 Dark Matter 

5. Suppose an observer in a distant galaxy were observing 
the light from our Sun as the Sun moves directly toward 
the observer. Neglecting any net relative motion of the two 
galaxies, calculate the change in wavelength of the 121.5-nm 
Lyman series line due to the rotation of our galaxy. 

15.4 The General Theory of Relativity 

6. In Example 15.1 we calculated the change in wavelength of 
the Lyman $\alpha$ line due to the gravitational red shift. Compare 
this value with (a) the special relativistic Doppler shift due 
to the rotation of the Sun and (b) the thermal Doppler broadening 
(see Eq. 10.30). The Sun’s radius is $6.96 \times 10^8$ m, its 
rotational period is 26 days, and its surface temperature is 
6000 K. 

7. A satellite is in orbit at an altitude of 150 km. We wish to 
communicate with it using a radio signal of frequency 
$10^9$ Hz. What is the gravitational change in frequency 
between a ground station and the satellite? (Assume $g$ 
doesn’t change appreciably.) 

8. According to the uncertainty principle, what is the minimum 
time interval necessary to measure a change in 
frequency of the magnitude observed in the Pound and Rebka 
experiment? 

15.5 Tests of General Relativity 

9. By drawing analogies between the Coulomb force law and 
the gravitational force law, use Eq. 6.8 for the deflection in 
Rutherford scattering to obtain Eq. 15.22 for the deflection 
of photons. Assume the photon behaves as if it has a mass 
$m = E/c^2$. (Hint: Write Eq. 6.8 in terms of the velocity of 
the particle instead of kinetic energy.) 

10. In the binary star system known as PSR 1913+16, two 
neutron stars move about their common center of mass in 
highly elliptical orbits. Locate the orbital parameters for this 
motion, and add a row to Table 15.1 showing the precession 
angle expected from general relativity. (Hint: In Eq. 15.25, 
$M$ is the total mass of the orbiting body and the central body.) 

15.6 Stellar Evolution and Black Holes 

11. (a) Show that Eq. 10.57 for the radius of a neutron star of 
mass $M$ can be written $R = (12.3\ \mathrm{km})(M/M_\odot)^{-1/3}$ where 
$M_\odot$ is the mass of the Sun. (b) Consider a star 1.5 times 
as massive as the Sun with a radius of $7 \times 10^5$ km (equal 
to the present radius of the Sun), rotating on its axis about 
once per year. (This is quite a slow rate of rotation; our 
Sun rotates about once per month.) If angular momentum 
is conserved in the collapse, what will be the final angular 
velocity? Assume the star can be represented as a sphere of 
uniform density, with rotational inertia $I = \tfrac{2}{5}MR^2$. 

15.7 Cosmology and General Relativity 

12. The rate of change of the cosmic expansion can be 
described in terms of a deceleration parameter 
$q = -R(d^2R/dt^2)/(dR/dt)^2$. (a) Evaluate $q$ for the 
matter-dominated universe (Eq. 15.29) and the radiation-dominated 
universe (Eq. 15.31). (b) By differentiating Eq. 15.28, 
show that in a matter-dominated universe $q = 4\pi G\rho_m/3H^2$. 
(Hint: Use $\rho_m \propto R^{-3}$ to relate $d\rho_m/dt$ to $dR/dt$.) 

15.8 The Big Bang Cosmology 

13. Derive Eq. 15.34. 

14. At what age did the universe cool below the threshold 
temperature for (a) nucleon production; (b) pi meson 
production? 

15. (a) At what temperature was the universe hot enough to permit 
the photons to produce K mesons ($mc^2 = 500$ MeV)? 
(b) At what age did the universe have this temperature? 

15.9 The Formation of Nuclei and Atoms 

16. Derive Eqs. 15.36 and 15.37. 

17. Suppose the difference between matter and antimatter in the 
early universe were 1 part in $10^8$ instead of 1 part in $10^9$. 
(a) Evaluate the temperature at which deuterium begins to 
form. (b) At what age does this occur? (c) Evaluate the 
temperature and the corresponding time of radiation decoupling 
(when hydrogen atoms form). 

18. What was the age of the universe when the nucleons 
consisted of 60% protons and 40% neutrons? 

15.10 Experimental Cosmology 

19. Assuming that the density of the universe is equal to its 
critical value and that 4.6% of the universe is baryonic 
matter, calculate the average number of baryons (nucleons) 
per cubic meter in the universe. 

20. (a) Suppose the baryonic matter in the universe were 
composed of uniformly distributed stars of the mass of the 
Sun ($2.0 \times 10^{30}$ kg). What would be the average spacing 
between the stars? Express your answer in light-years. 
(b) Suppose instead that the baryonic matter were composed 
of uniformly distributed galaxies of the mass of the Milky 
Way ($1.2 \times 10^{42}$ kg). Expressed in light-years, what would 
be the average distance between the galaxies? 

21. Suppose the non-baryonic dark matter consists entirely of 
neutrinos. What is the average rest energy of the neutrinos 
that could account for this part of the mass of the universe? 
As a rough estimate, assume that the neutrino density is the 
same as the present photon density. 


General Problems 


22. 


23; 


24. 


20% 


26. 


Photons of visible light have energies between about 2 and 
3eV. (a) Compute the number density of photons from 
the 2.73-K background radiation in that interval. (It is 
sufficient to characterize the visible region as FE = 2.5eV 
with dE = 1.0eV.) (b) Assume the eye can detect about 
100 photons/em?. At what temperature would the back- 
ground radiation be visible? At what age of the universe 
would this have occurred? 

Consider the universe at a temperature of 5000 K. (a) At 
what age did this occur, and during which stage of the 
evolution of the universe? (b) Evaluate the average photon 
energy at that time. (c) If there are 10° photons per nucleon, 
evaluate the ratio between the radiation density and the mass 
density at that time. 

The early universe was radiation dominated, and the present 
universe is matter dominated. (a) At what temperature were 
the radiation and matter densities equal? (b) What was the 
age of the universe when this occurred? 

A neutron star of 2.00 solar masses is rotating at a rate 
of 1.00 revolutions per second. (a) What is the radius of 
the neutron star? (See Problem 11.) (6) Find its rotational 
kinetic energy. (c) If its rotational speed slows by 1 part 
in 10° per day, find the loss in rotational kinetic energy 
per day. (d) Assuming that the entire energy loss goes into 
radiation, find the radiative power. (e) If the star is 104 
light-years from Earth, what would be the average power 
received by an antenna of area 10m? if the star’s energy 
were distributed uniformly in space instead of concentrated 
in a narrow beam? 

Because we don’t yet have a quantum theory of gravity, 
we cannot analyze the properties of the universe before the 
Planck time, about 10~* s. If we assume that the properties 


27. 


28. 


of the universe during that era were determined by quantum 
theory, relativity, and gravity, the Planck time should be 
characterized by the fundamental constants of those three 
theories: h,c, and G. We can therefore write t x hic’G*, 
where i,j, and k are exponents to be determined. (a) Using 
a dimensional analysis, determine i,j, and k. (b) Assuming 
the proportionality parameter is of order unity, evaluate f. 
(c) What was the size of the observable universe at the 
Planck time? 

Show that the spacetime interval given by Eq. 15.18 is 
invariant with respect to the Lorentz transformation. That is, 
show that (ds)* = (ds’)?, where (ds’)* = (cdt')? — (dx’)*. 
Light from star S in Figure 15.30 passes a distance $b$ 
from a galaxy L, where it is deflected by an angle $\alpha$ and 
then reaches the observer O, who sees an image of the 
star at I. (For simplicity, assume the gravitational deflection 
takes place at a single point, and assume all angles 
in the figure are very small.) The galaxy (of mass $M$) is 
a distance $d_L$ from the observer, and the star is a distance 
$d_S$ from the observer. (a) For small angles, show 
that $\theta d_S = \beta d_S + \dfrac{4GM}{\theta d_L c^2}(d_S - d_L)$. (Hint: Use Eq. 
15.22 for the deflection angle $\alpha$ when the impact parameter 
is $b$ instead of $R$, but double the value to account for the 
difference between the special and general relativity predictions.) 
(b) Solve the resulting quadratic equation for $\theta$ and 
show that there are two image positions whose locations 
differ by $\Delta\theta = \sqrt{\beta^2 + 4\theta_E^2}$, where the Einstein angle $\theta_E$ 
is $\sqrt{4GM(d_S - d_L)/c^2 d_S d_L}$. This is an example of gravitational 
lensing, an effect of general relativity that has been 
observed for distant objects that appear in multiple images 
when their light travels a path through spacetime that is 
curved by an intervening galaxy. (c) When the star, lensing 
galaxy, and observer lie along a single line, something other 
than two images appears. Given the symmetry of the figure 
when $\beta = 0$, what do you expect to be observed in this case? 
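
The algebra of part (b) can be checked numerically. The sketch below assumes the quadratic obtained by dividing the relation in part (a) by the star distance, namely theta^2 - beta*theta - theta_E^2 = 0, and confirms that its two roots are separated by sqrt(beta^2 + 4 theta_E^2). The galaxy mass and the two distances are hypothetical values chosen only for illustration, the solar mass is an assumed 1.99 x 10^30 kg, and the function names are not from the text.

```python
import math

G = 6.67384e-11       # N m^2/kg^2 (Appendix A)
C = 2.99792458e8      # m/s (Appendix A)
LIGHT_YEAR = 9.46e15  # m (Appendix A)
M_SUN = 1.99e30       # kg (assumed value)


def einstein_angle(mass_kg, d_lens_m, d_source_m):
    """theta_E = sqrt(4GM (d_S - d_L) / (c^2 d_S d_L)), in radians."""
    return math.sqrt(
        4.0 * G * mass_kg * (d_source_m - d_lens_m)
        / (C ** 2 * d_source_m * d_lens_m)
    )


def image_angles(beta, theta_e):
    """Both roots of theta^2 - beta*theta - theta_E^2 = 0."""
    root = math.sqrt(beta ** 2 + 4.0 * theta_e ** 2)
    return (beta + root) / 2.0, (beta - root) / 2.0


# Hypothetical lens: a 1e12 solar-mass galaxy halfway to the star.
theta_e = einstein_angle(1e12 * M_SUN, 1e9 * LIGHT_YEAR, 2e9 * LIGHT_YEAR)
beta = 0.5 * theta_e
theta_plus, theta_minus = image_angles(beta, theta_e)

print(f"theta_E = {theta_e:.3e} rad")
print(f"image separation = {theta_plus - theta_minus:.3e} rad")
```

The negative root corresponds to an image on the opposite side of the lensing galaxy from the positive one.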




















FIGURE 15.30 Problem 28. 


APPENDIX A 





CONSTANTS AND 
CONVERSION FACTORS* 


CONSTANTS 

Speed of light                 $c$                    $2.99792458 \times 10^{8}$ m/s 
Charge of electron             $e$                    $1.60217657 \times 10^{-19}$ C 
Boltzmann constant             $k$                    $1.380649 \times 10^{-23}$ J/K = $8.617332 \times 10^{-5}$ eV/K 
Planck’s constant              $h$                    $6.62606957 \times 10^{-34}$ J·s = $4.13566752 \times 10^{-15}$ eV·s 
                               $\hbar = h/2\pi$       $1.054571726 \times 10^{-34}$ J·s = $6.58211928 \times 10^{-16}$ eV·s 
                               $hc$                   1239.8419 eV·nm (or MeV·fm) 
                               $\hbar c$              197.326972 eV·nm (or MeV·fm) 
Gravitational constant         $G$                    $6.67384 \times 10^{-11}$ N·m$^2$/kg$^2$ 
Avogadro’s constant            $N_A$                  $6.0221413 \times 10^{23}$ mole$^{-1}$ 
Universal gas constant         $R$                    8.314462 J/mole·K 
Stefan-Boltzmann constant      $\sigma$               $5.67037 \times 10^{-8}$ W/m$^2$·K$^4$ 
Rydberg constant               $R_\infty$             $1.097373156854 \times 10^{7}$ m$^{-1}$ 
Hydrogen ionization energy     $|E_1|$                13.6056925 eV 
Bohr radius                    $a_0$                  $5.291772109 \times 10^{-11}$ m 
Bohr magneton                  $\mu_B$                $9.2740097 \times 10^{-24}$ J/T = $5.78838181 \times 10^{-5}$ eV/T 
Nuclear magneton               $\mu_N$                $5.0507835 \times 10^{-27}$ J/T = $3.15245126 \times 10^{-8}$ eV/T 
Fine structure constant        $\alpha$               1/137.03599907 
Electric constant              $e^2/4\pi\varepsilon_0$   1.4399645 eV·nm (or MeV·fm) 


*The number of significant figures given for the numerical constants indicates the precision to which 
they have been determined; there is an experimental uncertainty, typically of a few parts in the last or 
next-to-last digit, except for the speed of light (which is exact). 
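
Several entries in the constants table are combinations of others, which gives a quick way to check the table against itself. The following sketch is only an illustration; the variable names are hypothetical. It recomputes Planck's constant in eV·s and the products hc and ħc in eV·nm from the tabulated values of h, c, and e.

```python
import math

# Values copied from the table of constants above.
C = 2.99792458e8          # speed of light, m/s
E_CHARGE = 1.60217657e-19 # electron charge, C (also J per eV)
H = 6.62606957e-34        # Planck's constant, J s

NM_PER_M = 1e9

h_ev_s = H / E_CHARGE                      # h in eV s
hbar_j_s = H / (2.0 * math.pi)             # h-bar in J s
hc_ev_nm = H * C / E_CHARGE * NM_PER_M     # hc in eV nm
hbarc_ev_nm = hc_ev_nm / (2.0 * math.pi)   # h-bar c in eV nm

print(f"h      = {h_ev_s:.8e} eV s")
print(f"hbar   = {hbar_j_s:.9e} J s")
print(f"hc     = {hc_ev_nm:.4f} eV nm")
print(f"hbar c = {hbarc_ev_nm:.6f} eV nm")
```

Because 1 eV·nm equals 1 MeV·fm, the same numbers serve for atomic-scale and nuclear-scale calculations.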
SOME PARTICLE MASSES 

            kg                              u                               MeV/$c^2$ 
Electron    $9.1093829 \times 10^{-31}$     $5.485799095 \times 10^{-4}$    0.51099893 
Proton      $1.67262178 \times 10^{-27}$    1.0072764668                    938.27205 
Neutron     $1.67492735 \times 10^{-27}$    1.0086649160                    939.56538 
Deuteron    $3.3435835 \times 10^{-27}$     2.0135532127                    1875.61286 
Alpha       $6.6446568 \times 10^{-27}$     4.001506179                     3727.3792 


CONVERSION FACTORS 

1 eV = $1.60217657 \times 10^{-19}$ J                          1 barn (b) = $10^{-28}$ m$^2$ 
1 u = 931.49406 MeV/$c^2$                                      1 curie (Ci) = $3.7 \times 10^{10}$ decays/s 
    = $1.66053892 \times 10^{-27}$ kg                          1 light-year = $9.46 \times 10^{15}$ m 
1 y = $3.156 \times 10^{7}$ s $\cong \pi \times 10^{7}$ s      1 parsec = 3.26 light-years 


APPENDIX B 





COMPLEX NUMBERS 


The imaginary number $i$ is defined as $\sqrt{-1}$. A complex number or function can be 
represented as having a real part, which does not depend on $i$, and an imaginary 
part, which depends on $i$. We can write a complex variable as $z = x + iy$, where 
the real part $x$ and the imaginary part $y$ are both real numbers or real functions. A 
complex wave function $\psi$ can be written in terms of its real and imaginary parts 
as $\psi = \mathrm{Re}(\psi) + i\,\mathrm{Im}(\psi)$. 

The complex conjugate of a complex number is obtained by substituting $-i$ 
for $i$, as in $z^* = x - iy$ or $\psi^* = \mathrm{Re}(\psi) - i\,\mathrm{Im}(\psi)$. 

The squared magnitude of a complex number is defined as the product of the 
number and its complex conjugate, as in $|z|^2 = zz^*$ or $|\psi|^2 = \psi\psi^*$, and is equal 
to the sum of the squares of its real and imaginary parts: 

$$|z|^2 = x^2 + y^2 \quad\text{or}\quad |\psi|^2 = [\mathrm{Re}(\psi)]^2 + [\mathrm{Im}(\psi)]^2$$ 


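The complex exponential $e^{i\theta}$ can be represented in terms of real trigonometric 
functions as 

$$e^{i\theta} = \cos\theta + i\sin\theta \quad\text{and}\quad e^{-i\theta} = \cos\theta - i\sin\theta$$ 

The squared magnitude of the complex exponential is equal to 1: 

$$|e^{i\theta}|^2 = e^{i\theta}e^{-i\theta} = (\cos\theta + i\sin\theta)(\cos\theta - i\sin\theta) = \cos^2\theta + \sin^2\theta = 1$$ 

We can write the ordinary trigonometric functions in terms of these complex 
functions: 

$$\sin\theta = \frac{1}{2i}\left(e^{i\theta} - e^{-i\theta}\right) \quad\text{and}\quad \cos\theta = \frac{1}{2}\left(e^{i\theta} + e^{-i\theta}\right)$$ 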


It is sometimes convenient to write the wave function in terms of a complex 
exponential as 

$$\psi = |\psi| e^{i\alpha}$$ 

where $|\psi|$ gives the magnitude of the wave function and $\alpha$ is its phase. 
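
The identities in this appendix are easy to verify numerically. The sketch below uses Python's built-in complex type and the standard `cmath` module; the sample values (3 + 4i and an angle of 0.7 rad) and the helper names are arbitrary choices made for illustration.

```python
import cmath
import math


def squared_magnitude(z):
    """|z|^2 computed as z times its complex conjugate."""
    return (z * z.conjugate()).real


def magnitude_and_phase(z):
    """Return (|z|, alpha) such that z = |z| exp(i alpha)."""
    return abs(z), cmath.phase(z)


z = 3 + 4j
theta = 0.7

# Euler's relation and the trigonometric functions written as exponentials.
euler = cmath.exp(1j * theta)
sin_from_exp = (cmath.exp(1j * theta) - cmath.exp(-1j * theta)) / 2j
cos_from_exp = (cmath.exp(1j * theta) + cmath.exp(-1j * theta)) / 2

# Polar form: rebuild z from its magnitude and phase.
mag, alpha = magnitude_and_phase(z)
rebuilt = mag * cmath.exp(1j * alpha)

print(squared_magnitude(z), z.real ** 2 + z.imag ** 2)
print(euler, complex(math.cos(theta), math.sin(theta)))
print(mag, alpha, rebuilt)
```

The first printed pair shows that the product of z and its conjugate equals the sum of the squares of the real and imaginary parts.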
TABLE OF ATOMIC MASSES 


The table gives the atomic masses of some isotopes of each element. All naturally 
occurring stable isotopes are included (with their natural abundances shown in 
italics in the last column). Some of the longer-lived radioactive isotopes of each 
element are also included, with their half-lives. Each element has many other 
radioactive isotopes that are not included in this table. More complete listings 
can be found in the sources from which this table was derived: Table of Isotopes 
(8th Edition), edited by R. B. Firestone and V. S. Shirley (Wiley, 1999); G. Audi, 
A. H. Wapstra, and C. Thibault, “The 2003 Atomic Mass Evaluation,” Nuclear 
Physics A729, 129 (2003). 
In the half-life column, My = $10^{6}$ y. 
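
One common use of the masses and natural abundances listed in the table is to compute the average atomic mass of an element as the abundance-weighted mean of its stable isotopes. The sketch below does this for chlorine using the two stable-isotope rows given in the table; the function name is hypothetical, and the conversion to MeV/c^2 uses the factor for 1 u from Appendix A.

```python
U_TO_MEV = 931.49406  # MeV/c^2 per atomic mass unit (Appendix A)


def average_atomic_mass(isotopes):
    """Abundance-weighted mean mass.

    isotopes: list of (atomic mass in u, natural abundance in percent).
    Dividing by the summed abundance guards against rounding in the
    tabulated percentages.
    """
    total_percent = sum(percent for _, percent in isotopes)
    weighted = sum(mass * percent for mass, percent in isotopes)
    return weighted / total_percent


# Stable chlorine isotopes as listed in the table: A = 35 and A = 37.
CHLORINE = [(34.968853, 75.77), (36.965903, 24.23)]

avg_u = average_atomic_mass(CHLORINE)
avg_mev = avg_u * U_TO_MEV

print(f"average atomic mass of Cl = {avg_u:.4f} u = {avg_mev:.1f} MeV/c^2")
```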
Atomic Abundance Atomic Abundance 
Z A mass (u) or Half-life Z A mass (u) or Half-life 
H 1 1 1.0078250 99.985% Na 11 21 20.997655 22.5 s 
2 2.014102 0.015% 22 21.994436 2.60 y 
3 3.016049 12.3 y 23 22.989769 100% 
He 2 3 3.016029 0.000137% ae . , 
4 4.002603 99.999863% 35 ces ‘ 
Li 3 6 6.015123 7.59% 
7 7.016005 92.41% Mg 12 22 21.999574 3.88 s 
8 8.022487 0.84 s 23 22.994124 11.35 
Be 4 7 7.016930 53.2 d oe Bree : : ees 
A F 0 
: sets i 26 25.982593 11.01% 
‘ 0 
Wl 11.021658 8s a8 27.983877 20.9 h 
B 5 8 8.024607 0.778 Al 13. -25~=—=«24.990428 ~—Ss 7.18 
? be kaes 0.85 as 26 —«-25.986892 —0..72 My 
10 10.012937 19.8% ae ag oRisas re 
a 11,009305 80.2% 28 —«-27.981910 —-2.24m 
12 12.014352 20.2 ms a6 onueneas peak. 
é 6 10 10.016853 19.3 s 
iat 11.011434 20.3 m Si 14 26 25.992330 2.23 s 
12 12.000000 98.89% 27 26.986705 4.16 s 
13 13.003355 1.11% 28 27.976927 92.23% 
14 14.003242 5730 y 29 28.976495 4.68% 
15 15.010599 2.455 30 29.973770 3.09% 
N 7 13 13.005739 9.96 m 31 30.975363 2.62 h 
14 14.003074 99.63% 32 31.974148 I32y 
15 15.000109 0.37% 
16 16.006102 71s P 15 29 28.981801 4.145 
17 17.008450 42s 30 29.978314 2.50m 
0, 
O 8 14 14.008596 70.6 s - Be ee 
15 15.003066 122 s 33 32.971726 ~—«- 253d 
16 15.994915 99.76% 
17 16.999132 0.038% 
18 17.999161 0.200% S 16 30 29.984903 1.18 s 
19 19.003580 26.9 s 31 30.979555 2.578 
20. 20.004077 13.5 320 31.972071 95.02% 
33 32.971459 0.75% 
F 9 17 17.002095 64.5 s 34 33.967867 4.21% 
18 18.000938 1.83 h 35 34.969032 87.5 d 
19 18.998403 100% 36 35.967081 0.02% 
20 19.999981 lls 37 36.971126 5.05m 
21 20.999949 42s 
Ne 10 18 18.005708 17s Cl 17 33 32.977452 2.515 
19 19.001880 17.28 34 33.973763 1.53 s 
20 19.992440 90.48% 35 34.968853 75.77% 
21 20.993847 0.27% 36 35.968307 0.30 My 
oD 21.991385 9.25% 37 36.965903 24.23% 
23 22.994467 37.25 38 37.968010 37.2m 
24 23.993611 3.4m 39 38.968008 55.6m 
Ar 18 34 33.980271 0.844 s Cr 24 48 47.954032 21.6h 
35 34.975258 1.78 s 49 48.951336 42.3 m 
36 35.967545 0.337% 50 49.946044 4.35% 
37 36.966776 35.0d si 50.944767 27.74 
38 37.962732 0.063% 52 51.940507 83.79% 
39 38.964313 269 y 53 52.940649 9.50% 
40 39.962383 99.60% 54 53.938880 2.36% 
41 40.964501 1.82h 55 54.940840 3.50m 
42 41.963046 32.9 y 56 55.940653 5.94m 
: 2 eee ee Mn 25 52 51.945565 5.59 d 
See oe 53. —-52.941290 3.7 My 
39 38.963707 93.26% 54 53,940359 312d 
40 39.963998 pies 55 54.938045 100% 
at -40.961826 6.73% 56 —-55.938905.—-2.58h 
ee OR Lh 57 -56.938285  ~—s- 85.4 
43 42.960716 22.3h 
Ca 20 38 37.976318 0.44 s Fe 26 52 51.948114 8.27h 
39 38.970720 0.86 s 53 52.945308 8.51m 
40 39.962591 96.94% 54 53.939611 5.85% 
41 40.962278 0.103 My 55 54.938293 2.74y 
42 41.958618 0.647% 56 55.934937 91.75% 
43 42.958767 0.135% 57 56.935394 2.12% 
44 43.955482 2.09% 58 57.933276 0.28% 
45 44.956187 163 d 59 58.934875 44.5 d 
46 45.953693 0.0035% 60 59.934072 1.5 My 
47 46.954546 4.54 d 61 60.936745 6.0m 
48 47.952534 0.187% 
58 57.935753 70.8 d 
Sc 21 43 42.961151 3.89 h 59 58.933195 100% 
44 43.959403 3.97h 60 59.933817 5.27Ty 
45 44.955912 100% 61 60.932476 1.65h 
46 45.955172 83.8 d 
47 46.952408 3.35d Ni 28 56 55.942132 6.08 d 
48 47.952231 43.7h 57 56.939794 35.6h 
Ti 22,44 43,959690 60 eS Se 
59 58.934347 0.076 My 
45 44.958126 3.08 h 
60 59.930786 26.22% 
46 45.952632 8.25% 
61 60.931056 1.14% 
47 46.951763 7.44% 
62 61.928345 3.63% 
48 47.947946 73.72% 
; 63 62.929669 100 y 
49 48.947870 5.41% 
: 64 63.927966 0.93% 
50 49.944791 5.18% pas ene nee 
51 50.946615 5.76 m 
SR Cu -29—Ss«6L_S~Ss«iO. 933458333 
V 23 48 47.952254 16.0 d 62 61.932584 9.67 m 
49 48.948516 329 d 63 62.929597 69.17% 
50 49.947158 0.250% 64 63.929764 12.7h 
51 50.943960 99.750% 65 64.927789 30.83% 
52 51.944775 3.74 m 66 65.928869 512m 
53 52.944338 1.60 m 67 66.927730 61.8h 
Zn 30 62 61.934330 9.19 h Br 35 77 76.921379 57.0 h 
63 62.933212 38.5m 78 77.921146 6.46 m 
64 63.929142 48.6% 79 78.918337 50.69% 
65 64.929241 244d 80 79.918529 17.7m 
81 80.916291 49.31% 
66 65.926033 27.9% 
a becsieg aie 82 81.916804 35.3h 
= ee ey pee 83 82.915180 2.40 h 
69 68.926550 56m Kr 36 76 75.925910 14.8h 
70 69.925319 0.62% 77 76.924670 74.4m 
71 -70.927722-2.45m 78 77,920365 (0.35% 
79 78.920082 35.0h 
Ga 31 67 66.928202 3.26 d 80 79.916379 2.28% 
68 67.927980 67.7m 81 80.916592 0.229 My 
5 82 81.913484 11.58% 
ie pee peti 83 82.914136 11.49% 
7 papas g08bes 84 83.911507 57.00% 
aliens 85 84.912527 10.8 y 
72 ——-71.926366  14.1h 86  -85.910611 —-17.30% 
73 72.925175 4.86h 87 86.913355 76.3 m 
Ge 32 68 67.928094 271d Rb 37 83 82.915110 86.2 d 
69 68.927965 39.0h 84 83.914385 33.1.d 
70 69.924247 20.4% 85 84.911790 72.17% 
71 —-70.924951 «11.4 a els, 6 TR 
87 86.909181 27.83% 
ie Teenie e730 88 87.911316 17.8m 
2B 72.923459 7.8% 
74 73.921178 36.7% Sr 38 82 81.918402 25.6d 
75 74.922859 82.8 m 83 82.917557 32.4h 
16 75921403 7 8% 84 83.913425 0.56% 
77 76923549 11.3h a | 
86 85.909260 9.86% 
As 33 2B 72.923825 80.3 d 87 86.908877 7.00% 
75 -74.921596 -100% ie ae : 
16 75.922394 26.3h ; 
71 76.920647 38.8h Y 39 87 86.910876 79.8h 
88 87.909501 106.6 d 
Se 34 iP) 71.927112 8.4d 89 88.905848 100% 
73 72.926765 71h 90 89.907152 64.1 h 
74 73.922476 0.89% 91 90.907305 58.5 d 
75 74.922523 120d Zr 40 88 87.910227 83.4d 
76 75.919214 9.4% 89 88.908890 78.4h 
77 76.919914 7.6% 90 89.904704 51.45% 
78 77.917309 23.8% 91 90.905646 11.22% 
719 78.918499 0.30 My 92 91.905041 17.15% 
80 79.916521 49.6% 93 92.906476 1.53 My 
81 -80.917992,«18.5m 2 ee 
89 81.916699 2704 95 94.908043 64.0 d 
83 82.919118 223m 7 eee 2G 
97 96.910953 16.7h 
Nb 41 91 90.906996 680 y Pd 46 100 99.908506 3.63 d 
92 91.907194 35 My 101 100.908289 8.47 h 
93 92.906378 100% 102 101.905609 1.02% 
94 93.907284 20,300 y 103 102.906087 17.0d 
95 94.906836 35.0d 104 103.904036 11.14% 
105 104.905085 22.33% 
Mo 42 90 89.913937 5.56h 106 105.903486 27.33% 
91 90.911750 15.5m 107 106.905133 «6.5 My 
92 91.906811 14.8% 108 107.903892 26.46% 
93 92.906813 4000 y 109 108.905950 13.7h 
94 93.905088 9.3% 110 109.905153 11.72% 
95 94.905842 15.9% 111 110.907671 23.4m 
96 95.904679 16.7% 
97 96.906021 9.6% 
98 97.905408 24.1% Ag 47 105 104.906529 413d 
99 98.907712 65.9 h 106 105.906669 24.0 m 
100 99.907477 9.6% 107 106.905097 51.84% 
101 100.910347 14.6m 108 107.905956 237m 
109 108.904752 48.16% 
Tc 43 95 94.907657 20.0 h 110 109.906107 24.65 
96 95.907871 43d 
97 96.906365 4.2 My 
98 97.907216 4.2 My Cd 48 104 103.909849 57.7m 
99 98.906255 0.211 My 105 104.909468 55.5 m 
100 99.907658 15.85 106 105.906459 1.25% 
107 106.906618 6.50 h 
Ru 44 94 93.911360 51.8 m 108 107.904184 0.89% 
95 94.910413 1.64h 109 108.904982 461d 
96 95.907598 5.5% 110 109.903002 12.5% 
97 96.907555 2.79 d 111 110.904178 12.8% 
98 97.905287 1.86% 112 111.902758 24.1% 
99 98.905939 12.8% 113 112.904402 12.2% 
100 99.904219 12.6% 114 113.903359 28.7% 
101 100.905582 17.1% 115 114.905431 53.5h 
102 101.904349 31.6% 116 115.904756 75%, 
103 102.906324 39.3d 117 116.907219 2.49 h 
104 103.905433 18.6% 
105 104.907753 4.44h 
In 49 111 110.905103 2.80 d 
Rh 45 101 100.906164 3.3 y 112 111.905532 15.0m 
102 101.906843 207 d 113 112.904058 4.29% 
103 102.905504 100% 114 113.904914 71.98 
104 103.906656 42.3 s 115 114.903878 95.71% 
105 104.905694 35.4h 116 115.905260 14.15 
Sn 50 110 109.907843 41lh Xe 54 122 121.908368 20.1h 
111 110.907734 35.3m 123 122.908482 2.08 h 
112 111.904818 0.97% 124 123.905893 0.095% 
113 112.905171 115.1d 125 124.906395 16.9h 
114 113.902779 0. noe 126 125.904274 0.089% 
7 7 ngs So 
117 116.902952 7.68% 28 celcdasiale par 
118 117.901603 24.22% og ene? mone 
119 118.903308 8.59% 130 129.903508 = 4.07% 
120. -119,902195 32.58% = Se ee 
1st 120.904235 27.0 h 132 131.904153 26.91% 
122 121.903439 4.63% 133 132.905911 5.24d 
123 122.905721 129d 134 133.905394 10.44% 
124 123.905274 5.79% 135 134.907227 9.14h 
125 124.907784 9.64d 136 135.907219 8.86% 
126 125.907653 0.23 My 137 136.911562 3.82 m 
a ee 7 ‘ac . _ Cs 55 ~—«131-—~—Ss«*:30,905464 «9.69. d 
121 120.903816 57.21% a ee eeed 
122 -121.905174 2.72. ea 
123 122.904214 42.79% pe (Rete ay 
124 123.905936 60.1 d 135 134.905977 2.3 My 
125 124.905254 2.76 y 136 135.907312 13.0 d 
Te 52 118 117.905828 6.00 d Ba 56 128 127.908318 2.43 d 
119 118.906404 16.1h 129 128.908679 2.23 h 
120 119.904020 0.09% 130 129.906321 0.106% 
122 121.903044 2.55% 132 131.905061 0.101% 
7 3 iain? 085 
125 124,904431 7.07% oe pce ae 
126 ——-125.903312 18.84% a ich ea 
127 126.905226 —-9.35h Be eee 
128 127.904463 31.74% 137 136.905827 11.23% 
129 128.906598 69.6 m 138 137.905247 71.70% 
130 129.906224 34.08% 139 138.908841 83.1m 
131 130.908524 25.0m 
La 57 136 135.907636 9.87m 
126 125.905624 12.9 ae ‘aunts 0.090% 
i: sea . a 139 -—-138.906353 «9.910% 
129 128904988 15.7 My 80 secon: sabi 
130 129.906674 12.4 h 141 140.910962 3.92 h 
Ce 58 134 133.908925 76h Eu 63 149 148.917931 93.1 d 
135 134.909151 17.7h 150 149.919702 36.9 y 
a ate I aes 151 ——-150.919850 47.81% 
rl nana ee 152 151.921745 13.5 y 
4 e 0 0. 
139 —«138,906653—s«137.6d eS Dei See 
140 139.905439 88.45% 154 153.922979 8.59 y 
141 140.908276 32.5d 155 154.922893 4.75 y 
142 141.909244 11.11% 156 155.924752 15.2d 
143 142.912386 33.0h 
144 143.913647 285 d Gd 64 150 149.918659 1.79 My 
Pr 59 139 138.908938 4.41 h 151 150.920348 124d 
140 139.909076 3.39 m 152 151.919791 0.20% 
141 140.907653 100% 153 152.921750 240 d 
142 141.910045 19.1h 154 153.920866 2.18% 
143 142.910817 13.6d 155 154.922622 14.80% 
Nd 60 140 139.909552 3.37d 156 155.922123 20.47% 
141 140.909610 2.49 h 157 156.923960 15.65% 
142 141.907723 27.2% 158 157.924104 24.84% 
143 142.909814 12.2% 159 158.926389 18.5h 
144 143.910087 23.8% 160 159,.927054 21.9% 
145 144.912574 8.3% 161 160.929669 3.66 m 
146 145.913117 17.2% 
ao Th 65 157 ~—«:156.924025.—71y 
149 148.920149 73h 158 157.925413 180 y 
150 149.920891 5.6% 159 158.925347 100% 
151 150.923829 12.4m 160 159.927168 72.3 d 
161 160.927570 6.91d 
Pm 61 143 142.910933 265d 
lad 439125913638 yy gg sd 53.924408 3.0My 
‘ ; 155 154.925754 9.9h 
146 145.914696 5.53 y o ie pom 
147 146.915139 2.62 y 06% 
148 147.917475 5.37 d 157 156.925466 8.1h 
149 148.918334 53.1h 158 157.924409 0.10% 
Sm 62 142 141.915198 72.5 m 159 158.925739 144.4 d 
143 :142.914628 8.75 m me eee Be 
144 143.911999 3.1% 161 160.926933 18.9% 
145 144.913410 340 d 162 161.926798 25.5% 
146 145.913041 103 My 163 162.928731 24.9% 
147 146.914898 15.0% 164 163.929175 28.2% 
0, 
148 147.914823 11.2% 165 164.931703 2.33 h 
149 148.917185 13.8% 
7 see ee Ho 67 163 = 162.928734 4570 
y 
152 -:151.919732 26.7% | EE cae cada. 
153 152.922097 46.3h 165 164.930322 100% 
154 153.922209 22.7% 166 165.932284 26.8 h 
155 154.924640 22.3m 167 166.933133 3.0h 
Er 68 160 159.929083 28.6 h W 74 178 177.945876 21.6 d 
161 160.929995 3.21h 179 178.947070 37.0 m 
162 161.928778 0.14% 180 179.946704 0.12% 
163 162.930033 75.0 m 181 180.948197 121d 
164 163.929200 1.60% 182 181.948204 26.5% 
165 164.930726 10.4h 183 182.950223 14.3% 
166 165.930293 33.50% 184 183.950931 30.6% 
167 166.932048 22.87% 185 184.953419 75.1d 
168 167.932370 26.98% 186 185.954364 28.4% 
169 168.934590 9.39d 187 186.957160 23.7h 
170 169.935464 14.91% 
171 170.938030 7.52h Re 75 183 182.950820 70.0 d 
Tm 69 167 166.932852 925d eS ee Ce 
168 :167.934173 «93.1.4 me, ee Ee 
186 185.954986 3.72 d 
169 168.934213 100% 
187 186.955753 62.60% 
170 169.935801 128.6d 188 187.958114 170h 
171 170.936429 1.92 y : : 
Yb 70 166 165.933882 56.7 h Os 76 182 181.952110 22.1h 
167 166.934950 17.5 m 183 182.953126 13.0h 
168 167.933897 0.13% 184 183.952489 0.02% 
169 168.935190 32.0 d 185 184.954042 93.6 d 
170 169.934762 3.0% 186 185.953838 1.6% 
171 170.936326 14.3% 187 186.955750 1.6% 
172 171.936381 21.8% 188 187.955838 13.3% 
173 172.938211 16.1% 189 188.958147 16.2% 
174 173.938862 31.8% 190 189.958447 26.4% 
175 174.941276 4.19d 191 190.960930 15.4d 
176 175.942572 12.8% 192 191.961481 40.9% 
177 176.945261 1.9h 193 192.964152 30.1 h 
m m eA ace i Ir 77 189 188.958719 13.2d 
; 3y 
175 174.940772 97.41% 190 189.960546 11.8d 
176 175.942686 2.59% 191 190.960594 37.3% 
177 176.943758 6.65 d 192 191.962605 73.8d 
193 192.962926 62.7% 
Hf 72 172 171.939448 1.87 y 194 193.965078 19.3h 
173 172.940513 23.6 h 
174 173.940046 0.16% Pt 78 188 187.959395 10.2d 
175 174.941509 70d 189 188.960834 10.9h 
176 175.941409 5.26% 190 189.959932 0.014% 
177 176.943221 18.60% 191 190.961677 2.86d 
178 177.943699 27.28% 192 191.961038 0.78% 
179 178.945816 13.62% 193 192.962987 50y 
180 179.946550 35.08% 194 193.962680 32.97% 
181 180.949101 424d 195 194.964791 33.83% 
196 195.964952 25.24% 
180 179.947465 0.012% 198 197.967893 716% 
181 180.947996 99.988% 199 198.970593 30.8 m 
182 181.950152 114d 





Au 79 195 194.965035 186d 
196 195.966570 6.17 d 
197 196.966569 100% 
198 197.968242 2.696 d 
199 198.968765 3.14d 
Hg 80 194 193.965439 444 y 
195 194.966720 10.5h 
196 195.965833 0.15% 
197 196.967213 64.1 h 
198 197.966769 10.0% 
199 198.968280 16.9% 
200 199.968326 23.1% 
201 200.970302 13.2% 
202 201.970643 29.9% 
203 202.972872 46.6d 
204 203.973494 6.9% 
205 204.976073 5.1m 
Tl 81 201 200.970819 72.9h 
202 201.972106 12.2d 
203 202.972344 29.52% 
204 203.973864 3.78 y 
205 204.974428 70.48% 
206 205.976110 4.20m 
Pb 82 202 201.972159 53,000 y 
203 202.973391 51.9h 
204 203.973044 1.4% 
205 204.974482 17.3 My 
206 205.974465 24.1% 
207 206.975897 22.1% 
208 207.976652 52.4% 
209 208.981090 3.25h 
Bi 83 207 206.978471 32.9 y 
208 207.979742 0.368 My 
209 208.980399 100% 
210 209.984120 5.01 d 
211 210.987269 2.14m 
Po 84 207 206.981593 5.80 h 
208 207.981246 2.90 y 
209 208.982430 102 y 
210 209.982874 138.4d 
At 85 209 208.986173 5.41h 
210 209.987148 8.1h 
211 210.987496 7.21h 
Rn 86 211 210.990601 14.6h 
222 222.017578 3.82 d 
Fr 87 212 211.996202 20.0 m 
223 223.019736 22.0 m 
Ra 88 223 223.018502 11.43 d 
224 224.020212 3.63 d 
225 225.023612 14.9d 
226 226.025410 1600 y 
Ac 89 225 225.023230 10.0d 
226 226.026098 29.4h 
227 227.027752 21.77 y 
Th 90 228 228.028741 1.91y 
229 229.031762 7340 y 
230 230.033134 75,400 y 
231 231.036304 25.52h 
232 232.038055 100% 
233 233.041582 21.8 m 
Pa 91 230 230.034541 17.4d 
231 231.035884 32,800 y 
232 232.038592 131d 
234 234.043308 6.70 h 
U 92 233 233.039635 0.1592 My 
234 234.040952 0.2455 My 
235 235.043930 0.720% 
236 236.045568 23.42 My 
237 237.048730 6.75 d 
238 238.050788 99.274% 
239 239.054293 23.5 m 
Np 93 236 236.046570 0.154 My 
237 237.048173 2.14 My 
238 238.050946 2.117 d 
Pu 94 238 238.049560 87.74 y 
239 239.052163 24,100 y 
240 240.053814 6561 y 
241 241.056851 14.3 y 
242 242.058743 0.375 My 
Am 95 241 241.056829 432 y 
242 242.059549 16.0h 
243 243.061381 7370 y 
Cm 96 246 246.067224 4760 y 
247 247.070354 15.6 My 
248 248.072349 0.348 My 
Bk 97 247 247.070307 1380 y 
Cf 98 251 251.079587 898 y 
254 254.087323 60.5 d 
Es 99 252 252.082979 472d Sg 106 261 261.116117 0.23 s 
Fm 100 257 257.095105 100.5 d Bh 107 262 262.122892 0.10s 
oe a Ae ee te Hs 108 264.—264.128395 0.8 ms 
ee ee ee Mt 109 .266-——«(266.137299-—1.7ms 
Lr 103 260 260.105504 3.0m 
Ds 110 270 270.144720 0.5 ms 
Rf 104 261 261.108767 65s 
Rg 111 272 272.153615 4ms 
Db 105 262 262.114084 35s 


ANSWERS TO ODD-NUMBERED PROBLEMS 


Chapter 1 


1. 4.527 x 10° m/s 
3. (a) —7.79 x 10° m/s 
(b) 1.008 x 10713 J,3.995 x 10-8 J 
5. (a) 2.13 x 10° m/s 
(b) 1.28 x 10° m/s 
7. 4.34 x 1075 
9. 6.1 x 10-°N 
11. 35.3°, v/V/2, 0/76 
13. 2.47 x 10° m/s, —0.508 x 10° m/s 
15. (a)0.0104eV (b) 2550m 
17. 4.61 x 10! rad/s 


Chapter 2 


1. 101 km/h at 62° east of south 
3. 7x 10*m/s 
5. 2.6 x 108 m/s 
7. (a)357.1ns (b)103m = (c) 28.8m 
11. 0.402c 
13. 5.0 x 10’ m/s 
17. +0.937c, —0.572c 
19. (a) +0.508 μs (b) −81.5 m 
21. (a) 2:00 P.M. (b) 3:00 P.M. 
(c) 1:00 P.M. and 3:00 P.M. 
23. 8y 
25. (a) Ki = K. =0.512me* 
(b) K, = K, = 0.458mc* 
27. 0.958c 
29. v < 0.115c 
33. (a) 773 MeV/c? (b) 1227 MeV 
35. 4.4.x 107! kg 
37. p. = 10.5 MeV/c, K, = 10.0 MeV 
Py = 137-4MeV/c, K, = 10.0MeV 
39. (a) 1268.1MeV_ (b) 298.8 MeV/c 
(c) 1232 MeV 
41. 0.981c 


43. 6438s 
45. (a) 0.99875c (b) 400.5 y 
47. 2.34 km, 1.07 μs 
49. (a) 0.648 μs (b) 335 m 
51. (a) $E = mc^2/\sqrt{(1 - u^2/c^2)(1 - v^2/c^2)}$, 
    $p = m\sqrt{u^2 + v^2 - u^2v^2/c^2}\,/\sqrt{(1 - u^2/c^2)(1 - v^2/c^2)}$ 
    (b) $m^2c^4$ 
53. (a) 3.1 MeV (b) 7.8 MeV 
55. 0.508c 
57. 267.0 MeV, 28.9 MeV 


Chapter 3 


ie 
oF 
5. 


11. 
19. 
21. 
23. 
20: 


31. 
33. 
3D: 
37. 
39. 


41. 


1.32mm 

(a) 0.388nm (b) 7.2° 

(a) 1.00 x 107 eV/c, 5.33 x 1077! kg- m/s 
(b) 2.5 x 104 eV/c, 1.3 x 107% kg- m/s 
(c) 1.2eV/c,6.6 x 10-2 kg- m/s 

(d) 6.2 x 10-7 eV/c, 3.3 x 10-4 kg m/s 
(a)0.124nm (b) 1.24 x 10-3 nm 

(c) 1.8eV to 3.5eV 

0.964 V 

(a) 4.88 eV 

1.1mm, 1.1 x 10-7 eV 

0.33 W 

(a) 1.9 x 10°W/m? (b) 0.26% 

(a) 10.33keV (b) 0.06 keV 

3.9 x 10-+eV 

1.17eV 

2.28 eV, 4.10 x 1075 eV/s 

(a) 2.52pm  (b) 0.405 

2.724K 

(a) 5.79 x 10719 W/m?, 1.91 x 107!) W/m? 
(b) 1.81 x 108/s,2.97 x 10’/s 

4.1 m/s 
Chapter 4 
1. (a) 13 fm (b) 0.025 fm (c) 0.73 nm 
3. (a) 0.143c (b) −9.72 MV 
5. (a) +0.010V (b)+100V_ (c)+1.0 x 10°V 
7. (a)88MeV (b)4.2MeV_ (c) 1.1 MeV 
9. 33nm 
11, 15.2°G = 1), 31.8" = 2), 52.2°(n = 3) 
13. (a)660m (6) 0.33m 
(c)0.017mm_ (d) 0.050 Hz 
15. 8.4 x 104 Hz 
17. 5.8nm 
19. 33MeV 
21. 25% 1 ey 
23. 0.052 MeV 
29. Veroup = L Si giaae 


31. (b) 1.50 eV, 6.00 eV, 13.5eV 
33. (a)0.0279eV  (b) 3.3nm 
35. (a)0.71MeV  (b) 0.66 MeV 
37. (a) 135MeV  (b) 4.87 x 10-24 
39. 3x 10-9m 


(c) 2.1K 


(c) 1.46 fm 


Chapter 5 


1. (b) —(B/m)./2H/g,H(1 + B/mg) 
3. lleV 
5. 150 eV, 600eV, 1350eV 
7. (@)c=ab*,d=a(b+1) 
(b) w = a(2b+ 1) 

9. U(x) = —h’b/mx, E = —hb*/2m 
11. 10.1eV, 18.9eV 
15. (a) 2.63 x 107° 
17. 5.00Ep, 10.00E, 
19, 385685, 98); 11E,; 128 ys 
21. (b) /3hay/k, Shay /k 
23. (a)0 (b) hagm/2 
25. A*dx,0.36847dx 
27. B=D=-AJ/E/(( —£) 

31. (a)2160eV_ (b) 4.70 x 10% eV/c 
(c) 4.2 x 1073 nm 

35. (b) h?n* /4L? 

37. 0.157 


(b) 0.0106 (c) 5.42 x 1073 





Chapter 6 


3. (a) 6.57 x 10! Hz, 45.7 nm 
(b) 3.48 x 10! Hz, 86.2nm 

5. (a)22.8fm (6) 55.0 fm 

(c) 4.14 MeV, 0.86 MeV 

14 fm 

9. (a) 8.4MeV 
(c) 6.00 fm 


a 


(b) 1.61 fm 
(d) 8.3 x 10~° 


11. 
13. 
153 
17. 
19. 
21. 
23. 
21 
31. 
33. 


ee 
ote 
39. 
41, 


0.63 MeV 

28.2 fm, 19.9 fm, 38.9° 

121.51 nm, 102.52 nm, 97.21 nm 

2279 nm 

91.13 nm, 820.1 nm 

AE = 0.306 eV, 0.97 eV, 2.86 eV, 13.1 eV 
7T4eV 

—54.40 eV, —13.60 eV, —6.04 eV, —3.40 eV 
2.10eV 

(a) 6.58 x 10! Hz, 7.72 x 10!? Hz 

(b) 6.58 x 10° Hz, 6.68 x 10° Hz 

(a)15 (6)5 (c)l 

7x 10-8 eV 

48.23 nm 


(a)0.440nm_ (b) 11.3% 


Chapter 7 


13. 
15. 
17. 
21. 
23 


25. 
2h. 


29. 
31. 
33. 


SIO W 


—me* /32n7 eh" 

0.0108 

35°,66°, 90°, 114°, 145° 

(a) 0, 1, 2,3,4,5 (6) +6 to —6 in integer steps 
(c)5 (d)4 

(a)0,0 (b)0,3.2 x 107! 

(c) 0,3.2 x 10-!! 

(d) 1.1 x 10-!!,2.2 x 107! 

Gs J/5)aq 

0.0054 

Minima: 55°, 125° Maxima: 0°, 90°, 180° 
3s, 2s, ls, 3d 

(a) 7s, Tp, 7d, 1f, 7g, 7h, Ti 

(d) 6p, Of, Sp, Sf, 4p, Af, 3p, 2p 

(a) 656.112 nm, 656.182 nm, 656.042 nm 
1.89 eV + 2.55 x 10> eV, 

1.89eV + 1.95 x 107° eV 

0.651, 0.440 

6g, 5a 

3.3mm 





Chapter 8 


1. 


3: 


(a) (2, 1, +1,+41/2), 2,1, +1, =1/2), .«. 
(b) 36 (c)30 (d) 36 

(a) 14 (b)+3/2 (ec) +8 

(d)+2 (e)+10 

(a) N, P, As, Sb, Bi 

(b) Co, Rh, Ir, Mt 

(a) [Ar]4s?3d®  (b) +2 

(+2 (d) 43,41 

—1.51eV, —0.85 eV 


11. 


13: 
15, 
17. 
19. 


23. 
25. 


(a)0.045eV  (b) 3.374 eV, 4.373 eV, 4.750 eV 
(c) 3.54 eV, 2.02 eV 

3.68 eV, 15.5 eV, 63.7 eV 
(a)5,1 (6)2,4 (c)2,1 
0, 1, 2, 3, 4; 0,1 

(a) 1.12 x 10!° photons/s 
(a) 1.84 (6) 1.00 
1.40eV!/?, 6.4 

(a) 670.8nm (6) 58.4nm 
(c) 230 nm, 50.6 nm 


(b) 763 V/m 


Chapter 9 


15.4eV 

(a) 4.25 eV/molecule 
(c) 9.80 eV/molecule 
(a) Li, (c)CO 
5.1leV 


(a) 30.9 x 10-°°C-m_ = (b) 88% 


. 42.7% 


4.689 x 10° eV/nm? 

(a) 0.2626eV = _(b) 0.2579 eV 
23.1mm, 11.6mm, 7.71 mm 
2.00 x 10-% eV 


. Af —2BL4+1),Af +2BL 


(a) 0.27 eV, 0.23 eV, 0.19 eV 

(b) 4.56 eV, 4.60eV 

(a)0.317eV  (b) 2.37 x 107! eV/m? 
(c) 2.1 x 10-7 eV 

(a)8.2x 10-2? (b) 6.8 x 107-7 

(a) 2.99, 4.97, 6.91 

(b) 2.94, 4.70, 6.18 

(a) 194.4 weV 


Chapter 10 


1eS) 


11. 
133 


17. 
19. 
21. 


23. 
25. 


(a)3 (b)3,6,1 (c) 20%, 40% 
(a)6 (b)2 (c)4,2 

(a) +3 (1), +2 (2), +1 (3), 0 (4),... 
(b) 3 (7), 2 (5), 1 (3), 0 (1) 

(c) +3 (1), +2 (1), +1 (2), 0 (2), ... 
2n (2s + 1)m/h* 
2.7 x 1073 m-3 
0.65, 0.29, 0.06 
(a) 0.0379 eV 
(a) 554 weV (6) 0.066 weV 
(a) 10? atm (6) 2.9K 

(a) 1.84 x 107 eV/m? 

(b) 0.035 (c)2.3 x 107!” 
7.04 eV, 4.22 eV 

(a) 4.37 x 10-* mr 


(b) 2.78 x 10?! 


(b) 6.00 x 10-° m3 


2h 
29. 
31. 
(b) 8.6 km (c) $2.2 \times 10^{18}$ kg/m$^3$ 
0.078 /K 
(a) eT E/kT (b) Rem ie 1) 


(c) NE/(e8/*? + 1) 
(d) R(E/KT)?e®/*" /(eF/*? + 1)? 





33. (a) 0.3295, 0.3333, 0.3372 
35. 0.12eV 
37. 17.1 MeV, 22.9 MeV 
39. (a) 2.83 x 10-3nm = (b) 1.39 x 10-3 nm 
41. (a)2.78+0.06K (6) 0.0137 + 0.0017 
Chapter 11 
1. (a 1/8 (b)a=2r_ (c) 0.5236 
5. (a)6.82eV (b)6.45eV  (c)3.27eV 
7. —8.96eV, 1.00 eV 
9. 0.255nm 
11. 3.23K 
13. 9LIK 
15. (a)7.09eV (b6)0.11eV 
17, 23% 10 a> 
19. 7080 K 
21. 426W/K-m 
23. 790 
25. (a)0.20MeV (6) 6.3mm 
27. 603 MHz 
29, (a) 1,9 x 10-%, 3.5 x 10-* 
31. (a) —0.094eV  (b) —0.040 eV 
33, 42% 107° 
35. 6.8 x 107° mA 
37. (a) 57% and 43%  (b) 89% and 11% 
39. (a) 0.4997, 0.5003 (5) 0.480, 0.520 
41. (a)10.8x10-© (b) —45.3 x 10-6 
49. (a) 135K (b) 165K, 1.40m, 
51. (a) 1.00 x 10° (b) 1.00 x 10°, 1.00 x 1077 
(c) 0.662 keV 
Chapter 12 
1. @)"OFio (BAU (0) YAR. 
3. (a) 15MeV_ (6) 824MeV 
5. (a) 1636.4 MeV, 7.868 MeV 
(b) 1118.5 MeV, 8.410 MeV 
7. 7.718 MeV, 8.482 MeV 
9. (a) 19.814MeV  (b) 15.958 MeV 
11. 2.5 x 1077 fm 
13. 1°Dy: yes! Dy: no 
15. (a)35min (b)0.020min"! (c) 46s7! 
17. 0.0598 
19. (a)0.85uCi (6) 0.13% 
21. (a) Yes (b)No_ (c) Yes 
25. 0.864 MeV 
27. 0.412 MeV, 1.088 MeV, 0.676 MeV 

29. 100.1 keV, 300.9 keV, 200.8 keV, ... 

31. (a)6 (b)4 (c)42.659MeV_ (d) 27.8 uwW 
33. 12.6 fm 

35. 37.5 wCi 

39. 1.93 mW 

41, 1.04 x 10-29 s—!,9.0 x 108 s-! 

43. (a) 7.52 x 10-*MeV_ (b) 3.26 x 10-4 MeV 
45. (a)4.67x 10-°eV_ (b) 1.95 x 10-3 eV 


(c) 0.097 mm/s 
Chapter 13 
1. (a) 1Hy (c) ae 
3. 98b 
5. 8.6x 10-*b 
7. (a)0.5 (b)0.75 (cc) 0.9325 
11. 1.58 wCi 


13. (a)—10.313MeV  (b)2.314MeV 
15. 12.201 MeV, 11.152 MeV or 

22.706 MeV, 0.647 MeV 
17. (a) 235.3MeV_ (b) 202.0 MeV 
19. (a) 6.546MeV_ (b) 4.807 MeV 
21. (a) 1.943 MeV, 1.199 MeV, 7.551 MeV,... 
23. 14.1 MeV 
25. 7.274MeV 
31. (a) 5.594MeV 
33. 4.49 b, 2.20b 
35. 8.662 MeV 
37. 9.043 MeV 
39. (a)3.2 x 108J 


(b) 0.56 W 


(b) 6.8 x 108J 


Chapter 14 
1. (a) Strong (b) Electromagnetic (c) Weak 
7. (a) L, (b) S 
11. (a) 1.29 keV (b) 0.21 MeV 
13. 179.4 MeV 
15. (a) 34 MeV (b) 218 MeV 
17. (a) 43 MeV, 19 MeV (b) 30 MeV, 4 MeV 
19. 387 MeV, 34.9° 
21. (a) 181 MeV (b) −605 MeV 
23. (a) 2205 MeV (b) 903 MeV 
25. (a) $u\bar{u}$ annihilation, creation of 2 $s\bar{s}$ pairs 
27. (a) $s \to u + W^-$ and $W^- \to d + \bar{u}$ 
29. cu, etc. 
31. 0.999635c, 40.2 GeV 
33. (a) 0.360c (b) 0.637c (c) 279 MeV 
35. 47 MeV 
Chapter 15 
1. (a) 590.0 nm (b) 637.1 nm 
3. (b) $6.64 \times 10^{-4}$ eV 
5. 0.089 nm 
7. 1.6 x 10-* Hz 
11. (b) 128 rev/s 
15. (a)5.8x 10!7K (6) 6.7 x 107s 
17. (a)1.0x 10°K (6)225s (c) 6600K, 1.6 x 10° y 
19. 0.27m-3 
21. 3.0eV 
23. (a)2.9x 10? y (b)1.17eV (c) 1.24 
25. (a) 9.79 km (b) $3.01 \times 10^{39}$ J 
(c) $6.02 \times 10^{30}$ J (d) $6.98 \times 10^{25}$ W 
(e) $6.21 \times 10^{-15}$ W 
SOME MILESTONES IN THE HISTORY OF MODERN PHYSICS 

1887  Albert A. Michelson and Edward W. Morley fail to detect ether. 
1896  Henri Becquerel discovers radioactivity. 
1900  Max Planck introduces quantum theory to explain thermal radiation. 
1905  Albert Einstein proposes the special theory of relativity. 
1905  Albert Einstein introduces the concept of the photon to explain the photoelectric effect. 
1911  Heike Kamerlingh-Onnes discovers superconductivity. 
1911  Ernest Rutherford proposes the nuclear atom, based on experiments of Hans Geiger and Ernest Marsden. 
1913  Niels Bohr introduces theory of atomic structure. 
1913  William H. Bragg and William L. Bragg (father and son) study X-ray diffraction from crystals. 
1914  James Franck and Gustav Hertz show evidence for quantized energy states of atoms. 
1914  Henry G. J. Moseley shows relationship between X-ray frequency and atomic number. 
1915  Albert Einstein proposes the general theory of relativity. 
1916  Robert A. Millikan measures photoelectric effect to confirm Einstein’s photon theory. 
1919  Ernest Rutherford produces first nuclear reaction that transmutes one element into another. 
1919  Sir Arthur Eddington and other British astronomers measure gravitational deflection of starlight and confirm 
      predictions of Einstein’s general theory of relativity. 
1921  Otto Stern and Walter Gerlach demonstrate spatial quantization and show necessity to introduce intrinsic 
      magnetic moment of electron. 
1923  Arthur H. Compton demonstrates change in X-ray wavelength following scattering from electrons. 
1924  Louis de Broglie postulates wave behavior of particles. 
1925  Wolfgang Pauli proposes the exclusion principle. 
1925  Samuel Goudsmit and George Uhlenbeck introduce the concept of intrinsic angular momentum (spin). 
1926  Erwin Schrödinger introduces wave mechanics (quantum mechanics). 
1926  Max Born establishes statistical, probabilistic interpretation of Schrödinger’s wave functions. 
1927  Werner Heisenberg develops principle of uncertainty. 
1927  Clinton Davisson and Lester Germer demonstrate wave behavior of electrons; G. P. Thomson independently 
      does the same. 
1928  Paul A. M. Dirac proposes a relativistic quantum theory. 
1929  Edwin Hubble reports evidence for the expansion of the universe. 
1931  Carl Anderson discovers the positron (antielectron). 
1931  Wolfgang Pauli suggests existence of neutral particle (neutrino) emitted in beta decay. 


1932  James Chadwick discovers the neutron. 
1932  John Cockcroft and Ernest Walton produce the first nuclear reaction using a high-voltage accelerator. 
1932  Ernest Lawrence produces first cyclotron for studying nuclear reactions. 
1934  Irène and Frédéric Joliot-Curie discover artificially induced radioactivity. 
1935  Hideki Yukawa proposes existence of medium-mass particles (mesons). 
1938  Otto Hahn, Fritz Strassmann, Lise Meitner, and Otto Frisch discover nuclear fission. 
1938  Hans Bethe proposes thermonuclear fusion reactions as the source of energy in stars. 
1940  Edwin McMillan, Glenn Seaborg, and colleagues produce first synthetic transuranic elements. 
1942  Enrico Fermi and colleagues build first nuclear fission reactor. 
1945  Detonation of first fission bomb in New Mexico desert. 
1946  George Gamow proposes big-bang cosmology. 
1948  John Bardeen, Walter Brattain, and William Shockley demonstrate first transistor. 
1952  Detonation of first thermonuclear fusion bomb at Eniwetok atoll. 
1956  Frederick Reines and Clyde Cowan demonstrate experimental evidence for existence of neutrino. 
1958  Rudolf L. Mössbauer demonstrates recoilless emission of gamma rays. 
1960  Theodore Maiman constructs first ruby laser; Ali Javan constructs first helium-neon laser. 
1964  Allan R. Sandage discovers first quasar. 
1964  Murray Gell-Mann and George Zweig independently introduce three-quark model of elementary particles. 
1965  Arno Penzias and Robert Wilson discover cosmic microwave background radiation. 
1967  Jocelyn Bell and Anthony Hewish discover first pulsar. 
1967  Steven Weinberg and Abdus Salam independently propose a unified theory linking the weak and electromagnetic interactions. 
1974  Burton Richter and Samuel Ting and co-workers independently discover first evidence of fourth quark (charm). 
1974  Joseph Taylor and Russell Hulse discover first binary pulsar. 
1977  Leon Lederman and colleagues discover new particle showing evidence for fifth quark (bottom). 
1981  Gerd Binnig and Heinrich Rohrer invent scanning-tunneling electron microscope. 
1983  Carlo Rubbia and co-workers at CERN discover W and Z particles. 
1986  J. Georg Bednorz and Karl Alex Müller produce first high-temperature superconductors. 
1994  Investigators at Fermilab discover evidence for sixth quark (top). 
1995  Eric Cornell and Carl Wieman produce first Bose-Einstein condensation. 
1998  Discovery of neutrino oscillations shows that neutrinos have small but nonzero mass. 
2003  WMAP satellite data reveal age and composition of universe. 


Units 





























Unit            Abbreviation   Quantity Measured      Unit               Abbreviation   Quantity Measured 
gram            g              mass                   coulomb            C              electric charge 
meter           m              length                 ampere             A              electric current 
second          s              time                   volt               V              electric potential 
newton          N              force                  ohm                Ω              electric resistance 
joule           J              energy                 tesla              T              magnetic field 
watt            W              power                  atomic mass unit   u              mass 
electron-volt   eV             energy                 curie              Ci             activity 
hertz           Hz             frequency              barn               b              cross section 
kelvin          K              temperature 
Prefixes of Units 

Prefix   Abbreviation   Value     Prefix   Abbreviation   Value 
atto     a              10⁻¹⁸     centi    c              10⁻² 
femto    f              10⁻¹⁵     kilo     k              10³ 
pico     p              10⁻¹²     mega     M              10⁶ 
nano     n              10⁻⁹      giga     G              10⁹ 
micro    μ              10⁻⁶      tera     T              10¹² 
milli    m              10⁻³      peta     P              10¹⁵ 

Some Commonly Used 
Constants and Conversion Factors 
(see Appendix A for a more complete list) 
Speed of light               c = 2.998 × 10⁸ m/s 
Electronic charge            e = 1.602 × 10⁻¹⁹ C 
Boltzmann constant           k = 1.381 × 10⁻²³ J/K = 8.617 × 10⁻⁵ eV/K 
Planck’s constant            h = 6.626 × 10⁻³⁴ J·s = 4.136 × 10⁻¹⁵ eV·s 
Avogadro’s constant          N_A = 6.022 × 10²³ mole⁻¹ 
Electron mass                m_e = 5.49 × 10⁻⁴ u = 0.511 MeV/c² 
Proton mass                  m_p = 1.007276 u = 938.3 MeV/c² 
Neutron mass                 m_n = 1.008665 u = 939.6 MeV/c² 
Bohr radius                  a₀ = 0.0529 nm 
Hydrogen ionization energy   13.6 eV 
Thermal energy               kT = 0.02525 eV ≅ 1/40 eV (T = 293 K) 

hc = 1240 eV·nm (MeV·fm) 
ħc = 197 eV·nm (MeV·fm) 
e²/4πε₀ = 1.440 eV·nm (MeV·fm) 
1 u = 931.5 MeV/c² 
1 eV = 1.602 × 10⁻¹⁹ J 
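
As a quick illustration of how the combined constants above shorten everyday calculations, the following Python sketch uses hc = 1240 eV·nm to convert a photon wavelength to an energy, and checks the tabulated thermal energy and ħc values. The function names and the 620 nm sample wavelength are chosen here for illustration only and do not come from the text; the numerical constants are the rounded values listed above.

```python
import math

# Rounded values taken from the constants list (not high-precision data).
HC_EV_NM = 1240.0       # hc in eV·nm
K_EV_PER_K = 8.617e-5   # Boltzmann constant in eV/K


def photon_energy_ev(wavelength_nm: float) -> float:
    """Photon energy E = hc / wavelength, in eV, for a wavelength in nm."""
    return HC_EV_NM / wavelength_nm


def thermal_energy_ev(temperature_k: float) -> float:
    """Thermal energy kT in eV at the given absolute temperature."""
    return K_EV_PER_K * temperature_k


# hbar*c follows from hc by dividing by 2*pi (about 197 eV·nm).
HBAR_C_EV_NM = HC_EV_NM / (2 * math.pi)

if __name__ == "__main__":
    # 620 nm is an arbitrary sample wavelength (hypothetical input).
    print(f"E(620 nm)  = {photon_energy_ev(620.0):.3f} eV")
    print(f"kT (293 K) = {thermal_energy_ev(293.0):.5f} eV")
    print(f"hbar*c     = {HBAR_C_EV_NM:.1f} eV nm")
```

Working in eV and nm (or MeV and fm) in this way avoids carrying powers of ten through the arithmetic, which is the reason the products hc and ħc are tabulated alongside h itself.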